Sec. 14.3 had used a factor in front
of the utilities, and it was said that smaller
leads to a more
random choice, while larger
leads to a stronger preference for
the best options. What happened to this
in the theoretical
treatment of this chapter?
In fact, the from Sec. 14.3 is related
to the width parameter
showing up in some equations of this
chapter. It is however not systematically treated by this text. The
reason for this is that in the maximum likelihood estimation, it does
not show up as a separate variable anyway. But what is the reason for
this now?
What happens here is that the maximum likelihood estimation
automatically includes the meaning of the prefactor or
into the other
. So if the theoretical form says
[[also: which assumptions were made? Also see in ``improvements'']]