Sec. 14.3 had used a factor in front of the utilities, and it was said that smaller leads to a more random choice, while larger leads to a stronger preference for the best options. What happened to this in the theoretical treatment of this chapter?
In fact, the from Sec. 14.3 is related to the width parameter showing up in some equations of this chapter. It is however not systematically treated by this text. The reason for this is that in the maximum likelihood estimation, it does not show up as a separate variable anyway. But what is the reason for this now?
What happens here is that the maximum likelihood estimation
automatically includes the meaning of the prefactor or
into the other . So if the theoretical form says
[[also: which assumptions were made? Also see in ``improvements'']]