upvote
What does "restricts tweaking of sampling parameters" mean?
reply
"enforcing Temperature=1.0 and TopP=0.95 in Thinking mode, with mandatory validation that thinking content is correctly passed back."
reply
If the post training is done with specific sampling parameters it would make sense to only use the parameters it was trained with.
reply
Yes, but post training cannot possibly account for all possible use cases. Sane defaults are fine, you can't really do much about sampling parameters in chatbots and coding harnesses anyway. And when making an API call, you have to actively change the parameter in your payload. I don't believe there's any real risk.
reply