upvote
It's a comment. On Hacker News. Not the RL subreddit, or whatever. I'm just amazed at the jargon. I'm sure it's useful, but one could just call it model output.
reply
> one could just call it model output.

That would be incorrect. My other reply attempts to address this.

reply
But the probability vector is the output of the LLM, no?
reply