What of it?

For me too, it was around that time last year, with GPT-5, Claude Sonnet 4.5, and then Gemini 3, that I started feeling these models were clearly becoming great at reasoning. I'm not at all opposed to saying that they are around PhD level in at least some domains.

I think there's a big difference between sounding like someone and being someone. The models are indeed excellent at pretending.

Exactly. This is what the whole RL thing is optimizing for, even if that is not the intent.