
What of it?

For me too, it was around that time last year, with GPT-5, Claude Sonnet 4.5 and then Gemini 3, that I started feeling these models were clearly becoming great at reasoning. I'm not at all opposed to saying that they are around PhD-level in at least some domains.

by kmaitreys, 2 hours ago


I think there's a big difference between sounding like someone and being someone. The models are indeed excellent at pretending.

by 0123456789ABCDE, 2 hours ago


Exactly. This is what the whole RL thing is optimizing for, even if that is not the intent.