What I’ve usually seen is 4.7 -> 4.5 -> 4.6 in terms of quality. Though 4.7 seems to hallucinate more than before.
Anthropic themselves have (had?) this thing where Opus is used for planning and Sonnet for coding.
I thought this was a cost-saving measure: we plan with the frontier (SOTA) model, then code with something cheaper.

But then, Anthropic employees don't have rate limits, right?
