upvote
But what about real price in real agentic use? For example, Opus 4.5 was more expensive per token than Sonnet 4.5, but it used a lot less tokens so final price per completed task was very close between the two, with Opus sometimes ending up cheaper
reply
How does it work exactly? How this model is cheaper and has the same perf as Opus 4.5?
reply
Distilling from a teacher (Opus 4.5) and scaling RL more.
reply
this is called progress
reply
I'm asking technically how progress works. What is actually being improved here
reply
Or, we can bleed out cash for a very long time.
reply