I'm doubtful that the companies telling their employees to burn more tokens are doing careful evaluations of cost versus benefit. People on an expense account don't shop around much.

Maybe they'll penny-pinch later after running through their AI budgets?

Has anybody compared these directly using exactly the same prompts and harness? I assume V4 Pro could be a real frontier model, and if that's true, it'd be better to use it for automation or routine steps instead of simpler models (e.g. Haiku, or even Sonnet if V4 Pro is better).