Hacker News
by
happyPersonR
5 hours ago
by
overfeed
3 hours ago
Is there a cost benchmark out there? I wonder how frontier models are doing over time on cost per problem solved.
by
drob518
3 hours ago
I think they are optimizing for one-shot performance because that will drive usage. They can’t afford to look bad in the benchmarks. And if that means consuming an order of magnitude more tokens, well, that’s good for business, too.