by 2001zhaozhao 19 hours ago

I don't have anything to add, but I like how you guys are actually sending people to communicate on Hacker News. Brilliant.

by oliver236 13 hours ago

you're one of them?

by simianwords 19 hours ago

Maybe a good idea to be more explicit about this -- maybe a cost analysis benchmark would be a nice accompaniment.

This kind of thing keeps popping up each time a new model is released and I don't think people are aware that token efficiency can change.

by tedsanders 19 hours ago

Agreed. Would be great if everyone started reporting cost per task alongside eval scores, especially in a world where you can spend arbitrary test-time compute. This is one thing I like about the Artificial Analysis website - they include cost to run alongside their eval scores: https://artificialanalysis.ai/
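
To make the cost-per-task idea concrete, here is a rough sketch of the arithmetic. Everything in it is hypothetical: the model names, per-million-token prices, token counts, and solved-task counts are made up for illustration and are not taken from any real price list or benchmark.

```python
from dataclasses import dataclass


@dataclass
class EvalRun:
    """One model's run over a benchmark. All values used below are illustrative."""
    name: str
    input_tokens: int          # total prompt tokens across the whole run
    output_tokens: int         # total completion (incl. reasoning) tokens
    usd_per_m_input: float     # price per 1M input tokens (hypothetical)
    usd_per_m_output: float    # price per 1M output tokens (hypothetical)
    tasks_solved: int


def total_cost_usd(run: EvalRun) -> float:
    """Total spend for the run: tokens times per-token price, summed."""
    return (
        run.input_tokens * run.usd_per_m_input
        + run.output_tokens * run.usd_per_m_output
    ) / 1_000_000


def cost_per_solved_task(run: EvalRun) -> float:
    """Spend divided by tasks solved; undefined if nothing was solved."""
    if run.tasks_solved == 0:
        raise ValueError(f"{run.name} solved no tasks")
    return total_cost_usd(run) / run.tasks_solved


# Model B costs twice as much per token but emits far fewer output tokens.
model_a = EvalRun("model-a", 2_000_000, 8_000_000, 1.0, 4.0, tasks_solved=50)
model_b = EvalRun("model-b", 2_000_000, 2_000_000, 2.0, 8.0, tasks_solved=50)

for run in (model_a, model_b):
    print(f"{run.name}: ${total_cost_usd(run):.2f} total, "
          f"${cost_per_solved_task(run):.2f} per solved task")
```

With these made-up numbers the model with the higher per-token price ends up cheaper per solved task, which is the kind of thing a per-token price list alone does not show.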

by dannyw 12 hours ago

Their subscribers will see/feel the difference regardless; API pricing is hopefully read by devs who know about token efficiency and effort.