That presumes that there is a linear scale that measures performance. This can be tested: https://en.wikipedia.org/wiki/Rasch_model

Even assuming this holds, the utility you gain from the best models depends entirely on your workload. If your tasks require performance 10 and DeepSeek delivers 9, you will gladly pay for SotA models.

And yet it seems that 90% are happily paying for the marginal 10% of capability and saturating datacenters.
Happy to pay for? Or happy to spend other people's money on?
That is called marketing.
Not necessarily. It might just as well be "time is money".
If the second-best lamp is 90% as good and 10x cheaper, most people will use the second-best lamp...
That’s what he said?