undefined

points

[-]

There is actually a real bull case for xAI (that I don't endorse), e.g. from people who think that chips & computer is the main determiner of model quality. xAI may plausibly soon have the biggest training apparatus of anyone.

I think talent is more important than compute, as I wrote in my Jan 2026 predictions that Anthropic would end up on top this year: https://futuresearch.ai/blog/forecasting-top-ai-lab-2026/

by scottyah3 hours ago|

prev|

[-]

If you don't spend any time comparing models to the point where you don't know about benchmarks, why do you care where people think the line for SOTA is?

by mikkupikku2 hours ago|

parent|

[-]

The benchmark game is wholly gamed, but the proof is in the pudding. I know people using Anthropic, OpenAI, and Gemini. Chinese models locally. But who uses Grok for anything but porn? Whatever the benchmarks might say, Grok is just trash in practice. They spent too much time teaching it to be edgy and not enough time teaching it to code.

by ahahahahah3 hours ago|

prev|

[-]

> I assume "extremely skeptical" is you being generous

I'm not sure that's the case. Every value in this forecast is absurd, I actually think the author is sincere in there feeling that they are being extremely skeptical.