Vibes > Benchmarks. And it's all so task-specific. Gemini 3 has scored very well in benchmarks for a long time but is poor at agentic use cases. A lot of people prefer Opus 4.6 to 4.7 for coding despite the benchmarks, far more than I've seen with earlier upgrades (4.5->4.6, 4->4.5).

That doesn't mean DeepSeek V4 isn't great, just that benchmarks alone aren't enough to tell.

by snovv_crash 9 hours ago

With the ability of the Qwen3.6 27B, I think in 2 years consumers will be running models of this capability on current hardware.

by colordrops 9 hours ago

What's going to change in 2 years that would allow users to run 500B-800B parameter models on consumer hardware?

by DiscourseFan 9 hours ago

I think it's just an estimate.

by indigodaddy 8 hours ago

But the question remains