upvote
Running DeepSeek V4 without extreme quantization locally requires a lot of hardware.

The IQ2 quants that fit into 128GB machines are very degraded.

reply
That is true, it is a 1.6T parameters model so it requires a great deal of memory. I also heard there's a 2bit quantization that works well on Apple metal.
reply
From what I read, ds v4 is very close with opus 4.6 performance.
reply
The full model is, not the quantized versions.
reply
yeah that goes without saying. how can openweight, quantized version beat SOTA :)
reply