undefined

points

by binyu9 hours ago |

comments

by Aurornis9 hours ago|

[-]

Running DeepSeek V4 without extreme quantization locally requires a lot of hardware.

The IQ2 quants that fit into 128GB machines are very degraded.

by binyu9 hours ago|

parent|

[-]

That is true, it is a 1.6T parameters model so it requires a great deal of memory. I also heard there's a 2bit quantization that works well on Apple metal.

by tuananh9 hours ago|

prev|

[-]

From what I read, ds v4 is very close with opus 4.6 performance.

by DeathArrow6 hours ago|

parent|

[-]

The full model is, not the quantized versions.

by tuananh5 hours ago|

parent|

[-]

yeah that goes without saying. how can openweight, quantized version beat SOTA :)