upvote
> In general, quantizing down to 6 bits gives no measurable loss in performance.

...this can't be literally true or no one (including e.g. OpenAI) would use > 6 bits, right?

reply
Did you run say SWE Bench Verified? Where does this claim coming from? It's just an urban legend.
reply