Hacker News
by kgeist 3 hours ago
Aurornis 3 hours ago
I should clarify that I'm referring generically to the types of quantizations used in local LLM inference, including those from Unsloth.
Nobody actually quantizes every layer to Q4 in a Q4 quant.
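To make that concrete, here's a rough sketch of why the average bits per weight of a "Q4" file lands above 4 when some tensors are kept at higher precision. The tensor groups, parameter counts, and bit widths below are made-up illustrative numbers, not the recipe of any particular quant or tool, and the nominal widths ignore per-block scale overhead.

```python
# Hypothetical tensor groups: (name, parameter count, nominal bits per weight).
# These numbers are illustrative assumptions, not taken from a real model file.
LAYERS = [
    ("token_embedding", 500_000_000, 6),      # kept at higher precision
    ("attention_and_mlp", 6_000_000_000, 4),  # the bulk, at the headline 4 bits
    ("output_head", 500_000_000, 6),          # kept at higher precision
    ("norms", 1_000_000, 32),                 # tiny, left unquantized
]


def average_bits_per_weight(layers):
    """Parameter-weighted average bit width across all tensor groups."""
    total_bits = sum(count * bits for _, count, bits in layers)
    total_params = sum(count for _, count, _ in layers)
    return total_bits / total_params


avg = average_bits_per_weight(LAYERS)
print(f"average bits per weight: {avg:.2f}")
```

With a mix like this the average comes out a bit above 4, which is the point: the "Q4" label describes the dominant type, not every tensor.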