Hacker News
new
past
comments
ask
show
jobs
points
by
mudkipdev
5 hours ago
|
comments
by
entropicdrifter
4 hours ago
|
next
[-]
I'd rather see a distill on the 26B model that uses only 3.8B parameters at inference time. Seems like it will be wildly productive to use for locally-hosted stuff
reply
by
indrora
4 hours ago
|
prev
|
[-]
gemma4-31b-it-claude-opus-4-6-distilled-abliterated-heretic-GGUF-q4-k-m
reply