Hacker News
by aliljet 8 hours ago
by satvikpendem 5 hours ago
Unsloth Studio with its MTP support:
https://unsloth.ai/docs/models/qwen3.6#mtp-guide
by julianlam 6 hours ago
Try llama.cpp and Qwen3.6-35B-A3B. Good balance of intelligence and speed.
by plagiarist 7 hours ago
I think their Max models are far too big to fit on consumer hardware. If and when they release smaller versions, people typically run those on Apple, AMD Halo, or dGPUs. Those are all varying degrees of "affordable."