undefined

points

by aliljet8 hours ago |

comments

by satvikpendem5 hours ago|

[-]

Unsloth Studio with its MTP support: https://unsloth.ai/docs/models/qwen3.6#mtp-guide

by julianlam6 hours ago|

prev|

[-]

Try llama.cpp and Qwen3.6-35B-A3B

Good balance of intelligence and speed.

by plagiarist7 hours ago|

prev|

[-]

I think their Max models are far bigger than fits on consumer hardware. People are typically using Apple, AMD Halo, or dGPUs if/when they do smaller versions. Those are all varying degrees of "affordable."