Try the 8-bit quantized version (UD-Q8_K_X) of Qwen 3.6 35B A3B by Unsloth: https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF
Some people also like the new Gemma 4 26B A4B model: https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF
Either should leave plenty of room for OS processes and for the KV cache, which lets you run a larger context size.
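If it helps, here's a minimal sketch of loading one of those GGUFs with the llama-cpp-python bindings. The repo_id and filename pattern below are copied straight from the links above and aren't verified; swap in whatever repo and quant you actually download, and size n_ctx to however much RAM you have left for KV cache.

```python
# Minimal sketch: load a quantized GGUF via llama-cpp-python.
# The repo_id / filename are taken verbatim from the links above --
# check they exist on Hugging Face before relying on them.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="unsloth/Qwen3.6-35B-A3B-GGUF",  # quoted repo; unverified
    filename="*UD-Q8_K_X*.gguf",             # glob for the 8-bit quant mentioned
    n_ctx=32768,       # KV cache grows with context, so fit this to your spare RAM
    n_gpu_layers=-1,   # offload every layer that fits onto the GPU
)

resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=64,
)
print(resp["choices"][0]["message"]["content"])
```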
I'm guessing that MoE models might work better, since only a few billion parameters are active per token, though there are also dense versions you can try if you want.
Performance and quality will probably both be worse than with cloud models, but it's a nice start!
Wait - what?