upvote
That’s the exact ram/vram combo of my desktop - what model would you suggest for that gaming pc setup?
reply
I would recommend starting with Qwen 3.6 35B at maybe Q5; it should be fast on that setup. For intelligence, Qwen 3.7 27B is smarter but will run much slower. Others have also mentioned Gemma 4, which might be worth a try.
reply
> Feels like SOTA from maybe a year ago?

Agree, but only for small projects. SOTA from a year ago still wins on larger projects.

reply
"Coding system" "can really do coding locally"

Vibe coders out here thinking all software development is solved because they made an (ugly and unoriginal) dashboard for their SaaS clone and a single-column landing page with a 3x3 grid of feature cards that's identical to every other vibe coder's "startup"

reply
deleted
reply
How are you using that RAM with the GPU?
reply
Llama.cpp with automatic offload to main memory. You can also use Ollama, which is easier but slower.
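For anyone curious what that looks like in practice, here's a rough sketch of a llama.cpp invocation (the model filename is a placeholder; tune `-ngl` to however many layers actually fit in your VRAM):

```shell
# Offload as many layers as fit on the GPU; the rest run from system RAM.
# -ngl / --n-gpu-layers controls the GPU/CPU split, -c sets the context size.
./llama-cli \
  -m ./models/your-model-q5_k_m.gguf \
  -ngl 20 \
  -c 8192 \
  -p "Hello"
```

If you see out-of-memory errors, lower `-ngl`; if you have VRAM to spare, raise it for more speed.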
reply
For those who want a GUI, LM Studio does this too (with llama.cpp as the backend, I think). I'm getting great (albeit slow) results with Qwen3.6-35B MoE on 8GB of GPU RAM and 40GB of system RAM.
reply