I’m not having it build whole features from scratch, though. I give it pretty explicit instructions closer to the class or function level, and it still saves me an immense amount of time, while I’m very connected to the code that’s written.
Definitely the sweet spot for me.
For 24GB VRAM cards (e.g. 4090) you can use Q6_K (22.5GB) or Q5_K_M (19.5GB) quants, possibly offloading some of the weights to RAM.
At any rate it makes a stolen backpack or spilled drink a lot less damaging.
Unsloth recommends 18GB of RAM for Qwen3.6-27B (for their version of the model).
Sent from my 8gb M2 Mac mini.
I struggle to imagine purchasing multiple 1k+ cards on my own dime.