undefined

points

by wesammikhail23 hours ago |

comments

by freedomben23 hours ago|

[-]

Plus you can control thinking time a lot more, so when Anthropic lobotomizes Opus on you...

by verdverm23 hours ago|

prev|

[-]

My experience with qwen-3.6:35B-A3B reinforces this, gonna give this a spin when unsloth has quants available

Gemini flash was just as good as pro for most tasks with good prompts, tools, and context. Gemma 4 was nearly as good as flash and Qwen 3.6 appears to be even better.

by cassianoleal23 hours ago|

parent|

[-]

> when unsloth has quants available

https://huggingface.co/unsloth/Qwen3.6-27B-GGUF

by verdverm23 hours ago|

parent|

[-]

That was quick (compared to the 1T Kimi-2.6, not surprising)

by danielhanchen22 hours ago|

parent|

[-]

Haha :) We had some issues with Kimi-2.6 since it was int4 and we were investigating how to handle it :)

by verdverm19 hours ago|

parent|

[-]

Appreciate what y'all do! We were slacking about how many HGX-B300 it would take to run Kimi and it looks like we could actually fit 2-3 Kimis on a single HGX.

by dudefeliciano23 hours ago|

prev|

[-]

> Size of the model isnt all that matters.

What matters is the motion in the tokens