undefined

points

[-]

They really are. Benchmaxxing is real… but also the Qwen 3.5 series of models are still very impressive. I’m looking forward to trying out Gemma

by j453 hours ago|

prev|

[-]

Definitely have to use each model for your use case personally, many models can train to perform better on these tests but that might not transfer to your use case.