They really are. Benchmaxxing is real… but the Qwen 3.5 series of models is still very impressive. I’m looking forward to trying out Gemma.
You definitely have to try each model on your own use case. Many models can be trained to score better on these benchmarks, but that improvement might not transfer to what you actually need.