Hacker News
by
BoorishBears
3 hours ago
by
girvo
8 minutes ago
They really are. Benchmaxxing is real… but the Qwen 3.5 series of models is still very impressive. I’m looking forward to trying out Gemma.
by
j45
3 hours ago
You definitely have to try each model on your own use case. Many models can be trained to perform better on these tests, but that might not transfer to what you actually need.