Hacker News
new
past
comments
ask
show
jobs
points
by
tarruda
4 hours ago
|
comments
by
lostmsu
56 minutes ago
|
[-]
Tuned Qwen 3.5 27B beats Step 3.5 on almost all benchmarks, so the point about the size class is moot.
reply
by
tempaccount420
34 minutes ago
|
parent
|
[-]
Benchmarks are not interesting in deciding the "size class". Bigger size means more knowledge. Also, the Qwen 3.5 27B is a dense 27B active parameter model. StepFun 3.5 Flash has 11B active parameters.
reply