Hacker News
new
past
comments
ask
show
jobs
points
by
juancn
10 hours ago
|
comments
by
fancyfredbot
3 hours ago
|
[-]
The article is about two models which have either 2B or 4B parameters. Both are dense models. The 2B version will certainly use less power than qwen3-coder-next.
The models are quite good. They aren't just a tech demo.
reply