Hacker News
new
past
comments
ask
show
jobs
points
by
Mashimo
13 hours ago
|
comments
by
esafak
8 hours ago
|
next
[-]
That's how it compensates for its small size. To accomplish a task of certain difficulty either you know more and think less, or vice versa.
reply
by
Tepix
12 hours ago
|
prev
|
[-]
That's been my experience as well. Huge amounts of reasoning. The model itself is good but even if you get twice as many tokens as with another model, the added amount of reasoning may make it slower in the end.
reply