That's how it compensates for its small size. To accomplish a task of a given difficulty, you either know more and think less, or know less and think more.
That's been my experience as well. Huge amounts of reasoning. The model itself is good, but even if you get twice as many tokens per second as with another model, the extra reasoning may make it slower in the end.
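To put rough numbers on that trade-off, here is a back-of-the-envelope sketch. The token counts and speeds are made up for illustration, not measurements of any real model, and `wall_clock_seconds` is just a hypothetical helper name.

```python
def wall_clock_seconds(total_tokens: int, tokens_per_second: float) -> float:
    """Time to finish a response: tokens generated divided by generation speed."""
    return total_tokens / tokens_per_second

# Hypothetical figures: the small model generates twice as fast,
# but emits three times as many tokens because of its longer reasoning.
small_model = wall_clock_seconds(total_tokens=6000, tokens_per_second=100.0)
other_model = wall_clock_seconds(total_tokens=2000, tokens_per_second=50.0)

print(f"small model: {small_model:.0f} s, other model: {other_model:.0f} s")
print("small model is slower overall:", small_model > other_model)
```

With those assumed numbers, the faster model still finishes later, because total time depends on how many tokens it has to produce, not only on how quickly it produces them.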