upvote
That just means it cost a lot.

Does it perform meaningfully better than the Kimi model given all that extra compute? And proportionally to the amount spent?

reply