Does it perform meaningfully better than the Kimi model given all that extra compute? And is the improvement proportional to the amount spent?