points
“The results show something close to inverse scaling: small, cheap models outperform large frontier ones.”