undefined

points

[-]

If the trick were genuinely useful, and was well circulated months ago, the resource-starved inference providers would have squeezed this trick dry already, instead of wasting 60% of their tokens, waiting for users to implement it themselves in 5 minutes of effort.

by solenoid093754 minutes ago|

parent|

[-]

100%.

I miss when HN was mostly people that knew what they were talking about.