undefined

points

by u_sama16 hours ago |

comments

by lbreakjai16 hours ago|

[-]

Solve? You solve a problem, not something you introduced on purpose.

by HarHarVeryFunny15 hours ago|

prev|

[-]

It seems a lot of the problem isn't "token shrinkage" (reducing plan limits), but rather changes they made to prompt caching - things that used to be cached for 1 hour now only being cached for 5 min.

Coding agents rely on prompt caching to avoid burning through tokens - they go to lengths to try to keep context/prompt prefixes constant (arranging non-changing stuff like tool definitions and file content first, variable stuff like new instructions following that) so that prompt caching gets used.

This change to a new tokenizer that generates up to 35% more tokens for the same text input is wild - going to really increase token usage for large text inputs like code.

by mnicky11 hours ago|

parent|

[-]

> things that used to be cached for 1 hour now only being cached for 5 min.

Doesn't this only apply to subagents, which don't have much long-time context anyway?

by fetus816 hours ago|

prev|

[-]

on Tuesday, with 4.6, I waited for my 5 hour window to reset, asked it to resume, and it burned up all my tokens for the next 5 hour window and ran for less than 10 seconds. I’ve never cancelled a subscription so fast.

by u_sama16 hours ago|

parent|

[-]

I tried the Claude Extension for VSCode on WSL for a reverse engineering task, it consumed all of my tokens, broke and didn't even save the conversatioon

by fetus815 hours ago|

parent|

[-]

That’s truly awful. What a broken tool.