With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.
Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.
That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.
I find that with Opus 4.7 I can do two messages. Once I had a short session with 4-5 messages and it consumed $10 in extra usage.
This relegated Claude to a backup option in addition to Codex, which has the better desktop app anyway, and much better usage limits.
I’m considering to even cancel Claude entirely.