ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.
local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.
No serious business uses Pro or Max, they are all on Anthropic API billing.
In fact with this move it is plainly obvious that Anthropic is moving compute from prosumers towards enterprise.
It will be interesting to see how it all plays out, but I suspect if cost continues to increase and output only improves incrementally from here, that the cost will be the final decider rather than the competence.
I could see it being a thing we use only sometimes, for some things, but ultimately remain reliant on developers to get the work through the pipeline.
Larger companies are using Claude through AWS Bedrock and are willing to easily pay $5k+ per engineer per month for it.
Developer salaries are driven up by scarcity - scarcity of developer skills overall and scarcity of developer skills in specific places like California. If AI models destroy the scarcity then the price worth paying for a coding agent will drop dramatically.
Maybe Anthropic can get away with it for a couple of months. But this will not last.
So the % is debatable of course. There's cases where an AI agent can save weeks worth of investigation, there's cases where you are mainly blocked due to processes, and many different circumstances. It's up to every company on their own to decide it. But if they decide it's 50%, why shouldn't they spend 50% of salary on it?
Like imagine a large company with thousands of microservices. You need to build a feature, before you had to setup cross timezone team meetings to figure out who owns what, what is happening in each microservice, how it all connects together. But now you can essentially send an AI Agent to scour and prepare all this material for you, which theoretically in this planning could save hours of back and forth meetings.
If 1 hour / 1 eng costs $200, then a 10 people 1h meeting avoided would save $200 x 10 = $2000 alone.
I don't see it as a replacement for dev, it's more of a multiplier.
This not nothing.
With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.
Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.
That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.
I find that with Opus 4.7 I can do two messages. Once I had a short session with 4-5 messages and it consumed $10 in extra usage.
This relegated Claude to a backup option in addition to Codex, which has the better desktop app anyway, and much better usage limits.
I’m considering to even cancel Claude entirely.