I can't trust Anthropic to manage their products in a way that supports my workflow.
ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.
local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.
No serious business uses Pro or Max, they are all on Anthropic API billing.
In fact with this move it is plainly obvious that Anthropic is moving compute from prosumers towards enterprise.
It will be interesting to see how it all plays out, but I suspect if cost continues to increase and output only improves incrementally from here, that the cost will be the final decider rather than the competence.
I could see it being a thing we use only sometimes, for some things, but ultimately remain reliant on developers to get the work through the pipeline.
Larger companies are using Claude through AWS Bedrock and are willing to easily pay $5k+ per engineer per month for it.
Developer salaries are driven up by scarcity - scarcity of developer skills overall and scarcity of developer skills in specific places like California. If AI models destroy the scarcity then the price worth paying for a coding agent will drop dramatically.
Maybe Anthropic can get away with it for a couple of months. But this will not last.
So the % is debatable of course. There's cases where an AI agent can save weeks worth of investigation, there's cases where you are mainly blocked due to processes, and many different circumstances. It's up to every company on their own to decide it. But if they decide it's 50%, why shouldn't they spend 50% of salary on it?
Like imagine a large company with thousands of microservices. You need to build a feature, before you had to setup cross timezone team meetings to figure out who owns what, what is happening in each microservice, how it all connects together. But now you can essentially send an AI Agent to scour and prepare all this material for you, which theoretically in this planning could save hours of back and forth meetings.
If 1 hour / 1 eng costs $200, then a 10 people 1h meeting avoided would save $200 x 10 = $2000 alone.
I don't see it as a replacement for dev, it's more of a multiplier.
This not nothing.
With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.
Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.
That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.
I find that with Opus 4.7 I can do two messages. Once I had a short session with 4-5 messages and it consumed $10 in extra usage.
This relegated Claude to a backup option in addition to Codex, which has the better desktop app anyway, and much better usage limits.
I’m considering to even cancel Claude entirely.
A/B testing people without their informed consent is immoral, unethical, and should be illegal.
so, what i'm saying is : I think a lot of companies align themselves with the cash first and then measure whether or not the negative image/user impact is manageable .
(in fact I know they operate this way.)
Sure. Let me just A/B test whether or not you'll respond positively or negatively to having your news delivered via push notification or delayed by 10 minutes.
I'm sure you would appreciate being tested on without your consent, just so that I can make an extra quick buck at your expense. Nothing amoral or unethical about it.
I agree, but can you really use Claude Code on the Pro plan as a full time developer, or professional 'knowledge worker' without hitting the usage limits fairly early in the day anyway?
I'm in the academia, and Claude's performance in my field could be described as a very fast junior grad student. When I use Claude Code, I typically spend a few hours figuring out what needs to be done exactly, and describing it in sufficient detail. Then Claude does it in 30 minutes, while an actual student would need days. And then I spend anything from minutes to days evaluating the results, depending on if it needs to be tested with real data and how much weirdness those tests uncover.
But I also have other work to do beyond guiding the automated grad student. Which means my Claude Code usage rarely exceeds 1–2 hours/week.
I have Pro Claude, Plus GPT and Pro Gemini. When one runs out I switch to another project on the next LLM. If I really need a task finished I'll restart it on another LLM, but I'm loathe to do that as it eats tokens just getting back up to speed.
It seems weird to segment this way though. Surely it’s better to just give Sonnet to your bottom tier, rather than cut out the entire Claide Code product entirely?
Give folks a taste rather than lock the whole product behind a $100/mo plan.