undefined

points

[-]

For personal use I've noticed Claude (via the web-based chat UI) making really bizarre mistakes lately like ignoring input or making completely random assumptions. At work Claude Code has turned into an absolute dog. It fails to follow instructions and builds stuff like a lazy junior developer without any architecture, tests, or verification. This is even with max effort, Opus 4.6, multiple agents, early compaction, etc. I don't know what they did but Anthropic's quality lead has basically evaporated for me. I hope they fix it because I've since adapted my project's Claude artifacts for use with Codex and started using it instead - it feels like Claude Code did earlier this year.

I'd like to give the new GLM models a try for personal stuff.

by lelanthran5 hours ago|

prev|

[-]

> I easily get $1K+ of usage out of my $100 max sub. And that's with Opus 4.6 on high thinking.

And people keep claiming the token providers are running inference at a profit.

by gruez5 hours ago|

parent|

[-]

>And people keep claiming the token providers are running inference at a profit.

Not everyone gets $1K of usage, and you don't know how fat the per-token margins are. It's like saying the local buffet place is losing money because you eat $100 worth of takeout for $30.

by lelanthran2 hours ago|

parent|

[-]

> Not everyone gets $1K of usage, and you don't know how fat the per-token margins are.

Well, we're going to find out sooner rather than later. Right now you don't know how thin (or negative) the margins are, either, after all.

All we know for certain is how much VC cash they got. Revenue, spend, profit, etc calculated according to GAAP are still a secret.

by kitsune14 hours ago|

parent|

prev|

[-]

[dead]

by tern1 hours ago|

prev|

[-]

According to the meter, I used $15k in tokens with my Max plan (along with $5k of Codex tokens) in the last 30 days. That built an entire working and (lightly) optimized language, parser, compiler, runtime toolchain among other things.

by Aurornis4 hours ago|

prev|

[-]

Some of the newer models available on OpenRouter are good, but I agree that none of them are a replacement for Opus 4.6 for coding.

If you're trying to minimize cost then having one of the inexpensive models do exploratory work and simple tasks while going back to Opus for the serious thinking and review is a good hybrid model. Having the $20/month Claude plan available is a good idea even if you're primarily using OpenRouter available models.

I think trying to use anything other than the best available SOTA model for important work is not a good tradeoff, though.

by mikeocool3 hours ago|

prev|

[-]

Yeah — I just created an anthropic API key to experiment with pi, and managed to spend $1 in about 30 minutes doing some basic work with Sonnet.

Extrapolating that out, the subscription pricing is HEAVILY subsidized. For similar work in Claude Code, I use a Pro plan for $20/month, and rarely bang up against the limits.

by causal3 hours ago|

parent|

[-]

And it scales up - the $200 plan gets you something like 20x what the Pro plan gets you. I've never come close to hitting that limit.

It's obviously capital-subsidized and so I have zero expectation of that lasting, but it's pretty anti-competitive to Cursor and others that rely on API keys.

by walthamstow2 hours ago|

parent|

prev|

[-]

I ran ccusage on my work Max account and I spend what would cost $300 a week if it was billed at API rates.

by nothinkjustai5 hours ago|

prev|

[-]

Not everyone is just vibecoding everything and relying on agents running sota models to do anything tho.