undefined

[-]

Claude is also weird for being the only coding assistant that for some reason doesn't support AGENTS.md. Codex, Amp, Cursor all of them support it and read from it, but not claude which forces it's users to use CLAUDE.md instead.

The issue is the higest voted issue on their gitlab repo: https://github.com/anthropics/claude-code/issues/6235

by ValentineC7 hours ago|

[-]

My CLAUDE.md is just:

    @AGENTS.md

And Claude processes it just fine.

(I see that it's a common workaround, and there's a comment in the above link saying just this: https://github.com/anthropics/claude-code/issues/6235#issuec...)

It's a hassle having to add it to every repo that I use Claude with though, and I often use other models and harnesses too for the more trivial tasks.

by troupo5 hours ago|

[-]

I beg people to learn what symlinks are. The fact that "put @AGENTS.md in there" is a "common workaround" shows why programmers (good ones at least) are not going anywhere soon.

by fortuitous-frog4 hours ago|

[-]

I used to use a symlink but was concerned that Claude might see the presence of an "AGENTS.md" file (in e.g, a "List Files" tool call output or from a direct `ls`), be curious and attempt to read it directly (not knowing that it's the same as the "CLAUDE.md" file auto-injected by the harness), and essentially double the token impact / context bloat. Indeed, I did some local experimentation and noticed this was the case, which is why I switched to the explicit "@AGENTS.md" approach.

So perhaps there's no need to be rude about it :)

by lijok3 hours ago|

[-]

Good programmers know symlinks are not portable

by _alternator_2 hours ago|

[-]

Neither are line endings. But like line endings, I just put it into git and get over it.

by ValentineC4 hours ago|

[-]

One bonus to this approach is that I can add Claude Code-specific stuff in there, that I wouldn't need for other harnesses.

by layer84 hours ago|

[-]

Symlinks aren’t portable.

by ahtihn4 hours ago|

[-]

Symlinks are a pain if you're on Windows, I'd rather not bother with them.

by qwertox2 hours ago|

[-]

I never had any issues with them. And they fixed my problems in a good way.

by drysine1 hours ago|

[-]

>Symlinks are a pain if you're on Windows

How so?

by dpkirchner4 hours ago|

[-]

I'm pretty sure some agent harnesses read both files when present, so this @ "aliasing" is more token efficient.

by 4 hours ago|

[-]

deleted

by chorkpop7 hours ago|

[-]

CLAUDE.md has been incredibly successful for them advertising wise. I wouldn’t expect them to admit AGENTS.md exists anytime soon.

by bandrami6 hours ago|

[-]

So there's this amazing thing called a symlink

by hk__21 hours ago|

[1]: https://github.com/dutifuldev/claude-md-symlinker#claude-md-...

[-]

Not amazing enough. Let me present you claude-md-symlinker [1], an 8k-lines Rust project designed to do exactly the same as a symlink but in an obscure manner, running as a Linux service and maintaining its own database. This is like left_pad but for symlinks.

(I’m not the author obviously)

by denkmoon30 minutes ago|

[-]

Haven’t even had my coffee yet and this is definitely the most cursed thing I’ll see today, thanks!

by datsci_est_20154 hours ago|

[-]

Literally trying to use file naming to build a moat. “We can’t switch to Cursor, we’d have to rename all of our files from CLAUDE.md to AGENTS.md!”

by hexsprite6 hours ago|

https://github.com/hexsprite/claude-agents-md

[-]

I created a Claude Code plugin to load AGENTS.md. Uses symlinks but it’s better than no support.

by hk__21 hours ago|

[-]

What’s actually better is doing a symlink by hand. 1- if you didn’t know it you learn what’s a symlink; 2- if that’s in a git repo you share it with all your coworkers without all of them needing to install the plugin; 3- security-wise it’s way better than running some 100+ lines of Bash from a repo on GitHub.

by arcanemachiner5 hours ago|

[-]

Claude Code supports native imports: `@AGENTS.md`

by OJFord5 hours ago|

[-]

To be fair if you can do it through some kind of plugin or skill it does spare you having a CLAUDE.md of `@AGENTS.md` in every repo individually.

by troupo5 hours ago|

[-]

You two realise that symlinks exist, right? That you need neither a "plugin" nor a "native import"?

by anon3738397 hours ago|

[-]

[flagged]

by sethaurus3 hours ago|

https://news.ycombinator.com/newsguidelines.html

[-]

> Be kind. Don't be snarky. Converse curiously; don't cross-examine. Edit out swipes.

by kokada10 hours ago|

[1]: https://github.com/anthropics/claude-code/issues/69238#issue...

[-]

Not that Claude Code is much better, I just hit this issue[1] because it seems setting DO_NOT_TRACK=1 seems enough to get a really strange behavior in the newest versions of CC.

Edit: I think I misunderstood OP, they're saying that CC is even worse and not better than Codex CLI.

by mvATM9911 hours ago|

[-]

Yeah exactly.

I'm not exactly building TUI's every day, but even i felt pain when i read that "small game engine" post

by matheusmoreira9 hours ago|

[-]

At least game engines manage to render their frames properly. Claude Code sometimes eats entire paragraphs of text output, resulting in things such as numbered lists jumping from 2 to 4 out of nowhere.

I'd just ask Claude to repeat himself at first but it happens so often that I actually made a little tool to dig up the output inside the session history and present it properly in a separate terminal.

by TacticalCoder9 hours ago|

[-]

> I'm not exactly building TUI's every day, but even i felt pain when i read that "small game engine" post

The bigger issue is they where somehow thinking it was "cool" and "advanced" while it's just a kludgy rube-goldbergy monstrous hack.

Which is of course only semi-working: to me the model thinking what you see is what it outputs in the TUI is the deal-breaker for me. It's of course not working like that for they're apparently, in their "game engine", converting on the fly a headless browser to approximated characters to display in the terminal. So the model tells you he did output ASCII but people are copy/pasting (because, yes, at times you want to copy/paste) Unicode chars.

Plenty of bug reports and pissed users.

That's the bigger issue.

The biggest issue is those thinking a 10 GB VM required to run a headless Electron browser and then fuxx0ring characters conversion is somehow an achievement.

by dyarosla8 hours ago|

[-]

Whats the small game engine post?

by gbnwl8 hours ago|

https://xcancel.com/trq212/status/2014051501786931427#m

[-]

by Dilettante_2 hours ago|

[-]

Seeing Jonathan blow invoked and his response in the comments tickled me pink

by inigyou2 hours ago|

[-]

Was it Casey Muratori that spoke about an AI educing allocations from 10k per frame to 200 per frame, but the manual programming work got it to 0 instead?

by varjag9 hours ago|

[-]

Right, just yesterday I found my laptop kinda hot. And what do you think, it was good old Claude deciding to load a few cores with completely idling prompts.

by cozzyd6 hours ago|

[-]

Yeah, somehow claude-cli sitting in a tmux session doing nothing uses my laptops battery twice as fast. Powertop suggests a ton of wakeups, but why? Is it busy polling the input or something?

by varjag15 minutes ago|

[-]

Polling the Department of Trade for Order 66

by sambcui6 hours ago|

[-]

I don’t know if you can resonate, but I feel like the Vibe Coded codex and Claude Code desktop apps are iterating way faster than they should be.

by malfist6 hours ago|

[-]

How are they iterating? I've not noticed anything major changing between the versions of my claude code. Other than that sometimes this version includes /btw and sometimes it's missing.

by iLoveOncall8 hours ago|

[-]

Surprisingly Kiro is fine (I work at Amazon but not at all on the Kiro team). I prefer it to anything else I've tried (except Amazon Q Developer in IntelliJ, but it's now deprecated).

by epistasis6 hours ago|

[-]

Kiro is surprisingly good, if the interface for saving and resuming was slightly more reliable, and there was the hope of remote sessions, I'd probably switch to it full time. I vastly prefer it to having to fight against buggy force-fed features like UltraPlan or whatever.

by r_lee10 hours ago|

[-]

if we are at 10x with AI and near AGI or ASI, then how is it possible that these products (Codex, Claude Code CLI) are still such garbage?

shouldn't this "agentic AI revolution" have long solved this already?

no way they're over there saying "we are on it plz wait" or that "it's too much effort"?

by igleria10 hours ago|

[-]

This is the biggest elephant in the room I have seen in my decade+ career. At the same time, look how bad Apple is in software compared to its hardware... It's not an AI only problem, it's almost like software in general gets a free pass on being very unsafe or low quality because no one wants to face the same "profit reducing red tape" that civil engineers or similar face.

by CharlieDigital10 hours ago|

[-]

Anthropic were the progenitors of the Model Context Protocol. Claude Code does not fully implement the client end of the protocol. A protocol; a literal pre-defined spec that an agent should be able to one-shot. Neither does Codex. Codex does not implement MCP Prompts.

(I want Codex to implement MCP Prompts because then we have one central way to ship skills from a server).

The fact that neither platform can implement a protocol given what is functionally infinite frontier model tokens really says a lot. I do not care what kind of random project some influencer can ship with a swarm of 1000 agents. If you cannot make the basics work, it is a farce.

by deathbob7 hours ago|

[-]

It still boggles my mind that Anthropic would invent the MCP protocol but not fully implement it.

Especially when fully implementing it (prompts, resources, tools) is easily done in harnesses that don’t ship with MCP but allow good extension / modification like Pi.

Claude not being able to see its own usage or self invoke slash commands is also very frustrating.

by oblio7 hours ago|

https://www.joelonsoftware.com/2002/01/06/fire-and-motion/

[-]

> It still boggles my mind that Anthropic would invent the MCP protocol but not fully implement it.

> Do they just want to force you to keep busy reacting to their volleys, so you can’t move forward?

by CharlieDigital5 hours ago|

[-]

> ...Do they just want to force you to keep busy

Given functionally unlimited access to tokens with frontier models, there is really no "force you to keep busy"; it should just bake overnight. We're talking about a rather simple and well-defined specification; not something novel and complex.

by oblio21 minutes ago|

[-]

My point is that there is a chance that Anthropic launched MCP:

1. not fully believing in it

2. but knowing the hype around MCP would force other AI labs to implement it (standard enterprise checkbox behavior)

3. thus wasting some of their competitors' development cycles

* * *

And let's be real here, the entire discussion happens in the context of a basic bug in a coding agent, do we really believe that these labs have hit AGI in coding?

Random example:

Go to claude.ai or gemini.google.com (I imagine OpenAI is in a similar situation).

Type a question, press enter. Wait 2 seconds, then turn on airplane mode.

Not only does the connection cut off, but even if you reconnect 20 minutes later, you still won't get the answer.

Their website works in purely sync mode!!!

We knew better than this when HTML was invented, 30 years ago.

Do their products give you the impression than "baking features with unlimited tokens overnight" leads to decent products?

by thewebguyd5 hours ago|

[-]

> same "profit reducing red tape" that civil engineers or similar face.

I don't think we should ever head toward licensing/a credential body for software development, but I do think now is a good time to have discussions around liability for defective products.

A good start would be to stop allowing companies to disclaim all warranties of fitness for a particular purpose in their EULAs. The joke of Microsoft Copilot applies here where they have a big disclaimer that "Copilot is for entertainment purposes only" while advertising says otherwise. Not even the chrome EULA will agree that its fit for purpose as a web browser. The clause is a get out of jail free card that shifts all liability and risk to the end user.

by datsci_est_20154 hours ago|

[-]

> I don't think we should ever head toward licensing/a credential body for software development, but I do think now is a good time to have discussions around liability for defective products.

Liability is how a credential body would organically grow. It already exists in the security, compliance, and enterprise parts of the software world.

by inigyou2 hours ago|

[-]

That can be okay. The problems we're worried about come when it's government mandated.

The EU Cyber Resilience Act puts heavy liability on vendors for software vulnerabilities that get exploited, including in open-source components they incorporate. OSS devs are shielded - liability is on the companies who incorporate OSS into commercial stuff.

by datsci_est_20153 minutes ago|

[-]

In practice, what’s the difference between a government mandated license and a government that quickly rules in favor of parties who are damaged by companies that don’t use licensed software engineers?

E.g. “Your software caused serious damages to our company / livelihood, and you best hope that it turns up in discovery that you used properly licensed software engineers who were following licensing best practices, otherwise this will be a slam dunk case.”

Genuinely an interesting question to me. Seems like the latter is a better option, generally, but it does lock restorative justice behind a paywall - you have to be able to afford a lawyer.

by forshaper6 hours ago|

[-]

How much of all this is due to hardware improving, and software bloating enough to fill the capacity?

by thewebguyd6 hours ago|

[-]

> shouldn't this "agentic AI revolution" have long solved this already?

Daily reminder that Anthropic took over a year to fix the Claude Code terminal flickering issue despite proclaiming all over the internet that software development as a "solved problem."

Apple forked over $250 Million in a class action over false advertising for Apple Intelligence. When do we start seeing the same for the misleading and outright false claims coming out of the frontier labs about the model capabilities? At this point the marketing is doing more harm than the technology itself because its warping the perceptions of those at the top that make decisions. The only reason tokenmaxxing was ever a thing was because marketing mislead execs and technology decisions were made based on vibes instead of evidence.

by mannanj29 minutes ago|

[-]

Why is not a thing that people track the lies of people as they are public, and tie them to their reputation over time for anyone to find?

by mannanj3 hours ago|

[-]

As long as a majority of the people of the living class are gullible and naive and sick, entrained behavior from the institutions and media they are made to consume, they stop seeing the misleading and false claims. Or at least they myopically see it short enough to complain about it in an ineffective way, then continue to consume the next big lie or slop. Until something happens that channels that accumulated rage finally into a cause they feel makes things right (assuming they have not already died and the next generation has been groomed to fall for the rich man's trap) and those who's family and next generation is to continue the extraction and trickery hides behind an anonymous personality or system.

by jeffybefffy5199 hours ago|

[-]

Because vibe coding is a toy… thats the secret.

You can use it to accelerate development certainly, but that requires careful change->review cycles. The developer still needs to be in heavy control, versus vibe coding having an agent own the code base.

by Nullsession47 minutes ago|

[-]

[flagged]

by hombre_fatal9 hours ago|

[-]

Like anything, you have to decide between polish vs switch to any other task in the queue. If you choose too much from the latter, then polish suffers, yet that's a human thing.

Also, Codex and Claude Code aren't as bad as people say. I think most of the noise is embellished by the "hah see? AI sucks" angle.

It's kind of like how HNers would claim to your face that you can't actually build anything with Javascript and Node.js (JS just sucks too much), then they'd list off a few footguns that were supposed to demonstrate why. In other words, champing at the bit for JS to lead people to catastrophize issues that were pretty mediocre.

by geodel6 hours ago|

[-]

> yet that's a human thing.

is this joke?

Here we are talking about trillon dollar AI companies who claim AI can fix decade old bugs and create new compilers, OSs and what not. Are parallel agents working autonomously to fix issues as well as create new features not allowed at these companies?

by hombre_fatal6 hours ago|

[-]

Humans still decide what LLMs do in a code base, full stop.

by quikoa2 hours ago|

[-]

Yea it's too bad these poor scrappy startups cannot afford engineers to build decent software.

by inigyou2 hours ago|

[-]

Why not just tell it to do everything on the task list and then tell it to fix all bugs?

by coldtea8 hours ago|

[-]

>Like anything, you have to decide between polish vs switch to any other task in the queue

Why do you "have to decide"? Let some agents go at both of those, isn't that what they claim people can just do?

>Also, Codex and Claude Code aren't as bad as people say. I think most of the noise is embellished by the "hah see? AI sucks" angle.

Why shouldn't it? They're not the ones making the extraordinary claims.

by hombre_fatal6 hours ago|

[-]

> Why do you "have to decide"? Let some agents go at both of those, isn't that what they claim people can just do?

Because your code is still marching somewhere in tokens per second. You have to decide where they are allocated: polish or the next thing. Humans still are the ones prompting LLMs and deciding what is done.

> isn't that what they claim? Why shouldn't it? They're not the ones making the extraordinary claims.

Even if I grant that someone else makes excessive claims, why would that let you off the hook to stay grounded?

Though I don't grant it. Maybe if Anthropic claimed that Opus makes all decisions at the company and builds all software without humans doing all the prompting, the critics would make more sense.

Until then, it looks more like a double standard: if software built with AI has any issues, then see, AI is shit and the humans who invoked it had no role in it. e.g. it could be the case that Anthropic's Claude Code engineers just aren't doing as much polish as they should.

Better answer: Someone asked why might it be the case that AI-written software has issues, and it has a real answer. Marketing claims are a different conversation.

by geodel6 hours ago|

[-]

> Maybe if Anthropic claimed that you could write an unsupervised loop that writes perfect software, the critics would make more sense.

Or to be upstanding, ethical companies that they are. Just put disclaimer after every prompt response and on their website "AI generated code has no absolutely no guarantee of quality or correctness. Human prompter must be held accountable for any mistake or inaccuracies."

Hope it wouldn't be too much bother to these important companies.

by thewebguyd5 hours ago|

[-]

See, but that would counter act all of their marketing and hurt the feelings of all the execs that desperately want to believe that software development is "solved" and in the near future they won't have to hire those expensive, pesky developers ever again.

by geodel4 hours ago|

[-]

Two trends I see at work:

1) No more human written code in projects, all code must be AI generated.

2) Developers are responsible for all code AI generated.

Combine that with fear of losing job and you have no one calling out management bullshit on their face.

by hombre_fatal2 hours ago|

[-]

I don't see how these things conflict. Nor did I get the point you were making in the sarcastic upstream comment.

It is obviously the case that you can both delegate code implementation to AI and also be responsible for it. You are signing off on the code you submit to a project no matter where you got it from nor how it was generated nor who you delegated the task to ("actually my friend wrote it so if it sucks don't look at me").

AI didn't change this, nor will it until there are no more humans in the loop.

by tadfisher1 hours ago|

[-]

They don't conflict, if the generated code is acceptable. Maybe I'm holding it wrong, or I'm not using the right combination of plugins and MCPs. But if I'm not allowed to manually correct the generated output, then I am forced into a loop of generating corrections until it's good enough to stake my job on. I hope you can see that such a policy would be ridiculous.

by rjh291 hours ago|

[-]

Gemini is also buggy as heck and has been buggy for years. For a company of Google's size with "all the power of AI" it's seriously embarrassing.

by ValentineC7 hours ago|

[-]

The "AI revolution" feels like it's creating a bunch of ultra-smart AI models are scarily good at cracking most of human-created security (Mythos), but also happen to be careless snobs that just leave litter and mess in their wake.

by mnicky8 hours ago|

[-]

If the code churn is high the investment to refactoring etc is less beneficial than may be obvious. I don't remember the details but I heard in some podcast that the code base of Claude Code changes so fast that any piece of code won't be there for long..

[-]

In other words it's an ever moving vibe fest, with random bugs and misbehaviors each time they roll the dice...

by tartoran7 hours ago|

[-]

Yes, it’s very characteristic of gen-AI era.

by 7 hours ago|

[-]

deleted

by tartoran7 hours ago|

[-]

If they respected their users they’d at least pin some versions that are more stable.

by layer84 hours ago|

[-]

The issue is that apparently AI coding means that developers stop caring about software quality. Which puts the whole purpose into question.

by fg13710 hours ago|

[-]

You are asking too many good questions.

by user4392810 hours ago|

[-]

The products generally work just fine on my MacBook.

I have not encountered major issues in either the Claude Code CLI, the Codex Desktop app, or Claude Desktop app.

They generally get the job done. I don't measure disk writes or analyze the GPU usage.

by 9 hours ago|

[-]

deleted

by Zababa9 hours ago|

[-]

A simple explanation is that they are "good enough" for most people and they have better things to do. Even if tomorrow I was 100 times as productive, I still wouldn't have time to do literally everything and I would have to prioritize.

[-]

You might not.

But the Claude Code team has ONE job.

And they have full access to a platform that they advertise as "humanity-threat" level good, and claim that it can automate everything code related...

by Zababa7 hours ago|

[-]

I think they have more than one job, they have to balance new features with improving the software itself. And Anthropic has to balance investing resources into Claude Code vs on infra or other things.

Not that I'm happy with the current state of things, in fact I'm quite sad that improvements in capacity to do things doesn't translate into better quality.

by troupo5 hours ago|

[-]

> they have to balance new features with improving the software itself.

What new features?

> And Anthropic has to balance investing resources into Claude Code vs on infra or other things.

It seems they are doing neither? Their vibe-coders boast everywhere that they no longer even work, but just endlessly prompt Claude Code in a loop. Perhaps that's why there's no polish? Perhaps that's why their spring post about Claude Code issues reads like "these are all issues that would take a junior programmer a day to test and fix before they ever reached production"? https://www.anthropic.com/engineering/april-23-postmortem

by reducesuffering3 hours ago|

[-]

Claude Code has been out for just 1 year and has millions of users already, being a major contribution to roughly $40 billion in revenue. By any stretch it is one of the most extremely fast developed products driving the most important workflow for millions of people already.

"Why isn't literally everything about a product that came out a year ago with an extremely fast scaling userbase solved?" is what I hear.

The goalposts will keep moving until AGI is undeniable.

by nicce11 hours ago|

[-]

Not only Codex, but I can't leave ChatGPT app in macOS open for few hours, because it will consume 60 gigabytes of RAM over time and crashes all the apps.

Mindboggling. Or can't use Google's AI Studio in browser because it takes 100% CPU.

Need to write own app for everything???

by nbaksalyar5 hours ago|

[-]

It's not just Google AI Studio, it's also Google proper. Just one search result page consumes gigabytes of RAM. How did this happen? I've switched to DDG and never looked back.

by veber-alex9 hours ago|

[-]

ChatGPT works ok for me but Whatsapp consumes 1000% cpu after the mac wakes up after sleep.

I swear a few years ago shit like this didn't happen on macOS.

[-]

A few years ago vibe-coded crap apps like that didn't exist on macOS.

by porridgeraisin11 hours ago|

[-]

the damn chat.openai.com webapp lags a lot as well on long chats, typing takes so long.

by lelandfe1 hours ago|

[-]

I believe this is because they don’t lazyload the document.

The entire conversation sits in the DOM.

by porridgeraisin15 minutes ago|

[-]

Yeah - is such a large dom of essentially text expected to be that slow?

For what it's worth I use a small script that essentially deletes old chat messages DOM nodes which significantly speeds it up.

by rsfern8 hours ago|

[-]

In my experience the input field lags on short chats too, sometimes in the middle of writing the second or third prompt. Are they running some kind of prospective evaluation or something?

by nicce8 hours ago|

[-]

When you are writing completely new prompt - it sends every character to server when writing and tries to make suggestions based on that.

And keeps doing it in intervals in /prepare endpoint, during each prompt.

So if you are working with something sensitive - don't write it to browser directly and edit it there.

by porridgeraisin7 hours ago|

[-]

But then why does it become dog slow when the chat becomes bigger?

by mannanj33 minutes ago|

[-]

Plot twist: the companies are unethically (blaming it on AI slop code) harvesting your GPU and CPU resources to generate tokens for their products consumers in a decentralized fashion. Not unlike those old monero or other crypto spyware miners.

by CryZe7 hours ago|

[-]

> THE SPINNER MESSAGE CAUSES 100% GPU USAGE ON AN MBP M5!!

This seems to be a common Chromium problem across tons of software. GitHub has the same issue with its spinners, VSCode as well.

by xpct11 hours ago|

[-]

Well thank you for your service. I thought about trying out Codex after the disaster that is Claude Code. I'll be fine without either one on my machine

by jofzar11 hours ago|

[-]

Imo codex is significantly better then Claude code for me ATM.

by christophilus8 hours ago|

[-]

Codex is much better, which is to say, it’s only pretty bad.

by comboy11 hours ago|

[-]

I mean, Codex CLI is really bad. But Claude's CLI is so much worse.

Welcome to the world of tomorrow!

by tonic_note2 hours ago|

[-]

They wrote an entire blog post about how Codex is entirely AI written and they militantly refuse to do anything by hand. Figures https://openai.com/index/harness-engineering/

by giancarlostoro7 hours ago|

[-]

> It's a tragedy. The world needs competition to anthropic.

I agree, though Sam Altman's company is the last option I'd want to replace Claude with. I would sooner exhaust every open model.

by markdog128 hours ago|

[-]

This software has been terrible for me. Burns tokens like crazy, and fails. Most times I try to use the browser plugin, it just says it can't use the plugin. When it does work, it takes minutes to click a button. Unusable workflow.

I ask to generate a png with an alpha channel. It can't. Instead, it outputs a chroma-keyed image, then generates a python script to remove chroma key (fails), then a js script (which also fails). Then my 5h allotment is up.

It's frustrating because if it worked as they advertise, it'd be an amazing tool.

by EMM_3868 hours ago|

[-]

Although they can technically do it, I wouldn't be asking LLMs to generate binary files like PNG with alpha channels, no matter how simple that may seem. If it's easy enough to manually create one yourself, I would do that.

The best way for LLMs to do this is likely to write a scratch program (which is what it seems to have reached for in the second half), write code (which they are good at) and have the library create the image.

At some point it is just easier to handle such things yourself, and use them with text-based formats.

by y1n02 hours ago|

[-]

> THE SPINNER MESSAGE CAUSES 100% GPU USAGE ON AN MBP M5!!

Is that m5 specific? I’m not seeing on my m4 and I use codex (desktop and cli) quite a bit.

by l33tman12 hours ago|

[-]

This was fixed long ago, if I'm thinking of the same bug. It was stuck in an inf loop all the time the codex window was open.

by cncjvu712 hours ago|

[-]

Nah it's still doing weird shit. Uninstalled that crapware last week.

by Zenul_Abidin3 hours ago|

[-]

I have been wondering why my battery dies quickly when I have codex open, even in my tray

I only noticed the CPU spike with Process Explorer also in my tray.

by 7 hours ago|

[-]

deleted

by jiggawatts57 minutes ago|

[-]

Excel would use 100% CPU if you left a cell selected and your screen saver turned on because you were idle.

That bug caused untold grief for multi-user session hosts / terminal servers / Citrix XenApp for a better part of a decade. Way before slopware!

People forget that software written by people is… even worse.

It’s like the self driving car debate: sure, the robot taxis will kill people on occasion, but people kill people reguarly.

by xenator9 hours ago|

[-]

I have exactly the same problem with Time Machine spinner on macOS. It even doesn't rotate.

Somewhere should be rare specialists with diploma who are capable of fixing such problems with waiting lists for years ahead.

by fps-hero5 hours ago|

[-]

> THE SPINNER MESSAGE CAUSES 100% GPU USAGE ON AN MBP M5!!

One conspiratorial idea I had was that this isn't a bug, and that Codex was actually doing computation on users' hardware under the guise of "thinking". Like Folding@home, or bitcoin mining malware, involuntarily on paying customers. Your usage is being subsidized by your personal compute hardware that you can't take advantage of unless it was being applied at massive scale.

This would make even more sense when you consider that thinking and response time metrics aren't publicly being tracked. There is an assumption that LLM interaction is being processed as fast as possible, but this doesn't align with the reality of fixed hardware and oversubscription. Of course throttling is occurring. So, if you can take advantage of local compute, delay the responses and you have even more access compute!

I find it difficult to believe that given the scale, number of users, and money involved, that someone hasn't fixed this "bug".

by CSMastermind5 hours ago|

[-]

Lol this was my theory as well.

by tengada18 hours ago|

[-]

I had the exact same frustration and switched to Pi and have had zero complaints

by hokkos11 hours ago|

[-]

is it closed source ? i can see the rust code in repo contrary to the JS in claude code repo, are you mixing them up ?

by nicce11 hours ago|

[-]

Codex CLI is the main Rust code. There is Codex Desktop separately, using Electron and the same Codex CLI.

by ljlolel7 hours ago|

[-]

Building an open source native swift version that doesn’t have that bug: https://github.com/Lore-Hex/Quillcode

by seviu11 hours ago|

[-]

To be fair with Codex, you can use any harness you want with it. Access is not gatekeeper by a crappy full of slop electron app.

So just move to PI, or whatever.

Claude on the contrary, forces all plan users to use their horrible app, which, if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all.

Not justifying it. But if you use the official Codex app, thats on you. If you use the official Claude app, it's because you are forced to.

Sidenote unrelated to the post: since the Fable thing, and after serious thinking, I moved to open source models. I still have the basic OpenAI sub, but then easy lifting is now done elsewhere.

[-]

>if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all.

Of all the issues, this seems like the most tame. I mean, there are single Chrome tabs that can use 300MB or even 700MB. A 2GB VM for what is likely isolated local testing of scripts and commands or local lightweight first-level inference to help guide the main harness sounds reasonable.

by thewebguyd5 hours ago|

[-]

Not being able to use my own harness on the subscription plan is my biggest gripe with Anthropic/Claude. For what I work on, I still get better results with Opus than I do with GPT5.5-codex, but damn do I hate that I either have to PAYG or I'm stuck using Claude Code.

by drdexebtjl6 hours ago|

[-]

I haven’t ever tried Cowork, and Claude Desktop shipped a 10 GB VM image on the tiny internal storage of my Macbook.

No way to remove it without hacks like creating an empty, read-only file in its place.

Having this slop installed and automatically updating is a liability.

by NamlchakKhandro8 hours ago|

[-]

Pi mono is the only true harness. Everything else is crap

by Supermancho6 hours ago|

[-]

If Pi can't use my MCPs, it's too big a step backward. Is the common tooling: https://github.com/nicobailon/pi-mcp-adapter ?

by epistasis5 hours ago|

[-]

I imagine the answer varies greatly but what use cases do people find for MCP over standard command line calls? The only time I use MCP is when I'm supposed to test that an MCP is working. Everything else is just incredibly clunky and ugly. For example, connecting to Linear to grab a ticket is far easier and cleaner to copy and paste the text rather than to have the agent call the MCP and look it up by ticket name.

I'm hoping I can find somebody else's MCP that could actually help me for once!

by Supermancho3 hours ago|

[-]

I use a Godot MCP in addition to my Godot IDE and the Figma MCP

The Godot Engine API (100s of calls) is not worth memorizing and Figma is visual based, along with a complicated engine-specific serialization.

Much easier to ask an LLM to find out why one thing affects another or how to optimally generate a specific change by exploring those APIs. When the LLM eventually fails, to explain the available functions and triage a solution.

by jorl178 hours ago|

[-]

Claude code (desktop) and Codex (desktop) are both absolutely dogshit pieces of software. I can't pick which one is worse. I'd be sort of ashamed to say I actively worked on them, regardless of how they can empower people. Cursor's new UI is similarly terrible. They're all slowly getting better, but too slow for my taste.

They are incredibly slow in unpredictable ways, eat up memory at an insane rate, and just feel like they were built with no regards for UX. Like they crammed together all the engineers with no idea of how to build a coherent and predictable UI and let them loose on the product without proper designers.

The other day Codex (desktop) was eating up 70GB of RAM on my machine. What had I done? Literally nothing. I opened it and let it update once.

Another one with Codex was when I had a specific conversation where no activity was happening and which would make the app spin up all of my CPU cores, rendering it barely usable. It would take seconds to react to anything or update the UI. The conversation wasn't even in focus!!! Restarting the app wouldn't help. After I archived it, it suddenly got better

Claude Code Desktop used to be so, soo, soo slow and eat up so much RAM. It was unusable for anything other than playing around when I first tried it. It also didn't communicate any of what it would do. Using it was like living in a world with no affordances, constantly afraid of interacting with them and being faced with some sort of destructive action. Still, it has definitely been improving in terms of the UI experience.

Cursor's new agents mode suffers from similar issues. Obscenely slow, hogging CPU without anything going on, breaking with existing UX patterns (some of them already well implemented in their other, more polished, previous version), confusing buttons and labels which don't explain what to do and that sometimes do destructive operations on your code.

My favorite cursor absurdity is that if you use their workflow to create a worktree and the worktree setup script fails, the following happens:

1. The agent has no idea that it failed, let alone have any logs of the failure

2. Often you yourself don't get access to the logs of what failed in that script. Don't ask me, half the time it just says it failed with no further logs.

3. When you do get the logs, you cannot copy them in ANY way. You can't even select them. I have had to resort to taking a screenshot to do OCR on it

I've also had cursor repeatedly have concurrency/race condition bugs when creating multiple worktrees in parallel. I have 5 tasks, I spin them up all together so they can create 5 worktrees and they crash with random internal cursor errors. Wasn't the point of this abhorrent new UI you've stuffed me with to enable parallelism?

It's like people aren't even testing the shit they ship. Which I guess they aren't.

I'm a big believer in AI and think it is changing the world and will continue to do so, but I almost get offended at how bad these products for which I am paying (sometimes quite a lot!) are. There's "move fast and break stuff" and then there's "build crap to call stuff".

by ljlolel7 hours ago|

[-]

That’s why I’m building an open source native Swift version: https://github.com/Lore-Hex/Quillcode

by iknowstuff7 hours ago|

[-]

I’ve been using Codex and Claude in Zed via ACP. Some bugs but overall very pleasant experience vs anything Cursor.

by stellamariesays7 hours ago|