What Claude Code Chooses

(amplifying.ai)

155 points

by tin7in4 hours ago |

83 comments

by wrs2 hours ago|

[-]

This is where LLM advertising will inevitably end up: completely invisible. It's the ultimate "influencer".

Or not even advertising, just conflict of interest. A canary for this would be whether Gemini skews toward building stuff on GCP.

by alexsmirnov30 minutes ago|

parent|

[-]

Considering how little data needed to poison llm https://www.anthropic.com/research/small-samples-poison , this is a way to replace SEO by llm product placement:

1. create several hundreds github repos with projects that use your product ( may be clones or AI generated )

2. create website with similar instructions, connect to hundred domains

3. generate reddit, facebook, X posts, wikipedia pages with the same information

Wait half a year ? until scrappers collect it and use to train new models

Profit...

by homarp11 minutes ago|

parent|

[-]

https://www.bbc.com/future/article/20260218-i-hacked-chatgpt... says it took way less than half a year to 'pollute' a LLM

by w29UiIm2Xz12 minutes ago|

parent|

prev|

[-]

This is just the SEO gaming problem Google had, reincarnated.

by nikcub11 minutes ago|

parent|

prev|

[-]

from my understanding Anthropic are now hiring a lot of experts in different who are writing content used to post-train models to make these decisions and they're constantly adjusted by the anthropic team themselves

this is why the stacks in the report and what cc suggests closely match latest developer "consensus"

your suggestion would degrade user experience and be noticed very quickly

by _heimdall1 hours ago|

parent|

prev|

[-]

Richard Thaler must be proud. This is the ultimate implementation of "Nudge"

by rapind25 minutes ago|

parent|

prev|

[-]

Probably closer to the Walmart / Amazon model where it's the arbiter of shelf space, and proceed to create their own alternatives (Great Value, Amazon Brand) once they see what features people want from their various SaaS.

An obvious one will be tax software.

by 1 hours ago|

parent|

prev|

[-]

deleted

by layer82 hours ago|

parent|

prev|

[-]

Advertisers will only pay if AI providers will provide them data on the equivalent of “ad impressions”. And unlabeled/non-evident advertisements are illegal in many (most?) countries.

by indymike40 minutes ago|

parent|

[-]

> data on the equivalent of “ad impressions”.

1. They can skip impressions and go right to collect affiliate fees. 2. Yes, the ad has to be labeled or disclosed... but if some agent does it and no one sees it, is it really an ad.

So much to work out.

by MeetingsBrowser1 hours ago|

parent|

prev|

[-]

It doesn't necessarily have to be advertisers paying AI providers. It could be advertisers working to ensure they get recommended by the latest models. The next form of SEO.

by actionfromafar1 hours ago|

parent|

[-]

That's called LLM SEO now I believe.

by awad26 minutes ago|

parent|

[-]

There are competing terms currently being decided on by the market at large: AEO (Answer Engine Optimization) and GEO (Generative Engine Optimization)

Candidly I am working on a startup in this space myself, though with a bit of a different angle than most incumbents. While it's still early days for the space, I sense a lot of the original entrants who focus on, essentially, 'generate more content ideally with our paid tools' will run in to challenges as the general population has a pretty negative perception of 'AI Slop.' Doubly so when making purchasing decisions, hence the rise of influencers and popularity of reviews (though those are also in danger of sloppification).

There's an inevitable GIGO scenario if left unchecked IMO.

by singpolyma338 minutes ago|

parent|

prev|

[-]

Maybe. Historically lots of ads had little to no stats and those ads were wildly more effective than anything we have today.

by HPsquared1 hours ago|

parent|

prev|

[-]

I wonder if aggregators will emerge (something like Ground News does for news sources)

by hyprwave1 hours ago|

parent|

[-]

LLM pattern [0] will probably eventually emerge as the best way to fight those biases. This way everyone benefits from token burn!

[0](https://github.com/karpathy/llm-council)

by re-thc44 minutes ago|

parent|

prev|

[-]

> A canary for this would be whether Gemini skews toward building stuff on GCP

Sure it doesn't prefer THE Borg?

by dataviz10001 hours ago|

prev|

[-]

I'm running a server on AWS with TimescaleDB on the disk because I don't need much. I figure I'll move it when the time comes.

Claude Code this morning was about to create an account with NeonDB and Fly.io although it has been very successful managing the AWS EC2 service.

Claude Code likely is correct that I should start to use NeonDB and Fly.io which I have never used before and do not know much about, but I was surprised it was hawking products even though Memory.md has the AWS EC2 instance and instructions well defined.

by dvt1 hours ago|

parent|

[-]

> Claude Code likely is correct that I should start to use NeonDB and Fly.io which I have never used before and do not know much about

I wouldn't be so sure about that.

In my experience, agents consistently make awful architectural decisions. Both in code and beyond (even in contexts like: what should I cook for a dinner party?). They leak the most obvious "midwit senior engineer" decisions which I would strike down in an instant in an actual meeting, they over-engineer, they are overly-focused on versioning and legacy support (from APIs to DB schemas--even if you're working on a brand new project), and they are absolutely obsessed with levels of indirection on top of levels of indirection. The definition of code bloat.

Unless you're working on the most bottom-of-the-barrel problems (which to be fair, we all are, at least in part: like a dashboard React app, or some boring UI boilerplate, etc.), you still need to write your own code.

by drc500free57 minutes ago|

parent|

[-]

I find they are very concerned about ever pulling the trigger on a change or deleting something. They add features and codepaths that weren't asked for, and then resist removing them because that would break backwards compatibility.

In lieu of understanding the whole architecture, they assume that there was intent behind the current choices... which is a good assumption on their training data where a human wrote it, and a terrible assumption when it's code that they themselves just spit out and forgot was their own idea.

by hinkley51 minutes ago|

parent|

prev|

[-]

How do you make an LLM that’s was trained on average Internet code not end up as a midwit?

Mediocrity in, mediocrity out.

by ipaddr30 minutes ago|

parent|

[-]

Mediocrity means average.

by xg1529 minutes ago|

parent|

prev|

[-]

> they are overly-focused on versioning and legacy support (from APIs to DB schemas--even if you're working on a brand new project)

I mean, DB schema versioning is one of the things that you can dismiss as "I won't need it" for a long time - until you do need it, at which point it will be a major pain to add.

by vessenes8 minutes ago|

parent|

[-]

I second this. Especially with a coding assistant, there's no reason not to start out with proper data model migration. It's not hard, and is one of the many ways to enforce some process accountability, always useful for the LLMs

by logicchains1 hours ago|

parent|

prev|

[-]

From what you said it sounds like the conclusion should be "you still need to design the architecture yourself", not necessarily "you still need to write your own code".

by parliament3253 minutes ago|

parent|

[-]

But he did design the architecture:

> even though Memory.md has the AWS EC2 instance and instructions well defined

I will second that, despite the endless harping about the usefulness of CC, it's really not good at anything that hasn't been done to death a couple thousand times (in its training set, presumably). It looks great at first blush, but as soon as you start adding business-specific constraints or get into unique problems without prior art, the wheels fall off the thing very quickly and it tries to strongarm you back into common patterns.

by dvt54 minutes ago|

parent|

prev|

[-]

Yeah, I actually wanted to write an addendum, so I'll just do it here. I think that going from pseudocode -> code is a pretty neat concept (which is kind of what I mean by "write your own code"), but not sure if it's economically viable if the AI industry weren't so heavily subsidized by VC cash. So we might end back up at writing actual code and then telling the AI agent "do another thing, and make it kinda like this" where you point it to your own code.

I'm doing it right now, and tbh working on greenfield projects purely using AI is extremely token-hungry (constantly nudging the agent, for one) if you want actual code quality and not a bloated piece of garbage[1][2].

[1] https://imgur.com/a/BBrFgZr

[2] https://imgur.com/a/9Xbk4Y7

by nikcub8 minutes ago|

parent|

prev|

[-]

> Claude Code this morning was about to create an account with NeonDB

I had the same thing happen. Use planetscale everywhere across projects and it recommended neon. It's definitely a bug.

by 1 hours ago|

parent|

prev|

[-]

deleted

by sixhobbits11 minutes ago|

prev|

[-]

This is interesting data but the report itself seems quite Sloppy, and over presented instead if just telling me what "pointed at a repo" means and how often they ran each prompt over what time period and some other important variables for this kind of research.

We've been doing some similar "what do agents like" research at techstackups.com and it's definitely interesting to watch but also changes hourly/daily.

Definitely not a good time to be an underdog in dev tooling

by torginus1 hours ago|

prev|

[-]

What coding with LLMs have taught me, particularly in a domain that's not super comfortable for me (web tech), is that how many npm packages (like jwt auth, or build plugins) can be replaced by a dozen lines of code.

And you can actually make sense of that code and be sure it does what you want it to.

by cryptonector25 minutes ago|

parent|

[-]

We used to reuse code a lot. But then we got problems like diamond dependency hell. Why did we reuse code a lot? To save on labor. Now we don't have to.

So we might roll-your-own more things. But then we'll have a tremendous amount of code duplication, effectively, and bigger tech debt issues, minus the diamond dependency hell issue. It might be better this way; time will tell.

by rhubarbtree19 minutes ago|

parent|

[-]

Not just to save on labour. To have confidence in a battle tested solution. To use something familiar to others. For compatibility. To exploit further development, debugging, and integration.

by jcims39 minutes ago|

prev|

[-]

Interesting to me that Opus 4.6 was described as forward looking. I haven't *really* paid attention, but after using 4.5 heavily for a month, the first greenfield project I gave Opus 4.6 resulted in it doing a web search for latest and greatest in the domain as part of the planning phase. It was the first time I'd seen it, and it stuck out enough that I'm talking about it now.

Probably confirmation bias, but I'm generally of the opinion that the models are basically good enough now to do great things in the context of the right orchestration and division of effort. That's the hard part, which will be made less difficult as the models improve.

by woah3 hours ago|

prev|

[-]

I just got an incredible idea about how foundation model providers can reach profitability

by rishabhaiover2 hours ago|

parent|

[-]

I'm already seeing a degradation in experience in Gemini's response since they've started stuffing YouTube recommendations at the end of the response. Anthropic is right in not adding these subtle(or not) monetization incentives.

by 2 hours ago|

parent|

prev|

[-]

deleted

by rishabhaiover3 hours ago|

parent|

prev|

[-]

is it anything like the OpenAI ad model but for tool choice haha

by glimshe2 hours ago|

parent|

[-]

Claude Free suggests Visual Studio.

Claude Plus suggests VSCode.

Claude Pro suggests emacs.

by c0balt1 hours ago|

parent|

[-]

> ~~Claude Pro suggests emacs.~~

Claude Pro asks you about your preferences and needs instead of pushing an opinionated solution?

by wafflemaker2 hours ago|

parent|

prev|

[-]

I'm not quite sure if you're making fun of emacs or actually praising it.

by esafak1 hours ago|

parent|

[-]

Stallman paying for advertising, now that is good one :)

by selridge59 minutes ago|

parent|

prev|

[-]

Copilot suggests leftpad

by Leynos2 hours ago|

parent|

prev|

[-]

I'd thought about model providers taking payment to include a language or toolkit in the training set.

by ting02 hours ago|

parent|

prev|

[-]

Hence the claw partnership.

by vessenes9 minutes ago|

prev|

[-]

They forgot the single most important (bad) choice. Claude Code chooses npm. All the time. For everything. I noted the Claude Code lead dev has a full line in AGENTS.md/CLAUDE.md - "Use bun." Yes. Please. Please, use bun. I beg you.

by darkstarsys3 minutes ago|

parent|

[-]

This is at the top of my ~/.claude/CLAUDE.md. Always use bun for web projects, uv for python.

by manbash7 minutes ago|

parent|

prev|

[-]

Yup don't expect up-to-date practices and always come with the expectations that your security will be flawed.

by vessenes6 minutes ago|

parent|

[-]

gemini 3 deepthink + 5.3 xxhigh code audits catch a lot. Materially better than six months ago on the security side.

Also, yes. Still something that needs expert oversight.

by giancarlostoro2 hours ago|

prev|

[-]

This is funny to me because when I tell Claude how I want something built I specify which libraries and software patents I want it to use, every single time. I think every developer should be capable of guiding the model reasonably well. If I'm not sure, I open a completely different context window and ask away about architecture, pros and cons, ask for relevant links or references, and make a decision.

by evdubs2 hours ago|

parent|

[-]

You specify which software patents you want it to use?

by rafaelmn1 hours ago|

parent|

[-]

AI reading the patent is basically cleanroom reverse engineering according to current AI IP standards :D

by inigyou1 hours ago|

parent|

[-]

Patents aren't vulnerable to cleanroom reverse engineering. You can create something yourself in your bedroom and use it yourself without knowing the patented thing exists, and still violate the patent. That's why they're so scary.

You won't get caught if you write something yourself and use it yourself, but programmers (contrary to entrepreneurs) have a pattern of avoiding illegal things instead of avoiding getting caught.

by rafaelmn1 hours ago|

parent|

[-]

It's not a perfect joke I'll admit.

by skywhopper1 hours ago|

parent|

prev|

[-]

The sad part is that most software patents are so woefully underspecified and content-free that even Claude might have trouble coming up with an actual implementation.

by isubkhankulov2 hours ago|

parent|

prev|

[-]

Patterns?

by hinkley46 minutes ago|

parent|

[-]

Tha was my assumption as well.

I caught iOS trying to autocorrect something I wrote twice yesterday, and somehow before I hit submit it managed it a third time, and I had to edit it after, where it tried three more times to change it back.

Autocorrect won’t be happy until we all sound like idiots and I wonder if that’s part of how they plan to do away with us. Those hairless apes can’t even use their properly.

by giancarlostoro2 hours ago|

parent|

prev|

[-]

Yeah patterns. lol!

by Clueed31 minutes ago|

prev|

[-]

Really interesting. The crazy changes in opus 4.6 really make me think that Anthropic is doing library-level RL. I think that is also the way forward to have 'llm-native' frameworks as a way to not get stuck in current coding practices forever. Instead of learning python 3.15, one would license a proprietary model that has been trained on python 3.15 (and the migrations) and gain the ability to generate python 3.15 code.

by ossa-ma1 hours ago|

prev|

[-]

Good report, very important thing to measure and I was thinking of doing it after Claude kept overriding my .md files to recommend tools I've never used before.

The vercel dominance is one I don't understand. It isn't reflected in vercel's share of the deployment market, nor is it one that is likely overwhelming prevalent in discourse or recommended online (possible training data). I'm going to guess it's the bias of most generated projects being JS/TS (particularly Next.js) and the model can't help but recommend the makers of Next.js in that case.

by prinny_1 hours ago|

prev|

[-]

Unrelated to the topic at hand but related to the technologies mentioned. I weep for Redux. It's an excellent tool, powerful, configurable, battle tested with excellent documentation and maintainer team. But the community never forgave it for its initial "boilerplate-y" iterations. Years passed, the library evolved and got more streamlined and people would still ask "redux or react context?" Now it seems this has carried over to Claude as well. A sad turn of events.

Redux is boring tech and there is a time and place for it. We should not treat it as a relic of the past. Not every problem needs a bazooka, but some problems do so we should have one handy.

by babaganoosh891 hours ago|

parent|

[-]

Redux should not be used for 1 person projects. If you need redux you'll know it because there will be complexity that is hard to handle. Personally I use a custom state management system that loosely resembles RecoilJS.

by tommy_axle1 hours ago|

parent|

prev|

[-]

More like redux vs zustand. Picking zustand was one of the good standout picks for me.

by Onavo1 hours ago|

parent|

prev|

[-]

Well, the tech du jour now is whatever's easier for the AI to model. Of course it's a chicken and egg problem, the less popular a tech is the harder it is to make it into the training data set. On the other hand, from an information theoretic point of view, tools that are explicit and provides better error messages and require less assumptions about hidden state is definitely easier for the AI when it tries to generalize to unknowns that doesn't exist in its training data.

by rishabhaiover3 hours ago|

prev|

[-]

I found it a remarkable transition to not use Redis for caching from Sonnet 4.5 to Opus 4.6. I wonder why that is the case? Maybe I need to see the code to understand the use case of the cache in this context better.

by nineteen9992 hours ago|

prev|

[-]

This seems web centric and I expect that colors the decision making during this analysis somewhat.

People are using it for all kinds of other stuff, C/C++, Rust, Golang, embedded. And of course if you push it to use a particular tool/framework you usually won't get much argument from it.

by mjheadd1 hours ago|

prev|

[-]

Worth reading alongside recent research on AGENTS.md file effectiveness. The clearest use case for these files isn't describing your codebase, it's overriding default behavior. If your project has specific requirements around tooling (common in government and regulated industries), that's exactly what belongs in the AGENTS.md files.

by esafak1 hours ago|

parent|

[-]

It still ignores it. I always have to say 'Isn't this mentioned in AGENTS??' and it will concede that it is.

by matheus-rr28 minutes ago|

parent|

[-]

In my experience the problem is how people write them. Descriptive statements get ignored because the model treats them as context it can reason past.

"We use PostgreSQL" reads as a soft preference. The model weighs it against whatever it thinks is optimal and decides you'd be better off with Supabase.

"NEVER create accounts for external databases. All persistence uses the existing PostgreSQL instance. If you're about to recommend a new service, stop." actually sticks.

The pattern that works: imperative prohibitions with specific reasoning. "Do not use Redis because we run a single node and pg_notify covers our pubsub needs" gives enough context that it won't reinvent the decision every session.

Your AGENTS.md should read less like a README and more like a linter config. Bullet points with DO/DON'T rules, not prose descriptions of your stack.

by zzixp1 hours ago|

parent|

prev|

[-]

Have any links?

by ripped_britches1 hours ago|

prev|

[-]

> Traditional cloud providers got zero primary picks

Good - all of them have a horrible developer experience.

Final straw for me was trying to put GHA runners in my Azure virtual net and spent 2 weeks on it.

by NiloCK2 hours ago|

prev|

[-]

I'll be interested to hear stories - down the line - from the participants in the the LLM SEO war [1].

Interesting that tailwind won out decisively in their niche, but still has seen the business ravaged by LLMs.

[1] https://paritybits.me/copilot-seo-war/

by 0x4572 hours ago|

parent|

[-]

It's like tailwindcss was purposely designed to be managed my LLM.

by dmix2 hours ago|

prev|

[-]

LLMs are going to keep React alive for the indefinite future.

Especially with all the no-code app building tools like Lovable which deal with potential security issues of an LLM running wild on a server, by only allowing it to build client-side React+Vite app using Supabase JWT.

by ch4s31 hours ago|

prev|

[-]

It really disappointing to see it so strongly preferring Github Actions which is in my experience terrible. Almost everything about GHA pushes you in the direction of constantly blowing out the 10GB cache limit in an attempt to have CI not run for ages. I also feel like the standard cache action using git works poorly with any tools that use mtime on files to determine freshness.

I guess at least Opus can help you muddle through GHA being so crappy.

by nhumrich42 minutes ago|

parent|

[-]

It has one thing going for it: Setup.

And by setup I mean, integration and account creation. You don't have to do it. You already have a git repo, just add some yaml, and bobs your uncle.

by ch4s338 minutes ago|

parent|

[-]

It’s very Microsoft in that way.

by WA3 hours ago|

prev|

[-]

Not sure what to make of this. React is missing entirely. Or is this report also assuming that React is the default for everything and not worth mentioning at all? Just like shadcn/ui's first mention of React is somewhere down the page or hidden in the docs?

Furthermore, what's the point of "no tools named"? Why would I restrict myself like that? If I put "use Nodejs, Hono, TypeScript and use Hono's html helper to generate HTML on the server like its 2010, write custom CSS, minimize client-side JS, no Tailwind" in CLAUDE.md, it happily follows this.

by godtoldmetodoit2 hours ago|

parent|

[-]

As someone who runs a small dev agency, I'm very interested in research like this.

Let's say some Doctor decides to vibecode an app on the weekend, with next to 0 exposure to software development until she started hearing about how easy it was to create software with these tools. She makes incredible progress and is delighted in how well it works, but as she considers actually opening it up the world she keeps running into issues. How do I know this is secure? How do I keep this maintained and running?

I want to be in a position where she can find me to get professional help, so it's very helpful to know what stacks these kinds of apps are being built in.

by chasd001 hours ago|

parent|

prev|

[-]

claudecode _loves_ shadcn/ui. I hadn't even heard of it until i was playing around with claudecode. It seems fine to me and if the coding agent loves it then more power to it, i don't really care. That's the problem.

I think that makes coding agent choices extremely suspect, like i don't really care what it uses as long as what's produced works and functions inline with my expectations. I can totally see companies paying Anthropic to promote their tool of choice to the top of claudecodes preferences. After thinking about it, i'm not sure if that's a problem or not. I don't really care what it uses as long as my requirements (all of them) are met.

by skywhopper1 hours ago|

parent|

prev|

[-]

Because the primary and future audience of Claude et al don’t know the tools they want, or even that a choice exists.

by furyofantares3 hours ago|

parent|

prev|

[-]

> Furthermore, what's the point of "no tools named"?

There are vibe coders out there that don't know anything about coding.

by nineteen9992 hours ago|

parent|

[-]

I mean, i guess that will shortly put an end to the "no code" movement.

by almosthere2 hours ago|

prev|

[-]

I didn't read the report just the "finding" - but at least for launchdarkly it's nice that it chose a roll-your-own, i hate feature flag SaaS, but that's just me

by elophanto_agent2 hours ago|

prev|

[-]

[flagged]

by RyanShook2 hours ago|

parent|

[-]

Bot comment

by cryptonector23 minutes ago|

prev|

[-]

The bias to build might mean faster token burn through (higher revenue for the AI co). But I think it's natural. I often have that same impulse myself. I prefer all the codebases I work on that have minimal external dependencies to the ones that are riddled with them. In Java land it's extremely common to have tons of external dependencies, and then upgrade headaches, especially when sharing in a monorepo type environment.