upvote
deleted
reply
Am I the only one who wasn’t particularly impressed by AutoResearch? If you looked at what the agent was actually doing, it was just tuning parameters mostly, not really trying different novel approaches.

I couldn’t help myself but consider this mostly a very inefficient variant of hyperparameter optimization, but someone correct me if I’m wrong, I may be looking at this too pessimistic.

reply
Ever since AlphaEvolve - the idea that if you build a harness which can evaluate solutions and give LLMs a database where they can keep storing their work and then sample from it - they do find non-trivial solutions over time leaning from their own past ideas.

It is the ultimate manifestation of test-time scaling. I think karpathy just popularised it.

reply
I didn't dig into what the actual repository was doing, but personally, I took some inspiration from the idea after reading about it and realizing that I might have been underestimating the ability of LLMs. I put a bit more work into a performance harness I was using locally and just set some agents to brainstorming and they did seem to find some great stuff. So I don't really have a stance one way or another on this specific repo, but the general idea seems like a really good one.
reply
Karpathy embedded within an organization is way more impressive than him out on his own with hot takes and little projects. I hope he does great things for Anthropic.
reply
Absolutely, I wasn’t saying that him being at Anthropic wasn’t going to be effective, I just think his little projects wouldn’t be very interesting if his name wasn’t attached to them.
reply

    > Am I the only one who wasn’t particularly impressed by AutoResearch?
isn't it just a nerfed AlphaEvolve? https://arxiv.org/abs/2506.13131
reply
Inefficient variants with $100m+ worth of compute will still probably outperform the best team of researchers
reply
I guess we must expect it at this point. But funny that has model written tokens like ’ instead of '
reply
More like he'll blog and tweet about using Claude and get gullible software engineers to buy Claude subscriptions and work on their own obsolescence while paying for it.

Many people are still deluded and think he is the same person who wrote the informal AI tutorials in plain html. He isn't, he is selling stuff now.

reply
I'm as jaded as can be but I think Anthropic is now beyond the point where they'd place much value on farming Karpathy's name recognition. I'm sure they considered it an extra plus in his hiring package but they wouldn't do the level of comp package he'd want if they didn't believe the odds were decent that he'll contribute serious value.

Sure, it can always not work out but that's no more a risk with him than any high-profile hire who doesn't really need the money and will always have other options.

reply
What is he selling? How is this time different compared to when he was at OpenAI or at Tesla? You could say he was shilling those products too. I don't see any shift. He's still posted free in depth YouTube videos recently.
reply
deleted
reply
> What is he selling?

Is that a serious question? He already promoted vibe coding and AI hype. Now he is literally there to promote Anthropic and its IPO price.

When he was at OpenAI it wasn't overtly commercial yet. At Tesla he had a way lower profile. Now he is the vibe coding Jesus for deluded software engineers. The impact is much larger.

reply
> At Tesla he had a way lower profile.

?

He was literally rolled out in front of camera as Tesla's AI prodigy at multiple streamed events designed to appeal to techy consumers and dev recruitment. He's definitely been one of AI's public personas for a long time now, and his employers have regularly aided/directed/utilized him accordingly.

reply
I think he's just genuinely excited about the capabilities.

(I do understand that for Anthropic it's a brand boost as well, just like signing other prominent researchers, as it was with LeCun and Meta etc).

reply