Can you cite specifics? "I won't speak bad about someone, but also won't speak good about others" resulted in a comment that seems to contribute nothing
I’m hoping Karpathy will make Claude Code better, in the meantime I’m super happy seeing a small product manager like Tibo fucking crushing it on Codex
My point is that product velocity is visible in shipped workflow improvements, not prestige hires
Prestige is fickle, look at academia today
Joking aside, there are small communities pushing codex and AI to the bleeding edge of what's possible.
Here I'll give you an example. The last few updates from Boris at CC have been tweaks to the system prompt to make it use less compute, effectively making the system dumber, making it tell you to go to bed. I mean come on! Tibo has been impressing me, bc they're building the things these small communities are building.
One of the things these bleeding edge guys and girls have been working on is a /goal feature, essentially ralph loops. Codex released it as a feature the other day. I can't help but be impressed. As an ex-pm, this is product management.
Then you take a look at what the Chinese are doing on their own forums, and it just makes what Google and Anthropic are doing look outdated. OpenAI feels competitive, which I like. What's coming will not be kind to us, we adapt or we die.
I am sure there is an element of reality in it's capabilities, but there's also a significant amount of "We don't have the compute to handle this at scale", and "look look, we have the best model. It's so good that you can't even compare it to other models. That is how good we are."
The Claude maximalists that can never see any wrong in anything and the users that care about actual capability
These guys are going to be in for a rude awakening when the Chinese are steamrolling us with data centers you can see from space and better models, Amodei will tell you that himself
Adapt or die
What codex is a few steps away from doing is changing fundamentally a lot of workflows.
Remote codex with their computer use is basically you at your computer doing things, 24/7.
Then they added gpt images 2.0
what codex can do, in a few more product iterations, is show you visually side by side “would you prefer this (A) or that (B)” in a series of questions. This is what some open source researchers have been up to. That’s no longer guessing.
I’m not trying to hype a company i have no stake in, but they’ve been killing it.
It’s extremely compute intensive, but also very satisfying.
Example 1, just from top of my mind, Composer 2.5 released today. Go look at their benchmark.
Composer 2.5 and Opus 4.7 ranked around the same, meanwhile gpt-5.5 was miles ahead.
You wouldn’t have caught me dead using a gpt model 2 years ago
They are all going to get their lunch eaten by the Chinese.
In the USA with access to most of the world's capital, they've succumbed to the temptation of "bigger, faster, harder"
Whilst the Chinese, with enough capital only, have had to think.
The Chinese models are already miles ahead on cost/inference basis and will probably pass all the USAnian companies in five years
The age of UASnian engineering dominance are coming to an end.
Let's all hope she goes quietly - not at the moment