A typical session is the agent establishing a metrics and log baseline, creating the code, compiling, deploying, observing, fixing, redeploying, observing metrics, determining the outcome and commiting.
I really, really, don't look at the code anymore.
UPDATE:
so my point is: it won't have my stewarding the code anymore, but it will have the infrastructure (and ultimately the real world) providing feedback on the traces.
Maybe we need some form of long-term training. How long does the code that the AI wrote stick around before being rewritten.
I guess we can do this retroactively too if we could somehow tag AI-written lines of code in the VCS, then in a couple years we can check which parts lasted.
sorry. how do you know. i am so curious about where exactly gains are coming from but so hard to even get a little bit of insight.
i wish govt would fund these labs and make it free and opensource. way better investment than stupid overseas wars.
It would be impossible for the govt to allocate this much capital towards such a moonshot, and even if they could, they would do it in a way that would get 90% frittered away to fraud and waste
https://www.whitehouse.gov/presidential-actions/2025/11/laun...
Is this supposed to be impressive? Five years for the equivalent of, what, Colossus 1? What a joke
Doctrine and propaganda can make someone that sure, and the thing they're sure about doesn't even have to be true.
> There's been massively successful government funded and run projects before. Soviets beat the Americans to space, after all.
Don't let facts get in the way of ideology!
Also the Americans subsequently beating the Soviets to the moon was the government literally allocating huge amounts of capital towards the literal trope-namer moonshot.
You have a false definition of "impossible." It would be true to say it could be challenging, given current political dysfunction, but it's not impossible.
> ...and even if they could, they would do it in a way that would get 90% frittered away to fraud and waste
Same with private business.
I'd prefer government funding, because there a greater number of important goals than the two or three the market is capable of optimizing for.
Oh, how funny.