upvote
+1 to the CI/isolation point. That is the part that makes these setups work for me too: make the failure cheap to reproduce, make stderr visible, make the agent rerun the same command after the patch. A lot of bad agent behavior is really just "it never got a clean signal".

The part that still bites me is across sessions. A tight loop fixes this run, but next week the agent can walk into the same rake again: same wrong import path, same misuse of an internal API, same CI-only dependency issue. After patching the same class of failure a few times, I started writing those down outside the chat context so the next run sees the failure pattern before it guesses.

reply
> with screenshots broadly showing

Why is it always so un-specific with you AI-boosting bunch, whenever you get pressed for concrete results? Suddenly it's not so magical any more, but merely screenshots showing "broadly" the progress, or it's the Nth version of a note-taking app, or something you merely did for a demo presentation. But nothing ever of use with you folks.

reply
you said:

> it picked up the general path immediately

I said:

> Or they have spent a lot more time and effort on it than they claim.

You said:

> You imply I'm merely "pointing CC at godot and it made a game"; I never said it was simple

Well. I dont care enough to argue with you, but Im not the one being contrary here.

Readers can google “claude with godot” for a guide on setting it up and decide if that counts as picking it up immediately or not, and if what you said is honest, or hype.

What I said is not that I dont believe youre using claude; but that I roll my eyes at the unbounded enthusiasm for using AI agents with the magical pretence that its easy and productive straight away.

Its not.

Your post gave the impression that it is.

That makes me roll my eyes.

> But I had already answered, before your comment, with screenshots

> Of course these are basic placeholders for a few hours of work

Lord, spare me. You spent a few hours vibing and came to the conclusion that everything is golden?

…and yet you have a:

> I do have a careful setup involving CI and isolation.

So what, you spent more time on your setup than actually coding before posting?

/shakes-head

Whatever man.

Have fun. I stand by what I posted before.

reply