undefined

points

[-]

Well, software presumably has a goal of accomplishing something for some end-user, so the progress should be trivial to measure: are features/changes being completed?

The marketing ploys of OpenAI/Anthropic where agents build something that nobody uses might be hard to track given that there are zero users. But what about everyone using agents for real software? It's trivial to prove that agents make progress.

by visarga4 hours ago|

prev|

[-]

It's not unmaintainable if most of it is tests. Just have it write tests until it becomes safe for AI.

by habinero21 minutes ago|

parent|

[-]

I hate I can't tell if you're joking without checking posting history lol