But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db?
reply
> But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db?

That's a fantastic question. Here's my take: https://news.ycombinator.com/item?id=47917314 - would love your thoughts on it.

In short, I think you're asking a billion-dollar question: how do we solve the verification, validation, and QA bottleneck?

The way I handle it for my personal projects is I invest tremendous time and effort into writing thorough test and validation suites.

I bet the next billion-dollar companies will be the ones addressing this verification, validation, and QA bottleneck.

reply
A company can operate aimlessly for a long time, carried along by inertia and/or a monopoly position. So chances are nobody (competent) is reviewing it.
reply
Have the agents review their own output, obviously. What could go wrong?
reply