upvote
> it cheats at verification. Even with specific instructions how to verify, it still cheats.

As I responded to another commenter, as a prediction engine, the LLM is trying to predict what you want. It, at one level, correctly predicts that you want tests to pass.

Maybe try telling the LLM that you're a verification engineer, and you get bonuses for finding bugs?

Think about it. All those security researchers wouldn't be finding real bugs in real programs using LLMs if this were an insurmountable problem.

reply