upvote
It's a nice idea, but how do you know the agent is aligned with what it thinks the intent is?
reply
or moreso, what happens at compact boundaries where the agent completely forgets the intent
reply