"The cold hard fact is: LLMs are an unreliable tool, and using them without checking their every action is extremely foolish."

You mean checking every action they take outside the sandbox, I suppose? Otherwise I would consider any attempt at letting an agent do some work foolish.

The AI company has skin in the game, which motivates it to produce reliable AIs.
Can you actually sue Anthropic over this when they clearly state that AI can make mistakes and you should double-check everything it does?
You can fire Anthropic. Anthropic can decide it's losing too many customers and do something about it.
Doesn't seem to be working though. :(