I did tell Fiu initially to reply to some emails as a test, but it was too expensive to maintain.
Having the agent reply would have been more fun and a better excercise, but too expensive.
Customer service software regularly uses AI responses for email. Is the issue that your agent using the claw for more than needed (like it's clicking send rather than just accessing an API?)
It's helpful with the actual technical changes needed, it just has no concept of what they translate to in the real world.
Btw my company is spending > $100/day in relatively cheap Gemini tokens for this work. It's easy to see why one might want to be cautious about exposing a token-burning service to the internet.
This is like saying "try to hack my computer and steal my crypto wallet" but your computer can't send any packets
Think about it man, your test proved nothing. All it showed is that people who know nothing about jailbreaking, and tried casually, couldn't jailbreak Opus.
Do you think NSA or Mossad was trying to jailbreak your OpenClaw?
Why would any actually "serious" hacker use a vulnerability to hack a no-name's phone or mac? They are too busy trying to hack actually valuable targets.
Did the OP actually think he was going to get serious LLM exploiters to give up their jailbreaks for this "fun" experiment? Instead he got a bunch of hackernews readers to try one or two casual attempts and then he declared victory over jailbreaks?
Does the OP think this was science? That it proves LLMs cannot be jailbroken?
Think about it, if you had an actual jailbreak for Opus 4.8, why would you use it for a very public, silly experiment?
You would be selling it to the highest bidder, or to Anthropic, or using it on some high value target.
Also, the average person has no idea about the field of jailbreaking. It's like asking the average person to hack a random IP and expecting them to do it.
If you go and do your research on actual people who research jailbreaks and publish them, they are increasingly sophisticated and multistep, and unless you know this, you would have zero chance of just randomly jailbreaking Opus 4.8.