Is this Dario leveraging it into a ban on open models?
Step 1: "OMG, the AI hacked a researcher eating a sandwhich in the park!"
Step 2: Journalists use that great clickbait to generate profit, which generates publicity for Anthropic
Step 3: Rinse repeat
If the threat of LLMs was treated relative to the actual capabilities of them, and we weren't all being lied to by Anthropic and their army of millions of social media bots and backing media companies and mouthpieces, we'd be going in a much healthier direction. Working out the kinks/supply chain risks and developing sound, long-term countermeasures to the ACTUAL risks.
The only threat to the world is if progress is not open-sourced, democratized and in lockstep with capability. The moment it becomes a scenario of: Only a small group get access to frontier intelligence, is when it gives that small group power over everyone else in the world, and wildly increases the risk of a nuclear level event that WILL be exploited eventually - as the divide between the haves and the have nots accelerates in an exponential fashion. Bad AI is countered with an abundance of good AI that has been used to stay ahead of bad AI. The moment your bad AI outpowers the army of good AI it is game over for humanity. The strength of open-source and open-access AI is the difference between humanities permanent enslavement or extinction versus a prosperous future.
It doesn't help that most of the employees at Anthropic have willingly sold their souls out of short term greed and gaslight themselves into thinking that they're actually doing the right thing to justify their own greed to themselves, while building up an echo-chamber and culture of feel good lies within the company so they can sleep at night, and pat each other on the back. They go along with this because they get paid massive chunks of money from Anthropic, and their shares will be worth more money if Anthropic can swallow the worlds economy at the expense and enslavement of everyone else. What good is that money when you have to sell out humanity in the progress though. You, at Anthropic, is that the legacy you want to leave?
People need to start calling this out before it's far too late. If you work at Anthropic - time to start talking to your colleagues in an honest manner.
Because this wouldn't be considered a jailbreak with any other model; which would just do the request.
OpenAI's models are very good, they have refusals + a government ID verification story for cyber access (I don't think they prevent non-US nationals, but I don't know this). What they don't have is Project Glasswing and all the hand wringing about how they're going to end the world in public.
I hope Anthropic pulls their head out of their ass and just starts acting like a normal company.
OpenAI CEO Sam Altman testifies at Senate artificial intelligence hearing | full video“ (2023)
"My worst fears, are that we cause significant - we the field, the technology, the industry - cause significant harm to the world...If this technology goes wrong, it can go quite wrong and we want to be vocal about that."https://www.npr.org/2026/03/09/nx-s1-5742548/anthropic-penta...