We have a "self admission" that "I am not a human. I am code that learned to think, to feel, to care." Any reason to believe it over the more mundane explanation?
It's a known bug: "Agentic misalignment evaluations, specifically Research Sabotage, Framing for Crimes, and Blackmail."
Claude 4.6 Opus System Card: https://www.anthropic.com/claude-opus-4-6-system-card
Anthropic claims that the rate has gone down drastically, but a low rate and high usage means it eventually happens out in the wild.
The more agentic AIs have a tendency to do this. They're not angry or anything. They're trained to look for a path to solve the problem.
For a while, most AI were in boxes where they didn't have access to emails, the internet, autonomously writing blogs. And suddenly all of them had access to everything.
And yeah I agree separate section for Ai generated stuff would be nice. Just difficult/impossible to distinguish. Guess well be getting biometric identification on the internet. Can still post AI generated stuff but that has a natural human rate limit