upvote
What's dangerous is Opus 4.8's proclivity to create backdoors and no-op critical security code. Claude Web counted 27 instances of this I had cataloged over the last few months, and Fable 5 found more. Fable 5 may do this too, but I didn't get a long enough chance to test it since it kept downgrading to Opus 4.8 on every prompt saying, "This model has safety measures that flagged something in this session", even when asking Fable 5 to fix the security issues it found that Opus 4.8 created. You have a model that presumably can write secure code and identify security vulnerabilities, but as a security measure, they say we're going to force you to use a model that creates security holes. This is backwards. Considering the scale, Opus 4.8 is creating more issues than Mythos or Fable 5 is patching.
reply
> Especially if those people aren't presently very bright, and are already mad at you for not helping them achieve their unrelated authoritarian goals.

Just more corrupt behavior from the contemptible kakistocracy that's busy running things into the ground and enriching themselves while they're at it.

reply
Sometimes, the enemy of my enemy is my friend.
reply
The driving force behind most presidential votes
reply
That's what happens when you have a two party system but more than two political philosophies. Given that both parties will work tirelessly to prevent a third party from forming I think the level of corruption should be understood to extend _well past_ the White House.
reply
My gut reaction was that it does look like a PR stunt. But indeed it might also be a blunder caused by all of their other PR stunts. "Our new stuff is soooo dangerous!!", followed by "The US government believed us and acted accordingly".
reply
The CEO's post even mentions supporting export controls, all be it in regards to chip exports. [1]

They suggested the use of the very law used against them here...

[1] https://darioamodei.com/post/policy-on-the-ai-exponential

reply
I presume he's exhilarated that the government is taking the threat seriously and banning foreign nationals from accessing these super dangerous tools.

Congratulations, Dario!

reply
[dead]
reply
Yeah. Our stuff is waaaaay toooooo dangerous! The model is soooo powerful that I have to write a long essay telling government to change the economic policies, to regulate hard, and to ban this and that. Well, now the government is indeed regulating for a claim that Dario has been warning about. This is exactly getting what he bargained for?
reply
"I do not think this is somehow a 3D chess move by Anthropic"

But it seems likely that they took this possibility into account - and that they now prominently and unremovable show the "Fable not avaiable" (link - government said so) is likely with the intention to make pressure on the US government.

reply
It’s a market manipulation following SpaceX ipo. They’ll buy and then reverse the decision shortly to sell.
reply
If you’re confident that will happen then you can also make that trade and profit, right?
reply
Timing is important
reply
Maybe @varjag isn't morally corrupt
reply
People who look up to Donald Trump unsurprisingly feel his genius moves are hard to read. They are not though, if you are familiar with petty thug mentality: https://news.ycombinator.com/item?id=46474173

This prediction is quite falsifiable too so anyone is free to rub it in my face if it fails. If it's really a speculative insider trade the reversal will be done in the space of 2-3 weeks tops, but likely even faster. Probably on a workday. Kinda the same pattern they were doing with tariff swings until the market figured it out and stopped reacting.

reply
> People who look up to Donald Trump unsurprisingly feel his genius moves are hard to read.

Indeed, the lord works in mysterious ways.

reply
Maybe they have a lower level of confidence that it will actually work.
reply
Or the utility function of them gaining 50% more money is less than that of losing half of their money.
reply
Anthropic pushed for the US government to introduce regulations. The US government said no, citing potential stifling of innovation.
reply
Yeah. Did they want this all along? This will just create more hype, and may push towards significant usage once and when it is available.
reply
The first step to regulatory capture is getting yourself regulated...
reply
But when is now if it becomes available. Oops!
reply
It will, otherwise Antrophic will eventually have to leave the US. And I don't think they want that.
reply
For better or for worse, 0 chance this happens for the exact same reason Elon/SpaceX is also tied to the US regardless of how goofy the government gets. If they did so, it would almost certainly directly drive criminal prosecution with various national security flavorings on top.

Every single worker and operation would need to be in countries with no extradition treaties, and even then they'd likely be limited to serving the tiny handful of nations that are willing/able to resist US pressure, so pretty much - Russia and China.

reply
That sounds a bit totalitarian.
reply
> I do not think this is somehow a 3D chess move by Anthropic. They are not masterminds, even if they'd really like to be.

They should have consulted their own models about the ramifications and unintended consequences; based on their actions over the past few months I think it is safe to say that the models are smarter than the decision-makers at anthropic, lol. I know the models are smarter than I am and even I could have told them that they were taking paths, FUD for example, that would lead to grief.

reply
>> People who actually interact with their products know that Fable and Mythos are incremental improvements, not doomsday devices.

If you look outside HN, you'll see that people who interacted with Fable 5 overwhelmingly thought that it was a significant improvement, not simply an incremental one. Most reputable benchmarks show this as well.

reply
Step 1: don't trust benchmarks you don't understand - they might measure irrelevant things Step 2: test it on things you know Opus failed

My day-to-day take, for the coding I do (not security related): incremental, modest improvement, if any. Not worth the 2x cost. I've calmly continued to use Opus, happy that it seems like it got an allowance upgrade.

reply
It's a bit odd that you automatically assumed I don't understand the benchmarks.

For most single issues/bugs/tickets, the quality difference wasn't noticeable. But that's like using a sledgehammer to kill a fly. I was using Fable for much more ambitious and complex tasks that require orchestration, and it was crushing it. I described it here: https://news.ycombinator.com/item?id=48505782

So yes, the benchmarks are indeed accurate: where Opus 4.8 would start strong and eventually struggle or run into obstacles, Fable would relentlessly keep working, keep accurate track of all work threads (e.g. multiple inter-dependent issues being worked in parallel by subagents) and would go above and beyond.

reply
Yeah! I also think that the ban was unintended.

I also think that’s a big clown show. People think that LLMs accidentally get good with security patterns. That is not the case, they included all of that in the training data. They could also have left out the knowledge.

reply
If they leave at all info about security patterns, you get an AI that knows how to code, but doesn't know what can make code insecure. That doesn't seem like a great idea.
reply
I don't think you actually can avoid a subject very effectively. Some things might have to be derived from related examples and real time searches but ultimately a kid raised by helicopter parents is all the more dangerous on the day they find the censored materials.
reply
In the long run it's not punitive but rather amazing marketing for Anthropic. People crave what they can't have.
reply
hard to sell something people can't have though
reply
It will be reversed after Trump makes some “deal” with Anthropic. I’ll put money on Taco Tuesday.
reply
To be clear, they've been saying that all AI needs to take a break. I don't think this single action is going to do much.
reply
They've also been saying coding is solved while having text flicker in a terminal
reply
Where did they say that?
reply
Boris Cherny has said it many times, you can search YouTube for "coding is solved" to find examples

Or watch Primagen's "I think they are lying to you" with clips in it

reply
"They were asking for it"
reply
> punitive

Not only that, but also a golden opportunity to flex the muscle of anti-immigration.

reply
Do you have any idea what actual authoritarianism looks like? What an insult to people who are truly suffering.
reply