undefined

upvote

points

by ivraatiems18 hours ago |

upvote

by espeed4 minutes ago|

[-]

What's dangerous is Opus 4.8's proclivity to create backdoors and no-op critical security code. Claude Web counted 27 instances of this I had cataloged over the last few months, and Fable 5 found more. Fable 5 may do this too, but I didn't get a long enough chance to test it since it kept downgrading to Opus 4.8 on every prompt saying, "This model has safety measures that flagged something in this session", even when asking Fable 5 to fix the security issues it found that Opus 4.8 created. You have a model that presumably can write secure code and identify security vulnerabilities, but as a security measure, they say we're going to force you to use a model that creates security holes. This is backwards. Considering the scale, Opus 4.8 is creating more issues than Mythos or Fable 5 is patching.

reply

upvote

by replwoacause17 hours ago|

[-]

> Especially if those people aren't presently very bright, and are already mad at you for not helping them achieve their unrelated authoritarian goals.

Just more corrupt behavior from the contemptible kakistocracy that's busy running things into the ground and enriching themselves while they're at it.

reply

upvote

by sh34r16 hours ago|

[-]

Sometimes, the enemy of my enemy is my friend.

reply

upvote

by MattDamonSpace16 hours ago|

[-]

The driving force behind most presidential votes

reply

upvote

by themafia53 minutes ago|

[-]

That's what happens when you have a two party system but more than two political philosophies. Given that both parties will work tirelessly to prevent a third party from forming I think the level of corruption should be understood to extend _well past_ the White House.

reply

upvote

by resonious16 hours ago|

[-]

My gut reaction was that it does look like a PR stunt. But indeed it might also be a blunder caused by all of their other PR stunts. "Our new stuff is soooo dangerous!!", followed by "The US government believed us and acted accordingly".

reply

upvote

by ApolloFortyNine6 hours ago|

[-]

The CEO's post even mentions supporting export controls, all be it in regards to chip exports. [1]

They suggested the use of the very law used against them here...

[1] https://darioamodei.com/post/policy-on-the-ai-exponential

reply

upvote

by the-grump3 hours ago|

[-]

I presume he's exhilarated that the government is taking the threat seriously and banning foreign nationals from accessing these super dangerous tools.

Congratulations, Dario!

reply

upvote

by viking1233 hours ago|

[-]

[dead]

reply

upvote

by hintymad1 hours ago|

[-]

Yeah. Our stuff is waaaaay toooooo dangerous! The model is soooo powerful that I have to write a long essay telling government to change the economic policies, to regulate hard, and to ban this and that. Well, now the government is indeed regulating for a claim that Dario has been warning about. This is exactly getting what he bargained for?

reply

upvote

by lukan5 hours ago|

[-]

"I do not think this is somehow a 3D chess move by Anthropic"

But it seems likely that they took this possibility into account - and that they now prominently and unremovable show the "Fable not avaiable" (link - government said so) is likely with the intention to make pressure on the US government.

reply

upvote

by varjag10 hours ago|

[-]

It’s a market manipulation following SpaceX ipo. They’ll buy and then reverse the decision shortly to sell.

reply

upvote

by andsoitis4 hours ago|

[-]

If you’re confident that will happen then you can also make that trade and profit, right?

reply

upvote

by tartoran2 minutes ago|

[-]

Timing is important

reply

upvote

by kitd3 hours ago|

[-]

Maybe @varjag isn't morally corrupt

reply

upvote

by varjag2 hours ago|

[-]

People who look up to Donald Trump unsurprisingly feel his genius moves are hard to read. They are not though, if you are familiar with petty thug mentality: https://news.ycombinator.com/item?id=46474173

This prediction is quite falsifiable too so anyone is free to rub it in my face if it fails. If it's really a speculative insider trade the reversal will be done in the space of 2-3 weeks tops, but likely even faster. Probably on a workday. Kinda the same pattern they were doing with tariff swings until the market figured it out and stopped reacting.

reply

upvote

by yakshaving_jgt1 hours ago|

[-]

> People who look up to Donald Trump unsurprisingly feel his genius moves are hard to read.

Indeed, the lord works in mysterious ways.

reply

upvote

by thomastjeffery2 hours ago|

[-]

Maybe they have a lower level of confidence that it will actually work.

reply

upvote

by vkou1 hours ago|

[-]

Or the utility function of them gaining 50% more money is less than that of losing half of their money.

reply

upvote

by lateral_cloud15 hours ago|

[-]

Anthropic pushed for the US government to introduce regulations. The US government said no, citing potential stifling of innovation.

reply

upvote

by shibaprasadb15 hours ago|

[-]

Yeah. Did they want this all along? This will just create more hype, and may push towards significant usage once and when it is available.

reply

upvote

by deepsquirrelnet21 minutes ago|

[-]

The first step to regulatory capture is getting yourself regulated...

reply

upvote

by morkalork15 hours ago|

[-]

But when is now if it becomes available. Oops!

reply

upvote

by lukan5 hours ago|

[-]

It will, otherwise Antrophic will eventually have to leave the US. And I don't think they want that.

reply

upvote

by somenameforme2 hours ago|

[-]

For better or for worse, 0 chance this happens for the exact same reason Elon/SpaceX is also tied to the US regardless of how goofy the government gets. If they did so, it would almost certainly directly drive criminal prosecution with various national security flavorings on top.

Every single worker and operation would need to be in countries with no extradition treaties, and even then they'd likely be limited to serving the tiny handful of nations that are willing/able to resist US pressure, so pretty much - Russia and China.

reply

upvote

by lukan2 hours ago|

[-]

That sounds a bit totalitarian.

reply

upvote

by ninjagoo16 hours ago|

[-]

> I do not think this is somehow a 3D chess move by Anthropic. They are not masterminds, even if they'd really like to be.

They should have consulted their own models about the ramifications and unintended consequences; based on their actions over the past few months I think it is safe to say that the models are smarter than the decision-makers at anthropic, lol. I know the models are smarter than I am and even I could have told them that they were taking paths, FUD for example, that would lead to grief.

reply

upvote

by enraged_camel3 hours ago|

[-]

>> People who actually interact with their products know that Fable and Mythos are incremental improvements, not doomsday devices.

If you look outside HN, you'll see that people who interacted with Fable 5 overwhelmingly thought that it was a significant improvement, not simply an incremental one. Most reputable benchmarks show this as well.

reply

upvote

by rad_val2 hours ago|

[-]

Step 1: don't trust benchmarks you don't understand - they might measure irrelevant things Step 2: test it on things you know Opus failed

My day-to-day take, for the coding I do (not security related): incremental, modest improvement, if any. Not worth the 2x cost. I've calmly continued to use Opus, happy that it seems like it got an allowance upgrade.

reply

upvote

by enraged_camel2 hours ago|

[-]

It's a bit odd that you automatically assumed I don't understand the benchmarks.

For most single issues/bugs/tickets, the quality difference wasn't noticeable. But that's like using a sledgehammer to kill a fly. I was using Fable for much more ambitious and complex tasks that require orchestration, and it was crushing it. I described it here: https://news.ycombinator.com/item?id=48505782

So yes, the benchmarks are indeed accurate: where Opus 4.8 would start strong and eventually struggle or run into obstacles, Fable would relentlessly keep working, keep accurate track of all work threads (e.g. multiple inter-dependent issues being worked in parallel by subagents) and would go above and beyond.

reply

upvote

by augunrik4 hours ago|

[-]

Yeah! I also think that the ban was unintended.

I also think that’s a big clown show. People think that LLMs accidentally get good with security patterns. That is not the case, they included all of that in the training data. They could also have left out the knowledge.

reply

upvote

by johntb8631 minutes ago|

[-]

If they leave at all info about security patterns, you get an AI that knows how to code, but doesn't know what can make code insecure. That doesn't seem like a great idea.

reply

upvote

by porkchoppers3 hours ago|

[-]

I don't think you actually can avoid a subject very effectively. Some things might have to be derived from related examples and real time searches but ultimately a kid raised by helicopter parents is all the more dangerous on the day they find the censored materials.

reply

upvote

by amirathi16 hours ago|

[-]

In the long run it's not punitive but rather amazing marketing for Anthropic. People crave what they can't have.

reply

upvote

by sznio14 hours ago|

[-]

hard to sell something people can't have though

reply

upvote

by r053bud3 hours ago|

[-]

It will be reversed after Trump makes some “deal” with Anthropic. I’ll put money on Taco Tuesday.

reply

upvote

by egonschiele17 hours ago|

[-]

To be clear, they've been saying that all AI needs to take a break. I don't think this single action is going to do much.

reply

upvote

by verdverm15 hours ago|

[-]

They've also been saying coding is solved while having text flicker in a terminal

reply

upvote

by lukan7 hours ago|

[-]

Where did they say that?

reply

upvote

by verdverm5 hours ago|

[-]

Boris Cherny has said it many times, you can search YouTube for "coding is solved" to find examples

Or watch Primagen's "I think they are lying to you" with clips in it

reply

upvote

by unethical_ban16 hours ago|

[-]

"They were asking for it"

reply

upvote

by jimmydoe16 hours ago|

[-]

> punitive

Not only that, but also a golden opportunity to flex the muscle of anti-immigration.

reply

upvote

by Rover2221 hours ago|

[-]

Do you have any idea what actual authoritarianism looks like? What an insult to people who are truly suffering.

reply