Anthropic got the most rewarding hype ever in the history of mankind.
Imagine a private company invents a piece of technology soooo good that the US government has to issue a ban.
Did the government ban any models from Google or OpenAI? Nah, Russian/Chinese spies and ISIS are welcome to use those dumb models.
Anthropic will probably go for $2T IPO now.
For all we know there are might be several reasons for that ban e.g.
1) There is an actual security threat and its just simple as that.
2) Someone wants Anthropic to be valued way higher and the companies that have invested in Anthropic already... This ban only validates this product and will move the market in higher valuation of Anthropic due to their model being "so good gov had to ban"
3) Someone doesn't like Anthropic and just wants to shut down its current edge (highly unlikely, if there was no IPO filing in place it could be possible but now the valuation just goes up, same as the 2 As that have invested in them)
4) Someone freaked out that we'll be left out of jobs soon so wants to slow down progress, tbh using fable so far I can tell that a lot of jobs can be made redundant cause of that...
For me the most likely for now is 2, then 1 and then maybe 4.
On June 22 Chatgpt will most likely come out with their new model too, which as I understand will be an answer to mythos. Lets see if the US gov goes the same route.
Even if this is just temporary, your #3 is more in direct conflict with #2 than you seem to be willing to admit: if you were to own stock in a company that you know has a powerful product and a market lead, but they have been required to take a time out in the market for a year, that should be devastating for their valuation.
For the Gov to come out and block a model for national security, its gonna swing the market into thinking "oh anthropic really has the next generation of LLMs out there, its that good Gov banned it, this company is going to the moon".
The part of banning non US nationals, I believe is a legality, as in they have to trust US citizens to do right by their country. I don't think in court a whole ban on a product for security reasons would stand. (The judge would ask for the gov to explain why all US nationals are a security threat to their country)
Nevertheless, again I am standing behind number 2 personally as the main reason for such a thing, market manipulation is not new and its currently at its all time high. Also anthropic is part of this manipulation so far, with every other AI company out there.
Again I am just presenting my POV, it could as well just be number 1... A gov became competent enough to find security threads before they happen :)
There are lots such products, like weapons-grade radioactive material, weapons outside the toy gun range, various biological material, ...
So it seems perfectly possible to bad products for security reasons.
It might be a good marketing trick but it is not a good thing in the stock market given historical trends.
Your view highly screams you only have a superficial understanding of financial markets and you shouldn't extraploate that to "this is how market works because its all hype and everything is vapor"
Anthropics reputation as a near-term world-ender boosts their IPO directly.
I feel you are just talking a hypothetical without having any basis. You think it'll have an impact on IPO directly and that it will be a positive one. But you have no proof or historical precidence for the same. Meanwhile we have historical proof that markets reacts negatively when a company is blocked by the government on selling their top products freely. And that is most likely going to happen here as well.
Public perception might be changed by these "ohhh its so scary guys" marketing but these don't translate to actual market perception when it comes to actual facts and numbers on the financials.
Now it’s a competition between products on the near frontier. Anthropic has executed well on products so far. They blew up thanks to Claude Code, not Opus by itself.
(2) This "hype" meme is overrated. Enterprises (ones without a horse in the race, at least) will choose the model their best engineers ask for, or their competitors will lap them. I have been finding Codex more useful (even than Fable) but for a lot of tasks it seems that Claude Code is faster. This is one customer base where the general consensus here on HN is more influential than anything the Trump administration could do or anything Anthropic could say.
(3) "US government seems out to kill you" does not necessarily make valuation go up, and we've already seen this administration in an avoidable spat with Anthropic.
(4) This seems way less likely than a mix of (1) and (3) to me. The arguments for banning a useful technology to save jobs haven't really made sense since cars or indoor plumbing and don't get taken too seriously in either party at senior levels. That could change but it will take a lot for it to change.
Fable had some really good cross project awareness. My only complaint is it backported a feature to my test application and then they killed it before I could finish debugging it. The new model behavior in the replacement application was 100% superior. I just didn't know it was going to start porting fixes so readily between projects. Awareness in the new model is amazing and the feedback I've had from other developers is the same. It feels like ultrathink with double the agents of xhigh effort. The real issue is they shipped it with incomplete guardrails and someone likely found an exploit.
1. https://www.theguardian.com/us-news/2016/sep/13/donald-trump...
Also, might be a way to further screw with Anthrophic because they refused to remove their guardrails Pentagon, getting the opposite result of what was intended.
This is about grift, somehow, full stop.
I neither like nor support Anthropic, but there's just no sense in pretending the Trump administration is anything other than a kleptocracy or interpreting their actions under any other lens.
This is potential tyranny aimed at Anthropic, specifically.
Being a US ally has become meaningless, and using a company that’s not targeted today does nothing to protect you tomorrow.
It's a clear statement that European morals are purely performative
Just like how the EU is hostile towards US companies, but very light to the touch when it comes to corruption with HSBC, FIFA or VW. With such hostile and erratic allies, who needs enemies?
Let's not even get into Orban. You can never trust the EU again since who knows if they're capable of electing someone like that in the future? Trust is broken forever
Whether you believe that is another thing. But that’s the signal. It’s amazing marketing for them, even if a pain in the ass for customers rn
Investors will have so much FOMO over this
Major alarm bells should be ringing for anyone not using a US-based LLM.
Apple's G4 was banned for export. Although it was not a direct order from US government. They fell into an outdated bracket of computing power exports limits. They sure did use it for advertising it.
What? Anthropic is not a TikTok sensation. It's a business tool. Businesses need to know their tools work reliably.
When you are situated in a banana republic and the chief banana is out to get you (and demonstrates that they can and will on a whim) that is not great hype but a potential death sentence for you as a service provider.
You are one degree away from becoming forever branded as unusable. (Theoretically until people trust that a sane administration is in control again, but that might as well be forever on current AI timelines, given how much cashflow you need just to keep going)
It pretty much is. Claude is more of a meme than a tool. It's been second best (and more expensive option) for most of the time but people somehow keep talking about it. I'm getting strong Apple vibes from this one.
Google and OpenAI will eventually catch up and be banned as well. Therefore, this ban isn't really a huge concern for Anthropic since their competitors will be banned eventually.
All this does is proving to investors that Anthropic is indeed ahead of its competitors.
" We have reviewed a report that we believe is the basis of the government's directive and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5)"
The administration just doesn't like anthropic. OpenAI is in bed with the trump Administration.
What they know right now is that the model is so advanced the US government has to ban it, and the model comes out of Anthropic. Not Google. Not OpenAI.
Paying for 365 days of service but getting 364 would normally get you a full refund, not just a 1 day credit according to visa/MasterCard rules.
Nah, SpaceX just IPO'd.
"Mythos capabilities" is not some magic threshold. This is exactly the type of language that people used about GPT-4 in 2023. Today, I can run models far stronger than GPT-4 on my laptop at speeds better than GPT-4 offered.
Anthropic are quite good at coining sticky phrases like "Mythos-class models", but these are manipulative attempts to shape the discourse for business purposes and should be identified as such.
They used this type of language with GPT-2. Le sigh, yawn.
Yes, some of it looks silly now, though it's always easy to criticize with hindsight: the models could do unexpectedly impressive things and we didn't fully know the limit yet, it was a black box.
Remember you're critcising the org that actually made it public to people earlier than any other: the uncertainty was a temporary caution. The "open" in OpenAI was because they made it available, unlike Google at the time.
When the company that enables this, makes the predictions in the first place, that is a self-fulfilling prophecy.
Ever since the original GPT-2 "it's too powerful to release!" I've realized that whatever is the current state of open models represents what we really have access to.
It's shocking to me how many people on HN, who engage in long conversations about LLMs and AI, have never actually run a model on their own hardware.
All you need is a reasonably good macbook pro/studio or an RTX [3-5]090 and you can run useful models in the >= 30 tokens/second range (much higher if you choose the GPU path). The difference between what you can run on this hardware and what you can run on hardware that costs 2-5x is not that big. Don't be fooled by people on Twitter/X claiming you need some outrageous setup.
It's also increasingly clear that frontier models are nowhere near close to pushing the limits of efficiency. Quantization, MoE, and other techniques have dramatically improved even in the last year.
For work, of course use OpenAI/Anthropic models, but for anything personal, anyone who considers themselves a "real engineer" should be running local models, using open harnesses and seeing what they can accomplish with these.
Even if open releases slow down or even stop, we have the foundation, right now, for smart engineers to squeeze something quite useful out of. Hopefully we'll one day figure out how to train large models in a federated way. But either way: not your weights, not your inference.
but i also dont doubt in a few years time models with those benchmarks will be able to be run locally
still many many breakthroughs to be had
Yesterday Fable 5 finally solved a non trivial problem I had (after working on it for a few hours), and I went to bed excited. Waking up to find that Fable 5 is not available anymore, I guess I should feel happy I have the code it produced yesterday. But frankly I feel like a child having their candy taken from them.
We need open available models as smart as the current US proprietary ones. If intelligence like that becomes common property, i forsee a better future for human kind!
The US government found a jailbreak that allowed the user to make Fable do bad things, this is so dangerous that this model must be held back in areas that are not the US...
If this is so dangerous why allow US nationals access to it? Are there no evil people in the US?
Going back to my perspective: let's say I control a big enterprise or a government body, how should I view this or US technology? Should I be like: yes, let's use US tech, they are a reliable partner and would never abruptly cut us off! Or should I be like: there are competent alternatives out there and if your work hinges on wether or not you had access to Fable 5, then your business is probably not going to survive for long.
I don't think it makes sense the assume the US considers any country its ally.
The full quote makes it clear Kissinger was saying, in the context of the Vietnam War, that the US should come to the aid of their friends:
> "Word should be gotten to Nixon that if Thieu meets the same fate as Diem, the word will go out to the nations of the world that it may be dangerous to be America's enemy, but to be America's friend is fatal."
It shares that intelligence with the countries it was gathered from. It's been explicitly stated many times that this is an intentional work around for weak constitutional provisions on protection of citizens.
I spy on your guys. You spy on mine. We all share notes.
This isn't a "The US vs. The World" this is "The Wealthy vs. The Poor."
I'm glad Europe is finally waking up to this reality.
But in general I agree, the other parts got a big wakeup call.
I’m Scandinavian. The US is an adversary; please wake up.
How come the EU is making a "digital sovereignty" push? Why are only EU people allowed to compete for EU services? Are there no evil people in the EU?
Training best models is hella expensive. Anthropic spent fortune to train it, and it definitely plans to make a fortune with it either. This US decision, if not reversed, would cause Anthropic potentially tens of billions of dollars of revenue loss. When company heads to IPO, and burning cash faster than it generates it, such moment can change their entire trajectory, plans for the future, hiring, new models development, etc.
Alright, one might say that “US will fund it directly and LLMs will move from free market to controlled and funded by government assets”.
But even then. Training new models is expensive not only in terms of computer, and not only in terms of utilities, data centers, etc. But in terms of talent either. It is hard to retain top talents with you when they should just train special models for government. I am not sure we are in 1945 again and that top tier AI researches will agree to sit in silo and work for models which only privileged selected organizations might use. Whenever government steps into control, freedom and creativity is affected.
P.S. Where I agree, though, is that we witness the start of government censorship of AI models. Imagine soon Anthropic open back access to Fable. Can we know what they put inside and which capability limitations, derived based on ID/IP, they enforce? No, we can’t. Here I agree that at least the era of government censorship begins.
But it is! How many times have OpenAI and Anthropic threatened the rest of humanity with extinction at the hands of their LLMs? Monthly I think.
They were hoping for a government supported monopoly. Careful what you wish for.
To the tech-libertarian crowd on HN this is the definition of evil. To everyone else it's responsible behavior and common sense.
Yes, from chatgpt 0.1 onwards. Everything has been a dangerous frontier model and has threatened humanity with extinction before.
Banning foreign nationals from using your technology only makes sense if you don't rely on foreign nationals to build it in the first place.
Or are we so far along now we think we don't need them anymore.
I'm wondering if they might go for a restricted access model that goes beyond passport or citizenship, where people can still use it, but you have to be individually vetted, and put on a list to get security clearance.
And also, even though Anthropic may not have labs themselves directly, there is a funnel of research that comes in the form of papers and conference tracks.
The AI community is pretty tight knit, and not having access to frontier models affects everyone.
That's separate from the question of whether Fable 5 and Mythos 5 are unsafe. I don't really know. Here's a few things that seem real, though: These models probably have some level of capability to assist with bioterrorism, Anthropic has self-admitted that their own safety measures are imperfect [1], so it should come as no surprise that jailbreaks seem far more possible than Anthropic is leading you to believe in this blog post [2].
[1] https://www.anthropic.com/news/fable-mythos-access: "We suspect that perfect jailbreak resistance is not currently possible for any model provider."
[2] https://x.com/elder_plinius/status/2064776322979676227
If Amazon sold a book that taught someone how to commit bioterrorism, would there be action against them to stop selling it? Its an imperfect analogy, but the parallels are there. LLMs don't get a free pass because they're also so good at writing typescript for beige CRUD apps and bedtime stories.
One thing I hope we align on: Synthetic safeguards (steering, rejections, etc) on top of models to block illegal/sensitive topics isn't good enough. Anthropic has self-admitted that it isn't good enough. We need the technology to lobotomize these capabilities the public deems too unsafe to allow out of the models at the most fundamental level. And, we need to align on what the scope of these forbidden fruit topics are. This is, actually, the only way open source continues to thrive. I want open source models to thrive, but they won't be allowed to thrive, nor should we want them to thrive, if they're teaching people how to engineer novel viruses and other horrible stuff.
Plenty of useful things get free passes to be dangerous. Traffic accidents are the leading cause of accidental deaths in 11 states, but we don't ban cars because they're dangerous. There's plenty of safety features, but we acknowledge and accept that people will die. People like to pretend that they won't sacrifice safety for convenience, but they continue to do it time and time again.
We need legislation that empowers a Federal agency adjacent to the CDC or FTC with the power to enumerate specific capabilities models could exhibit that we deem dangerous, and require model manufacturers to guarantee that their models cannot exhibit these capabilities. The reality is, zero of the safeguard systems invented by the frontier labs today are sophisticated enough to do this. The labs are extremely, extremely bad at Safety, relative to both how much impact their products are having on the world and how good other industries like Medicine and Manufacturing have gotten at safety. I'm talking a total and complete culture shift.
Google had a "appoint Eric Schmidt as CEO" moment like this. There were kids running it before; make them rich, and retire them to an island somewhere, because the impact your products are having on the world cannot possibly be allowed to continue with your current leadership at the helm. Dario is that problem now. I think Sam/OpenAI can adapt and mature. I have zero faith that Dario and the furry EA cult in Anthropic leadership will. So, this is what being unable to mature looks like; the public will make your company safe, one way or another.
I don't necessarily disagree, but who is going to pay for improvements in models if they're not commercially available? Are AI companies going to become defence contractors with (US?) government paying for the training?
I don't know much it cost to go from AI model Foo 4 to Foo 5, but it's going to cost a pretty penny to eventually go to 6 and 7. These companies are doing so in the expectation of eventually charging customers to recoup the costs: if the only customer(s) is government(s) then the per-unit cost will be much higher. An analogy: you can get a 'civilian' toilet for USD 200, an FAA/EASA-approved toilet for airlines 2000, and a MILSPEC/NASA toilet for 20000.
Now this limited-customer model is certainly doable—tank manufacturers have a smaller base of customers than pickup truck manufacturers—but AI companies probably want regulatory clarity on what they're allowed to sell, and to whom, before they start developing new products/services.
I think there's a world where the US funds development of the next model in exchange for exclusivity, at which point they could "release" the previous version from that exclusivity.
The "Mythos capabilities" is a pretty arbitrary threshold. It could have happened any point before or after. China treats this technology in very different ways. And China's economy and foreign policies are very different, in such export-based economy export controls make much no sense. Unless something changes drastically in the global relations. I expect the opposite to happen, if the US leave a void globally, china will be even more incentivised to fill it.
This been said, it is likely that big models are soon no longer "open sourced" (qwen has already started), but because the chinese companies will prefer to sell access themselves, not because of government intervention.
In any case, what happens now is a huge thing, and not sure we grasp its consequences yet. It is definitely beyond any ai safety marketing stunt. Even if this is rolled back at some point, after "guarantees" are established, it has already moved things to change drastically.
That’s also the difference between being totalitarian government and laws and regulations based order.
Also foreign investments into Chinese AI labs have already been forbidden and asked to exit
They probably have gotten their PR or in the process, but naturalization requires five years after that, so there must be some still not citizen yet.
The fact that Anthropic are willing to dumb-down their own model responses to "Prevent foreign competitors from using the model to accelerate R & D and protect our leading position." [1] adds credence to this speculation. Anthropic are scared of their own model's power in the hands of competitors: it has nothing to do with security.
I remember something with either ChatGPT or Claude, way early on, where I had to upload my passport to use some level of it (maybe it was the OpenAI API).
Anyway, there's no way they just shut this completely down, the revenue from mythos is huge. So if they can't get the government to budge they'll find a way to be compliant without completely shutting down.
But I think my larger points stands: even if we do see Fable access again, this is the beginning of government restriction of LLMs and we are going to see more and more of it. In fact, I would be very surprised if we ever see an open weight model with Mythos capabilities. Chinese labs have been consistently releasing open models 6-12 months behind the frontier. In 6 months we may see them go dark.
Similarly, in the US I think we can expect more and more government restrictions on the strongest LLMs, in ways that may go beyond flimsy checks like uploading a valid US passport. It may not happen this year but I think it will happen eventually.
It still surprises me sometimes that LLMs are just available for _anyone_ to use. Isn't it odd that it turned out this way? When I grew up reading sci-fi I thought AI, if I ever saw it in my lifetime, would be something locked up behind the walls of big corporations and governments. But instead we have all been able to use it for an infinity of banal purposes for $100 a month. This is a strange situation but we have got used to it. But it may not continue that way.
Who knows, maybe the good stuff is locked up. If one of these corporations had something very special they may very well find it more profitable to enjoy the competitive advantage of using it for themselves than marketing it.
I assume it's some of their best training data.
We already have the rails for automated customer identification from US banking.
I think there is a larger "AGI" category error with all this too that is akin to the old futurist idea of driving nuclear powered cars in the "future". The Ford Nucleon.
Nuclear power comes to us in a mix of electricity from the utility company but is far too dangerous for an individual to posses nuclear material for personal nuclear reactors.
An electric car does run on nuclear energy in some sense but not the way the Ford Nucleon was envisioned.
The error of the AI bubble is that we are pricing these companies with SaaS multiples when they are eventually going to be public utilities. There is really no other way to handle the dual use nature of anything close to "AGI".
No, Dario said himself AI is national state weapon, then the government will not cease control.
What would happen is that we will have a more lobotomized and even more neurotic safeguards put in place in order to comply, and your data will be boardly sharing with the government.
Moving forward, above certain parameter size of model, it will require your self-identification in order to be used.
And before either 'aisle' piles on - I'm pretty sure the concern is bipartisan.
>> As part of our measures to keep our services safe and secure we may ask you to verify your age or identity, and we've described what we collect and how
The problem is that there are network effects at play - the more people you have using your models, the more training and fine-tuning data you're accumulating, so the faster you can develop the next frontier model. Not to mention the fact that more users means more revenue to fund your next-gen model training.
Perhaps the US administration is gambling that US citizens on their own provide enough of a training data and revenue flywheel for them to keep their AI development edge.
The next interesting question will be - will the US share this capability with her traditional strategic allies (e.g. five-eyes countries), or is it truly America First (or, 'America Alone')?
There is no way to enforce access of one and not the other, not with the state of tech in the US (and most countries without a great firewall). Bypassing such controls is as easy as a pilfered credit card (or some other american-looking payment method) and a vpn - both trivial to come by.
But that in itself is telling in a way: if national security was a true concern, access should be limited to people who passed background checks.
I guess I'm possibly giving them too much credit, but if the people who sent the letter have their head screwed on straight, "protecting national security by disallowing specifically non-citizens from using it" can really only be read as a smokescreen, or at very best a small part of the actual picture.
I’ve wondered this but then wouldn’t a large amount of input now just be AI output from a previous PR/client email/spec document/chat. Training of that would be an issue leading to distillation?
I think this is bang on. The motives are kind of irrelevant, because now that the precedent has been set, I suspect they'll be much more likely to go here for future restrictions. It's very convenient (even if true) to just say "security reasons".
This logic is flawed. China had no incentive to release SOTA models to the world in the first place when OpenAI were milking everyone with closed source paid models. What changes now? Nothing. In fact, this is even more incentive for them to capture marketshare and dependence on Chinese models as the world will simply just use alternatives. Not bow down to restrictions. If your logic were correct that people would just comply, then the tons of VPN services wouldn't have a market in the first place.
No way they will pass on this one.
That being said, they could still keep some other model from public while doing a PR stunt like this to eat their cake and have it too.
Commercialization - not open source - is the name of the game now in Chinese AI [2].
[0] - https://www.ft.com/content/b39da303-3188-447b-8b65-3dd8dad8b...
[1] - https://www.digitimes.com/news/a20260609VL215/alibaba-ceo-ai...
I personally see it as a net good if companies fearmongering for marketing purposes then have to face consquences from people taking their marketing at face value.
Hopefully it teaches them and others not to do it anymore.
So I doubt this action alone is enough to really stop other nations from getting access to state of the art AI. I think the US would have to go much further to really stop other nations from getting access to state of the art AI.
But even if their networks are secure, I think that spies who are willing to coerce people, trick people and go in person to data centers or offices could find a way to get those models and other things.
Not at all beyond the capabilities of any of the top ~9 or so best State actors.
Edit: To answer your question, very easily on the 20TB.
One crude method with a simple device in particular works well if you just clone the monitor data and then use HDMI and pass through. Then just cat dir in encrypted chunks to something like a USB key connected to the passthrough. 4TB USB keys are out there. A week of that gets you 20TB.
What is likely is that 'understanding of techniques' could be leaked.
Often, it's just well enough to know 'the approach' being used.
Thus they are far less likely to do something like this.
That would also significantly dampen the commercial incentives to develop such strong models, given the high costs involved.
On the other hand, such a future would probably save white-collar jobs.
based on Anthropic's own self promotion. no reason to think that Chinese models are not just as good or better. the key thing here is training on machine code and dis-assembled binaries and the Chinese have a complete data set of pirated software, with no limitations on how they use it. I seriously doubt they are actually behind.
> only if you're not a US citizen, but in practice, even if you are
the issue here is that Anthropic needs a legal opinion that their mechanisms for detecting foreign users in the US are compliant, which is technically hard to do, and a complex intersection of technical details and national security law, so getting a legal opinion can't happen overnight. it will be back.
This statement is utter nonsense. And if you think about it, it's in exactly the same spirit as calling for a wide ban on science books or education.
Despite the damage it’s better to build up an immunity through ongoing exposure, unless you want to end up like the previous American civilisations.
Honestly, and I don't say it lightly, long term this may have bigger impact on humanity as a whole than Iran war and its varying outcomes ( and consequences ). Separately, note how much this news was not really reported much today. Granted, a lot was happening, but it is telling.
It doesn't have to be free, we have the means to transact in a p2p fashion electronically as well.
I can't agree more. This is a precedent not just in denial but possible vagueness. Judiciaries have 'vagueness doctrines' to counter such laws/directives but _these_ may be re-trumped by the deference given to national security.
If we don't get soon a framework by which models may be measured as 'too powerful' vs 'not too powerful' we supercharge the self-dealing (corruption) that this administration has brazenly adopted. Many fingers can be put on many scales; groks may be given a pass while others are held to higher "standards".
Will OpenAI now just asymptotically bump its versions to 5.99999999 to stay under a limit that nobody really understands?
I realize that this has all just happened and we might get some good rigorous clarification from our government.... sigh. We are living in a kakistocracy. Who am I kidding?
China has flaws, plenty of them, but there's no real evidence to believe their motivations or mechanisms of pursuing motivations are that similar to that of the United States.
That part is up to Anthropic. KYC[0] is not exotic, it's just a pain in the butt: if Fable is that good, they can do the KYC.
I don't think this is the right move from the government, but we shouldn't pretend that "citizens only" is an insurmountable hurdle for a company that just got a $65B capital infusion.
It truly is a pain in the butt. But if access to (US banking | Fable) is worth it, you do the annoying work, and the customers accept the annoying limitations.
Anthropic gets into argument with US government over model usage -> Release a model calling it too advanced for safe use -> release the model to public knowing well that this admin has thinnest of skins and will do something
Regulatory capture in roundabout way. Now it is going to take crying wolf over other companies/countries developing “Mythos grade model” to kick off action especially in next two years of this admin.
Companies will keep improving models because AI is not yet fully there. But it is incredibly naive to think governments were ever going to allow state of the art technology to be released to public or do things this publicly. Every company wants to show off and get publicly restricted because it shows off their strength.
I can only say well played Anthropic.
That's a bold prediction considering that's true today...
Government always restricted data, tools, technology. In France for instance you're not allowed to have a gun, but policemen have.
What's the difference ?
Imo china, and deepseek will keep its open source model because they invest in long term. At some point they could do something similar, but not now.
USA government is just hurting AI development in their country, and that's good news to me.
In my experience it’s not, the only difference I noticed between it and Opus was its taking much more time to respond.
If you think there is ever going to be an open source Deepseek "AGI" model I just don't think that is thinking things through.
It is the main error of the AI bubble. At some level of intelligence, the dual use nature of a model is too dangerous for a purely hands off approach.
It is like thinking at the advent of the automobile that you will be able to drive your car at any speed, without a license , any place you want.
It is inevitable and the huge sums of money being burned to build these future highly regulated public utilities probably aren't going to be happy with the returns they get from funding a highly regulated public utility.
I think this also misses the point. The precedent here almost surely implies that it will be illegal to use these frontier models as well.
I can see a future where weights are distributed on the darkweb or bittorrent, or people are trying to use small fly by night hosts of models.
But if this says these models are dangerous and the companies and people can't be trusted with them, then I don't see why that wouldn't also apply to open weight models.
But who do you trust more to make these decisions? A democratically elected government or a private company?
The government is full of stupidity and this is indeed a big moment, but Anthropic has been begging for this outcome in their public messaging. If their fear-mongering was genuine, then great, they got their pause. If not, then what exactly did they want to happen?
I mean, maybe in principle, but if the object is just hobbling Anthropic you might still get OpenAI's latest model without that much trouble.
A pathogen with a very long incubation time and a high fatality rate would be about as bad as nuclear war. Maybe we need to figure out how to possibly defend against one person doing this before making it easy for anyone to do it.
and OSS models gave way to running it with freedom and security.
So OSS models have always tried to catch up to the frontier and lag behind 3-6 months. For my use cases, I am happy with current OSS models especially so if you let frontier-ish models design the plan with your input
If I were to suppose that China created a frontier model so good and far ahead, then I can understand if they don't open-source it. Qwen does it already with their Max models being closed-source.
but if you are suggesting that China in whole will remove itself from AI race, then 3 (or 4) possibilities can occur.
1. Some chinese companies might stop the production of OSS models if their names are known (z.ai etc.) but there are multiple other companies who are fighting with their research labs as well. They might create a decent model and OSS it to get known within world and China.
2. The whole Chinese economy (well similar to America, but to an even more extreme level from my understanding) depends on AI and is a bet on AI. They are funneling state and all bank money into these companies. From point 1, they wouldn't wish to be silent with frontier models and then lag behind and wait for other countries to catch up (point 3)
3. Europe(MistralAI)/India(SarvanAI? Kinda recent) will jump on the opportunity. (My point is that these two regions are trying to create their own models. How much they lack from the frontier is another thing but if China were to remove itself from the race, then they will have much more time to figure out how to make better models)
My point is that america and china are in arms race of closed source vs open source models. If china were to close source its models, they might simply lag behind and other countries will catch up.
4. Either that or you are right and we will have the current frontier OSS models and some more. IMO they are reasonably good as well and I used to wonder what would happen if say it would have been net good if AI was stuck at a similar level to sonnet 4.5 (IMO it was sweet spot), so I don't think that I am reasonably worried about it all. If absolutely need be, you can have an frontier model direct a plan and have OSS models do the grunt work.
That sounds so great.
> Will we be the poorer for that, or will we be safer?
We will be not just safer but richer. These LLMs are like drugs that should absolutely not be cast freely into the highways and byways. My main worry is that this action will be a haphazard one-off and not part of a coherent plan of curtailing LLM propagation.
I’ve been trying to track LLM code generation adoption in the critical infrastructure world - as far as I can tell, it’s nill. Zero. Nada. Nobody is relying on these models to write secure code for anything where failure is catastrophic. Planes falling out of the sky. Nuclear reactors going into meltdown. Electrical grids loosing synchronicity. Lots of these BS claims from the marketing and investment crowd, but - it’s just a useful tool for non-critical areas. That’s all it is.
It can be both the most powerful LLM on the market, and have no adoption in critical infrastructure.
obviously you need to review it
Okay. Let's say I agreed with you.
If you look at all technology and break down the total market for Critical Workloads vs non-critical workloads, what do you think that works out too, percentage wise? 12% critical? 18%? What if it was 30%! That would still mean 70% of the world's software could possibly be handled by an LLM. If that happens, the 30% of the Critical Workloads stuff is gonna get very, very competitive.
The whole idea is a lie and a marketing stunt to prop up the US stock market.
At first people said this was tin-foil hat territory. But ANYONE who publicly pisses this guy off, mysteriously get a government takedown weeks later. They're not even pretending anymore.
When Trump attacked them before because he wanted anthropic to decide who lives and dies, they said no. (That's probably a lie, I'm sure it has to do with money - but I digress).
So exec ban happened. Problem is - everyone uses claude. Microsoft is going through the same thing now. They find a way to use it anyway.
So they reverted it. What better way to go around circumvention than to just outright ban it.
Funniest thing - his own law makers are the ones who run on freedom and a nanny state. They are literally preventing us from using a tech "for our own safety" - can't get more nanny than that.
But it is a computer program and it won’t be long before the dam breaks to open source and anyone can use Mythos level AI at home for anything.
There is no stopping that unless you would set up a Police state more strict than China
It may be a de facto weapon and for better or worse everyone will be able to use it sooner or later.
I predict it will give birth to many great things and many equally terrible. Milions of people will die and milions will be saved. Such is nature of humanity. The good always comes with the bad. That was true for every invention.
Cars are used to kill by terrorists, rockets are used to bomb kids but also to go into space. It’s all same, old story.
You can’t stop LLMs the way you cannot control fire. Everyone can pour gasoline in the forest and cause terrible damage. And any excuse that we shouldn’t have matches must be viewed as what it is - an authoritarian, futile, desire for control
It would be too naive to suppose that the strongest LLMs are available to plebs now.
Those of us who are negative about AI for political reasons have been saying from the start that the biggest problem with AI is power. People can’t now all of a sudden be thinking that huh nation states have power (along with Big Tech and the rest of the power brokers).
But this is in fact quite a tortured fear, all wrapped up in the usual hyping—though this part is expected of LLM Euphorics. The usual story of simply making human labor less valuable and concentrating hardware for compute is just, you know, this rotten economic system working as it is intended. No need for weapons, subterfuge, three-letter agencies, much more straightforward, and just a natural evolution of X-CLASS CAPABILITIES.
I would be surprised if the public ever had anything close to the strongest LLM. It’s not like nuclear bombs were created by the private sector, then the government started the Manhattan Project and seized them all for itself.
They probably had Fable-quality models in 2016.
First I want to see them play video games at a high skill level, preferably without any access to game state beyond the same visual output that humans have access to, like a raster frame X number of times per second. One LLM model played Factorio, albeit at a very, very poor level, which can be seen if you slow the video to 0.25 playback speed and pause frequently.
https://old.reddit.com/r/factorio/comments/1u1blr6/claude_fa...
There have been streams of other games, where LLMs and AIs have likewise performed very poorly.
I recognize that LLMs might be better at language processing than these sorts of tasks. But being able to play video games is part of general capability. And this kind of hardcore video game playing, with no access to game state, is also a general task where feigning skill can be harder. If LLMs excel at pretending to be competent without actually being competent, like this AI training approach is arguably about
https://en.wikipedia.org/wiki/Generative_adversarial_network
Then some AIs might be trained and designed for deceiving humans instead of actually being competent and capable. And thus, one response is that they should be met with more difficult tests.
Basically, make tests that AIs or LLMs will not have an easy time cheating. Hopefully, that will engender research in greater LLM/AI competence, not in greater ability to cheat or deceive, neither for LLM/AI researchers and companies, nor for LLMs/AIs themselves.