undefined

https://naokishibuya.github.io/blog/2022-12-30-gpt-2-2019/

[-]

Mythos is from the same guy who did "GPT-2 is too dangerous to release"

by oceansky5 days ago|

[-]

He was kinda right.

Lawyers, doctors, students, teachers. Lots of people using GPT models carelessly in harmful ways.

by alasano5 days ago|

[-]

Obviously not what he meant at the time but hilarious(ly sad) in retrospect.

by dmix5 days ago|

[-]

Delaying a technology release is not going to stop that in the long term. Society, culture, and the support tooling just needs to adapt. Just like how AI coding is still in the early days.

The sooner people learn the risks and build the infrastructure to make it fail less the better.

by uselessTA5 days ago|

[-]

The claim I remember was that releasing it would start an arms race for AGI, which was absolutely true

by notnullorvoid5 days ago|

[-]

If it was truely an arm's race to AGI they would've stopped relying on the data/param scaling law BS ages ago.

by killerstorm5 days ago|

[-]

"Malicious use" means spam, propaganda bots, etc. It's nice to give people who work on spam filters some heads-up.

by supern0va5 days ago|

[-]

It's clear that the parent didn't bother to read the link they shared, which articulates exactly this. That's embarrassing.

From the link:

> They summarized their findings from the nine months:

> 1. Humans find GPT-2 outputs convincing.

> 2. GPT-2 can be fine-tuned for misuse.

> 3. Detection is challenging (detection rates of ~95% for detecting 1.5B GPT-2-generated text by RoBERTa).

> We’ve seen no strong evidence of misuse so far.

> We need standards for studying bias.

> All these points are valid, and OpenAI did a great job identifying potential risks, especially misuse and biases, at an early stage.

by riknos3145 days ago|

[1] https://en.wikipedia.org/wiki/Dario_Amodei

[-]

> All these points are valid, and OpenAI did a great job identifying potential risks, especially misuse and biases, at an early stage.

Many of the OpenAI employees who were focused on these risks in GPT-2 later founded Anthropic, notably Dario [1]. Since the beginning and continuing through today Anthropic describes itself as an "AI safety and research company" [2]

I'm not sure if the OpenAI of today has the same focus on safety, or if they do the minimum to not look irresponsible given Anthropic's effort.

[2] https://www.anthropic.com/company

by supern0va5 days ago|

[-]

Just to be clear: that is quoted text from the source and not a statement I'm making, in case that's what you're suggesting here.

by InsideOutSanta5 days ago|

[-]

People quote the "GPT-2 is too dangerous to release" thing as if it were wrong, but given all the slop all over social media and how it's used to create division and attack social cohesion, he was clearly right.

by 1attice5 days ago|

[-]

History is long and never over, so he could easily be right both times before this is through.

by viking1235 days ago|

[-]

That guy is the biggest clown lmao

by Flere-Imsaho5 days ago|

https://arstechnica.com/ai/2026/04/uk-govs-mythos-ai-tests-h...

[-]

The UK gov disagrees with you:

https://www.aisi.gov.uk/blog/our-evaluation-of-claude-mythos...

by ainch5 days ago|

https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5...

[-]

AISI did also say that GPT-5.5, which has been public for months, scores basically the same as Mythos on their cybersec evaluation. But there wasn't as much media about about that for some reason.

by saddlerustle5 days ago|

[-]

AISI found the release version of mythos preview outperformed GPT-5.5 https://x.com/AISecurityInst/status/2054589763173126339

by cxvrfr5 days ago|

[-]

Government of the least mismanaged country in the world?

by mhh__5 days ago|

[-]

AISI is basically the crown jewel of the British government at this point in that its actually pretty good.

by HDBaseT5 days ago|

[-]

You mean the most mismanaged country?

by geerlingguy5 days ago|

[-]

Bingo.

"We had to do extra work to make this safe because it's so advanced and dangerous..." how many times can they trot out that line before it loses its effect entirely?

by copperx5 days ago|

[-]

Only three times, if fables are right.

by TaupeRanger5 days ago|

[-]

The Startup Who Cried Unsafe, by AIsop

by YumpiLumpus5 days ago|

[-]

[dead]

by OtomotO5 days ago|

[-]

With homo "sapiens" "sapiens"? A few decades at least.

by aesthesia5 days ago|

[-]

I mean, they do actually describe what that extra work was, and people elsewhere in this thread are complaining about the effects of those safeguards. So it's not like this is purely empty rhetoric.

by zem5 days ago|

[-]

people are not questioning whether they did the work, they are questioning whether the work was really necessary (i.e. if mythos is really so good that it needs safeguards to prevent malicious actors from using it)

by bel85 days ago|

[-]

It worked for OpenAI when GPT 3 was deemed too dangerous to be released. This is just a spin of that.

[-]

I still remember it. "Open"AI going API-only because GPT-3 is really really dangerous, so forget the Open in our name and all of that, you can't download our models anymore and must request access to them because they pose a THREAT.

Fast forward to today and GPT-3 has laughable performance.

by shoeb00m5 days ago|

[-]

Even back then there were plenty of people who got fooled by AI generated articles. It's easier to spot AI writing now because we are so used to it. They were right to be concerned; not that it achieved much since oss models run laps around gpt-3 now.

[-]

But it seems like that was not genuine concern, but instead a tactic to pivot to closed models and an API service with an excuse to do so, breaking the public's expectation that they would be a non-profit making open models, like their name implies.

by teaearlgraycold5 days ago|

[-]

I know a security researcher at Google with access to Mythos. He says it's the "real deal" and that "there are career plans I had that are no longer viable".

by zeroonetwothree5 days ago|

[-]

“trust me bro”

by teaearlgraycold5 days ago|

[-]

He could be incredibly naive. We'll all find out with time.

by CSSer5 days ago|

[-]

Yes, and "in collaboration with the U.S. Government" feels like a very gross ploy at appeal to authority. You don't need Mythos or really any SotA frontier model to make malware or do extensive penetration testing/reconnaissance already. Sure, Mythos might be faster/more efficient, but the cat has been out of the bag for awhile. Even the terminology "infrastructure providers" practically screams "Enterprise leads".

by whazor5 days ago|

[-]

I think all models can find vulnerabilities if read the entire code base. Or intelligently combine parts of the codebase. Especially with test loops.

by ls6125 days ago|

[-]

And to ensure that only USG-approved entities are allowed to secure their code.

by toddmorey5 days ago|

[-]

I fear it's a smokescreen to manage cost and capacity.

by mpeg5 days ago|

[-]

It's not even very usable... I tried 2 different chats and both eventually got stopped due to the safeguards

One was a piece of code I gave it to improve, it did so and then started writing tests, some of which tested security so the safeguards triggered

Another was one of the cryptography puzzles I use as new model tests, which are hard to oneshot and there's no public solution anywhere, it completely refused to even try to solve it

by gavinray5 days ago|

[-]

I tried 2 chats and it declined both.

- 1st chat asked about a minor shoulder injury most likely mechanisms

- 2nd chat asked about optimal bloodwork testing markers

by kranke1555 days ago|

[-]

it seems to dislike biological chats. Rejected me on a chat that I am running with 4.8 as well on a rare condition I have.

by Erem5 days ago|

[-]

So the degradation to Opus 4.8 from the article isn't happening in practice?

by mtkd5 days ago|

[-]

No, you get a AUP violation and have to manually swap the model

(I had same issue, just asked it to check some code that 4.8 had modified earlier in day)

by andai5 days ago|

[-]

Maybe that's only in the chat UI, and not the API?

by mpeg5 days ago|

[-]

It is, it asks you if you want to continue as opus 4.8… but I was trying precisely to evaluate fable

by CSSer5 days ago|

[-]

Oh joy. A model whose safeguards make it prone towards code that make your systems less safe. How brilliant!

by himata41135 days ago|

[-]

They're trained in a model class likely in 2t to 3t range. It's very unlikely that chinese labs have access to gpu systems capable of training models like that, let alone serving them. This requires proprietary room-scale systems which fetch a huge premium over typical 10 slot systems.

I am sure that they can develop their own equivlient version of such clusters in around 1 year though. Distilling fabel 5 will also go a long way.

by logicprog5 days ago|

[-]

DSv4 is nearly in the 2t range, but yes you're generally right

by himata41135 days ago|

[-]

MoE experts were likely trained independently / in a sparse format. Training anything beyond 2t on typical systems would be infuriantingly slow, you could do 4t on nvidias room-scale solution, but for a reasonable training speed / batch size it caps around 3t.

by sosodev5 days ago|

[-]

Do you have any resources to share regarding independent expert training? I was under the impression that it's not feasible.

by himata41135 days ago|

[-]

concept is similar to how it works in inference, instead of performing regressive writes to the entire model you run the whole model, but part of the model can live in system memory and get swapped in/out on demand. So only XB parameters are active in training.

edit: I am not really sure if it works like that. I haven't looked too deep into deepseek v4 pro specifically.

by axpy9065 days ago|

[-]

We’ll see it distilled first.

by OtomotO5 days ago|

[-]

Ah, American Hubris ... I don't blame you, Hollywood is the world's greatest propaganda machinery of all times.

by FergusArgyll5 days ago|

[-]

I think we're about to see a big relative drop-off of open models vs closed. I don't think there'll be an open model that competes with Mythos for ~2 years.

Even OpenAI and Google are struggling to get this kind of performance. If the distillation defenses are any good + chip controls prevent China from training massive models, it's over.

by Daishiman5 days ago|

[-]

I think the Chinese have identified this gap and are working overtime on sovereign inference tech including chips.

by __blockcipher__5 days ago|

[-]

They have, but even with the whole CCP backing you you can't just catch up on the chip war overnight. It's going to take time to get their memory and compute industries where they need to be. Meanwhile, barring an invasion of Taiwan, US will have Rubin class models and then whatever the next tier is, within 3 years.

by 1attice5 days ago|

[-]

'Barring the invasion of Taiwan' might actually be quite a lot to bar in mid 2026.

My hot take is that it's now or never for Xi, and from the specific things he is reported to have said to the US president at their last meeting lead me to think that he at least knows this is his big chance; whether or not it is taken is the part of the forecast that is opaque to me.

by ricardobeat5 days ago|

[-]

Unluckily for you, they started back in 2014, and had a huge incentive to speed up in 2019 when Trump started restricting exports.

by throwwwll4 days ago|

[-]

Nice fomoing.

by sosodev5 days ago|

[-]

I wonder if model distillation will continue to work as well as it has. Given hidden reasoning, the ever expanding number of expected capabilities, a serious compute shortage, the looming possibility of model collapse, and dramatically higher API costs I would guess that it's getting much harder to do.

[-]

You should check out some Chinese forums. There are services selling gateways/proxies for all major models at fraction of the official rates. Likely reselling subscriptions, or some other form of abuse.

I've seen people posting screenshots of billions of tokens consumed where they paid next to nothing.

These same gateways are likely also reselling the data to Chinese labs, because TLS has to terminate at the gateway level.

by sourcecodeplz5 days ago|

[-]

Asian labs generated synthetic datasets from UBS labs but also innovated with technology. Now it is harder to get the thinking traces AND Anthropic is recorded to poison it as well.

Thus Asian labs will have to generate their own data sets, which with the huuuuge usage boom from deepseek, mimo, kimi, etc, they will be able to.

[-]

There's also a reality where China does develop Mythos-level model but stops releasing the weights.

That reality is much scarier.

by kaashif5 days ago|

[-]

That's the reality China already lives in. Their weapon against US companies is commoditizing them, eliminating their moats and their profits by going open weights.

Same thing Meta was doing before they fell behind.

[-]

> Same thing Meta was doing before they fell behind.

Obviously unrelated to the OP, but it's crazy to me how incompetent Meta is at everything new they try to do.

They burned billions of dollars on the most ridiculous project one could ever think of - somehow thinking that VR is the future.

Then they did catch the initial wave of actual future with AI, they were at the forefront of open weight models - and failed at that too.

What is even happening there?

by bonoboTP5 days ago|

[-]

Meta made Pytorch and a lot of vision models back in the day, like Faster RCNN, Mask RCNN, the Detectron framework, and more recently the SAM and DINO series. AI not just LLMs.

by TurdF3rguson5 days ago|

[-]

muse-spark is the next most capable text model after Opus according to LMArena FWIW

by cco5 days ago|

[-]

My experience is that open weight models from China are at least ~12 months behind. In some workloads they may be closer, in others further away.

I also find that the harness and product you wrap around models can often narrow that gap considerably.

Opus 4.6 for example, on a PR-for-PR basis was head and shoulders above GLM 5.1. Perhaps GLM 5.1 was a bit under Sonnet 4.6 at the time. That's roughly a year or so behind.

Much cheaper though! I'm bullish on open weight models, I have no idea where all these curves will top out, can the frontier labs keep the year plus lead? Do open labs get close enough to SOTA that they gain adoption across many tasks and drive down inference prices??? Who knows, not me.

by jstummbillig5 days ago|

[-]

I wonder where the trees are. In this thread nobody appears to actually be talking about the model.

[-]

Yeah, because it's impossible. You can't ask it anything about the thing that it's known for. It will not even answer a sky-high level question about reverse engineering, for example.

In CC, it will probably report you to authorities if you ask it to do a vulnerability scan of your codebase.

by dmantis5 days ago|

[-]

Isn't that a good thing in a way? If everyone has the weapon and defense at the same time, we will fix security holes and live safer lifes instead of having some three letter agencies and military backdoors in everything.

Pandora box is open anyway. It's better now for everyone to have the same power rather than a few national states.

by lebovic5 days ago|

[-]

Not sure this holds, sadly. I spent a few months reporting serious security bugs as model capabilities took off earlier this year, and only ~half were fixed. The unfixed bugs were just as critical as the fixed ones; sometimes they were even two similarly critical bugs at the same company, and only one would be fixed!

On your other point, the government still has systemic leverage and can compel access, so this doesn't remove that risk.

That doesn't mean this is the end of the world, and some balance of power is usually good. But I do think it will still increase the capabilties of rogue actors and their net harm.

by uyzstvqs5 days ago|

[-]

It's more evidence that the future is local. With some time we'll all be running highly capable & efficient open-source models on dedicated NPUs. No censorship, no rate limits, no overpriced subscriptions.

by deaton5 days ago|

[-]

Oh they might try to put in place safeguards, but Qwen has had no problem being abliterated

by m3kw95 days ago|

[-]

3-5 months is a long time and they are pretty useless on arrival because the frontier models are so good, that it's hard to go back even if it's way cheaper. Your work flow is adapted to that level of intelligence for months.

[-]

That doesn't match my experience at all. I can't see myself saying in 6 months that the current model I am using is useless, that makes no sense.

In fact, I did go back to DeepSeek V4 Flash for most of my problems as it is way cheaper and there is no need to use SOTA for absolutely everything.

by m3kw94 days ago|

[-]

i'm sure there are small use cases, but lets just say you would never go back to gpt3.5 to do much except for fun.

by YumpiLumpus5 days ago|

[-]

[dead]

by xdennis5 days ago|

[-]

> every bit as capable and dangerous as current day Mythos except with no safeguards

Not quite. They will definitely have "no criticism of China/communism" safeguards.

by surgical_fire5 days ago|

[-]

And, thankfully, I never needed to have a discussion on Chinese politics with LLM in all the myriad of uses I had for it.

[-]

People can work around those if they are open-weight.

by xyzsparetimexyz5 days ago|

[-]

Trying asking fable is Israel is committing a genocide

by flagged3577335 days ago|

[-]

They aren’t.

by elAhmo5 days ago|

[-]

Oh please let’s stop with the Mythos “it’s dangerous” PR talk.

Its obvious Anthropic used it to hype things up and that’s about it.

by soledades5 days ago|

[-]

> Rationalists are inventing oligopolies from first principles, absolutely incredible things happening in SF.

Based.

by ibejoeb5 days ago|

[-]

I don't think China has any incentive to arm the rest of the world with highly capable models that can be used against them. Undoubtedly they will continue with the arms race, but they will preserve the best stuff for their own use.

by james2doyle5 days ago|

[-]

I think the stronger incentive is undermining/undercutting the Western AI companies. Given what we have seen, any model can be used/convinced to do harm so that is just part of the game

by ibejoeb5 days ago|

[-]

I agree, depending on how much of this is marketing and how much is actual capability. It's one thing to undercut models that finish writing assignments for lazy students. If this actually identifies vulns and writes exploits, or if it designs bioweapons, those are pretty different. Those are actual weapons, and I don't think they're going to arm the adversary.

by trollbridge5 days ago|