by mordae 27 days ago

by jxf 27 days ago

My understanding is that it's not that the _models_ are banned, but rather the _platform_ is banned. It is acceptable to host, say, `deepseek-r1-distill-qwen-7b` and run it yourself, for example. It is not acceptable (to the authors of these bans) to download the DeepSeek app and run it on your work device.

by eskibars 26 days ago

I just left a job at a German B2B software company that sold primarily to large automotive, defense, and aerospace companies. Several of our customers specifically banned anything with the word "DeepSeek" -- hosted or self-hosted.

There's still a lot of naivety about the difference between models and platforms, and it's easier for a lot of these big companies to just make a blanket statement like "nothing DeepSeek" than for their procurement teams to try to understand and negotiate with each vendor. They don't see the potential benefit as outweighing the risk of somebody misinterpreting or getting it wrong, so they ban it outright.

Most people who approve or buy software also simply don't understand how models are trained, or whether (and how far) a model could go to "introduce backdoors." From a business perspective, a backdoor could be a model trained to give answers that hurt Western business in a "strict text mode," or one that, in a programmatic mode, produces payloads intentionally trained to introduce software vulnerabilities.

Anyone can make arguments against these for a variety of reasons (looking at the transparency of both sides and comparing, etc.), but for better or worse, many Chinese models are being banned on big software contracts today, which gets back to the title of the article.

by anvuong 26 days ago

Thing is, these models can also be a propaganda machine whether you run them locally or not. This is true no matter the origin. Chinese LLMs will never shit-talk the CCP, and they will always give a rosy depiction of the Chinese government. It's perfectly understandable if companies don't want things like that. US/EU models have these problems too, but at least there are some ways to fight that: with a lawsuit or a megaphone on social networks. With Chinese models there is nothing you can do.

by wouldbecouldbe 26 days ago

You are sending all your prompts, code, and files there. So of course it's an issue.

by overfeed 26 days ago

Where's "there" on a self-hosted setup?

by forgotusername6 26 days ago

We aren't allowed to use any unauthorized models even locally.

by mariopt 26 days ago

True, many people don't know GLM 5.1 and Kimi 2.6, which are really on par with frontier models. There's also Minimax 2.7, DeepSeek 4, Qwen, Xiaomi 2.5 Pro, etc.

China is leading in open source frontier models, so I don't really see how the US wins this one. At some point, companies and people will start running their own models in the cloud and locally, and Chinese models will be everywhere.

by packetlost 26 days ago

Nah, I model hop constantly as I work with serving GLM and Kimi models, and they're not nearly as good as Opus 4.5+ and GPT 5.2+; it's not particularly close. They're good by standards set a generation or two ago, but they're really not competitive with where the frontier models are at now.

by zozbot234 26 days ago

They compete with "mini" or "nano" model classes quite well given the price of inference. You'd need to "model hop" anyway; using Opus for everything is quite wasteful.

by packetlost 26 days ago

Those aren't really "frontier models" now, are they?

by zozbot234 26 days ago

They are on the frontier of local models, where the game is often to get the best bang for the buck. You can always scale model size and compute (Mythos, GPT Pro, Gemini DeepThink) to reach better outcomes, but that's not a very interesting strategy.

by satvikpendem 26 days ago

> They are on the frontier of local models

That's not what anyone means when they say frontier models; don't change the definition. It's almost as bad as open weight being subsumed by open source when it comes to local models.

by mariopt 26 days ago

Guess it really depends on what you use them for. I've been able to build whole apps with them, not slop. Kimi is quite good at design; for 3D, I noticed Gemini 3.1 is excellent for basic to medium use cases.

I've tried both Opus and GPT 5.4, they also hallucinate just like the rest at a much higher cost.

The more you use a model over time, the better you become with it. It's really hard to measure; my main metric lately has been tokens per second/time to complete task.

At this point I have the feeling frontier models are optimizing for benchmarks and one-shot prompts.

by anvuong 26 days ago

If you actually use them you'll see that they are far from frontier models. They are much more cost-effective for what they are, but frontier they are not.

by MetaWhirledPeas 26 days ago

> They are winning because West is forbidden to use Chinese models for anything work-related.

Because the models hosted in China are not trusted. This is 100% a part of what makes up commercialization.

by lmm 26 days ago

Is anyone outside the US trusting anything hosted in today's US? If so, why?

by coredev_ 26 days ago

I would say that both the US and China are using the data we entrust to them for industrial espionage. So don't use their models if you are working in defence or other sensitive areas.

by aucisson_masque 26 days ago

DeepSeek is a fraction of the cost of Western LLMs and still just as good. I'd say that's also related.

by pattt 26 days ago

Do we have any solid evidence these models can outperform Western models in terms of quality? Or is it more: because they are forbidden, they can't get enough training data, visibility etc. to compete?

by gpt5 26 days ago

Scroll down to the leaderboard - https://arcprize.org/leaderboard

Spoiler alert - they are all towards the bottom of the leaderboard. People come up with a wide variety of excuses for why they are not used despite being offered for significantly lower cost, but the answer is simply because they don't perform well enough for now.

by aucisson_masque 26 days ago

There isn't even deepseek V4.

I'd rather trust the LM Arena leaderboard, which puts it on par with Sonnet.

by gpt5 26 days ago

LM Arena uses human side by side voting, which limits its applicability to complex tasks.

The ARC Prize leaderboard does have DeepSeek V3.2, which only scored 4% on ARC-AGI 2 (while the top models score over 80%). It also has Kimi and Qwen, but they didn't perform well either.

by dyauspitr 26 days ago

Why? So that even more American IP can pass through Chinese servers? Or because their near-frontier models are heavily government subsidized?

by thinkingtoilet 26 days ago

>No, they are not. They are winning

You agree they are winning though, right? China is known for not playing fair, stealing industrial secrets, etc... that reputation matters and it's a good reason why the US is winning. Is the US perfect? No. Does the US play fair? No. Spare me the whataboutism in the comments. The bottom line is most people think the US is a safer bet and that's why we're winning. I personally wouldn't trust either government, but if I had to choose, I feel like I at least have a chance at secrecy and due process with the US. Obviously that is being eroded day by day, but you literally have no due process in China.

by aspenmartin 26 days ago

You’re saying if we were allowed to use e.g. Qwen more broadly the US wouldn’t be in the same strategic position? We have the best models…we own all the companies that make the best infra and the hyperscalers…I don’t think “oh we can use Qwen now?” would exactly devastate the US.

by visarga 26 days ago

> I don’t think “oh we can use Qwen now?” would exactly devastate the US

You'd be surprised how useful it can be to fine tune it in enterprise.

by aspenmartin 26 days ago

Well, definitely, but we have plenty of sanctioned OSS options for that.

by zozbot234 26 days ago

Qwen's open models are quite small compared to Kimi, GLM and DeepSeek Pro, which are often described as near-SOTA.
