undefined

points

by feverzsj10 hours ago |

comments

by bel89 hours ago|

[-]

I don't think China would strugle to scrape the internet for fresh data.

And they constantly publish state of the art LLM research (see DS4 context compaction and cache tech).

They have very capable tech giants. So while not being able to distill western models would probably have some impact, it's probably becoming lesser as time passes.

We might even see Western LLMs distilling Chinese models soon. If they aren't already to some extent.

by hnfong2 hours ago|

parent|

[-]

Everyone distills/copies training data.

A couple months ago when Anthropic was complaining about Chinese distillation, people found that Claude self-identified as "DeepSeek" when asked in Chinese:

https://x.com/stevibe/status/2026227392076018101

It's really a fiasco of massive hypocrisy at this point.

by bdcravens5 hours ago|

prev|

[-]

Look at all of the software that has been developed as an alternative (and often an upgrade to) software in the west. (Baidu, Wechat, etc)

Many of the top AI researchers at western companies are from China, and many are returning.

by tristanj9 hours ago|

prev|

[-]

Yes, 100%. GLM 5.2 is capable of RSI. It's too late to stop.

by VortexLain6 hours ago|

prev|

[-]

Depends on a lab, but they do have plenty of compute and engineering. So this would only slow down the progress.

by pjmlp8 hours ago|

prev|

[-]

Of course, it is like any other kind of weapon system, eventually the knowledge gets acquired.

by margorczynski9 hours ago|

prev|

[-]

China has most probably already achieved "escape velocity" on the software side. Now if they achieve parity, to some degree at least, on the hardware side with Nvidia it is very possible they'll overtake the US.

by realusername5 hours ago|

prev|

[-]

It doesn't matter, the only models getting compared are the public ones.

If Anthropic had a super secret model that nobody has access to, I'm not sure why I should care about it since I can't access it.

by surgical_fire9 hours ago|

prev|

[-]

Probably yes.

More than a year ago, when Anthropic and OpenAI started to hide the reasoning bits from the output, a lot of people here on HN predicted that Chinese models days were numbered.

Fast forward to today, and models such as DeepSeek and MiMo are nothing short of excellent. I haven't used GLM or Qwen but heard very good things about them as well.

This "massive distillation" sounds a lot like anxiety about how companies from outside the US can develop very good models themselves.

by VortexLain6 hours ago|

parent|

[-]

In my personal, subjective opinion GLM-5.2 is on par with GPT-5.3