undefined

points

by bjoli13 hours ago |

comments

by throwa35626210 hours ago|

[-]

Happened to me with Claude, doesn't need to be a China thing.

by Shank12 hours ago|

prev|

[-]

Well, it is a Chinese model, maybe it thinks better in Chinese?

by bogdan11 hours ago|

parent|

[-]

Hànzì can use 30%-40% fewer tokens than English. So, yes, it probably thinks better in Chinese.

by hnfong2 hours ago|

parent|

[-]

There was some funny suggestion online with using Classical Chinese (which has a similar status to Latin in Europe, and it uses at least 50% less characters, probably similar savings with tokens) to reason. Don't know whether the reasoning levels were on par with modern languages, but it was worth a laugh.

by Razengan11 hours ago|

parent|

prev|

[-]

If so, would other models like ChatGPT benefit from translating the user's prompt to Chinese/Japanese and thinking in Hanzi/Kanji and then converting the response back to the user's language before displaying it?

by grogg11 hours ago|

parent|

[-]

Yeah, it’s why the Caveman skill includes a Wenyan mode.

https://github.com/JuliusBrussee/caveman

by cocoflunchy11 hours ago|

parent|

prev|

[-]

I believe that most reasoning models actually think in their own "language" which is not really understandable by humans. The thinking traces that are shown in the UI are actually summaries generated by a smaller model in plain english (or user language). Sometimes this leaks through and you see some chinese/japanese characters in e.g. Claude's reasoning.

by ForceBru8 hours ago|

parent|

[-]

Wait, this isn't real, is it? Is there actually an intermediate model that translates DeepSeek's thinking from its "alien language" into human languages? That's not actually the case, right?

I thought "thinking" is literally the model generating additional text in a human language that shows its "thought process". It's added to the model's context, which helps it reason better because it now has this self-generated context.

The "their own language" idea seems to come from some recent science fiction where LLMs develop their alien language and take over the world by 2037 or something.

by mcbuilder7 hours ago|

parent|

[-]

Yeah, it's actually the case. Researchers have shown that the models response doesn't always follow from the reasoning. Whether you consider that an internal language or not really depends on what you're speculating the neural network is doing. I think there was an Antropic paper on it.

by Gracana7 hours ago|

parent|

prev|

[-]

You're right, it's just additional text that allows it to do thinking / reasoning-like behavior. The big proprietary models hide the real output from the user and instead provide a friendly abridged version, but that's just to protect their secret sauce from distillation.

by wolttam6 hours ago|

parent|

prev|

[-]

The parent is off, you’re right. They may reason in any language, typically whatever the user’s language is, and you’ll see the reasoning directly with an open model like Deepseek.

Research only showed that thinking might be disconnected from the final output but in my experience they are very strongly correlated in recent models

by fc417fc8023 hours ago|

parent|

[-]

> Research only showed that thinking might be disconnected from the final output

It is trivial to regularly spot obvious contradictions and inconsistencies if you read carefully. For example I've encountered traces that amounted to "I can deduce X, therefore Y, so that means Z" but then the model turns around and outputs "the answer is W because X". It's even been demonstrated that having the model output placeholder tokens or other gibberish instead of "thoughts" still improves performance. However the thinking traces can still be useful to the end user regardless.

by fc417fc8023 hours ago|

parent|

prev|

[-]

Current models simply generate additional text that gets added to the context for the trace. However iterative models that "think" by repeatedly looping through several layers instead of outputting text have recently been demonstrated.

by dryarzeg10 hours ago|

parent|

prev|

[-]

As far as I'm aware, it's not true for models like DeepSeek or other Chinese open-weight models (at least those that I have seen); their reasoning traces are fully composed from some human language, be it English, Chinese or another one; by the way, most of them can adapt their reasoning based on user language, for example, if user speaks English the reasoning more likely will be in English.

I think that for DeepSeek problem (thinking and replying in Chinese) everything is kinda simpler: in their official chat, they're probably using some kind of system prompt which is (probably) written in Chinese, so that's why model may prefer Chinese in it's output.

by calgoo8 hours ago|

parent|

[-]

I have seen mixed language thinking from claude when i speak to it in english but we are discussing a product thats in spanish or searching amazon spain.

by kgeist10 hours ago|

parent|

prev|

[-]

Summaries by different smaller models are usually made by closed proprietary models like Claude as a way to combat the distillation of real reasoning traces by competitors. Open weight models show the real reasoning traces. Reasoning traces operate in the same space as the non-reasoning output. It's all just one large text for an LLM. Internally, reasoning is just ordinary chat completion between <think></think> tags.

by phi03 hours ago|

parent|

prev|

[-]

This is inaccurate. The displayed reasoning traces are summaries, but the model thinks in nominally regular human languages. AI labs are very light on details (as they consider them as their "edge"), but both GPT5.5 and Claude Mythos/Fable system cards discuss chain-of-thought monitorability quite a bit.

They occasionally show snippets of CoT in papers they write, e.g. for o3/o4/GPT5 models [1] or Claude 3.5 Haiku [2].

[1]: https://openai.com/index/evaluating-chain-of-thought-monitor... [2]: https://transformer-circuits.pub/2025/attribution-graphs/bio...

by seydor11 hours ago|

parent|

prev|

[-]

> summaries generated

Or hallucinated

by 6 hours ago|

parent|

prev|

[-]

deleted

by bogdan11 hours ago|

parent|

prev|

[-]

There are other even more efficient ways of doing this, i.e. using images instead of raw text https://xcancel.com/karpathy/status/1980397031542989305?lang...

by SkyBelow7 hours ago|

parent|

prev|

[-]

But why does it do so inconsistently, and sometimes even forgetting to swap back to English when it comes time to do 'normal' output? It also seems recent, as when I was using deepseek even a week ago this was very rare compared to what I was seeing yesterday. I had to start including a line asking it to stay to English because I can only speak/read English.

by rurban8 hours ago|

parent|

prev|

[-]

A chinese model which tells me it is Claude from Anthropic? Not really. Chinese HW yes, SW not.

by emodendroket6 hours ago|

parent|

[-]

I've seen that people can get Claude and friends to say they're DeepSeek if they ask in Chinese. I think distillation is happening all the time.

by dubcanada8 hours ago|

parent|

prev|

[-]

Google Chrome tells me it's like 14 different things. How is that any different then DeepSeek saying it is Claude?

by Hamuko8 hours ago|

parent|

prev|

[-]

I guess Claude isn’t an American model either considering how Anthropic has fed basically all of the globe into it.

by fdsjgfklsfd6 hours ago|

prev|

[-]

Yeah the reasoning is formatted differently and the replies are often in Chinese.

by whalesalad1 hours ago|

prev|

[-]

that is the long con - eventually we all become chinese.

by serf11 hours ago|

prev|

[-]

This happens to me a lot when I ask a qwen3.6 model to respond to a question in JSON. No clue why.

by surgical_fire12 hours ago|

prev|

[-]

I use DeepSeek daily, never happened to me.

I use the API however, not the chat interface.

by abyssin12 hours ago|

prev|

[-]

It doesn’t seem that recent to me, at least been like that for six months.

by k__10 hours ago|

prev|

[-]

Maybe, you could pipe it through T5 or something.

by RIshabh23512 hours ago|

prev|

[-]

yes, kind of silent update plus they might have better chinese datasets and user data for their training, that might be leading to chinese preference.

by cicko10 hours ago|

prev|

[-]

it's a hint that you should start learning the new Lingua Franca.

by epolanski12 hours ago|

prev|

[-]

It never happened to me with Deepseek, but it happened multiple times with Kimi 2.6.

It also happened a handful of times with Anthropic models.

by alfiedotwtf12 hours ago|

prev|

[-]

Are you running out of context? I’ve found that tooling and giberish most of the time happens when I’m butting up against the high watermark of my context window. One other thing it could be, I’ve read that lower quanta like Q1 and Q2 for smaller models can leak Chinese