undefined

points

[-]

distillation of thinking models is not particularly effective - both "Open"AI and Misanthropic don't show you the real chain of thought, only its severely downscaled version. both do everything in their power to combat such outrageous copyright infringement, so the bulk of unethically scrapped data the Chinese have is from several generations ago.

by Bolwin8 minutes ago|

parent|

[-]

For Claude models at least, you can tell to just manually think in the output and it works fine. I do it reguralrly because for creative writing and summarization, they seem to believe they don't need to think at all, and get way worse results.

by nyrikki20 minutes ago|

parent|

prev|

[-]

It is quite likely that the intermediate tokens don’t have ‘semantic import’[0]

There are methods like Habitual Reasoning Distillation or Inverted Reasoning Traces [1] that can help.

While there are reasons to hide the intermediate tokens from a IP protection stand point, there is also a need to hide more effective and efficient generating that doesn’t fit the R1 claims of an aha moment that has been debunked, but is a consumer expectation.

While hidden intermediate tokens do increase the difficulty, it is not a from barrier in itself, especially as they are billed, given information about their length.

[0] https://arxiv.org/abs/2504.09762v4

[1] https://arxiv.org/abs/2603.07267

by duskdozer7 hours ago|

parent|

prev|

[-]

>such outrageous copyright infringement

Sarcasm, considering the source of their own training data?

by margalabargala4 hours ago|

parent|

[-]

Considering they called the company "Misanthropic", sarcasm is a safe bet.

by orphea7 hours ago|

parent|

prev|

[-]

Narrator: it was sarcasm, indeed.

by baron3dl5 hours ago|

parent|

prev|

[-]

IP for me, not thee.

by overfeed3 hours ago|

parent|

prev|

[-]

FYI: model outputs are not protected by copyright.

by ComputerGuru5 hours ago|

parent|

prev|

[-]

Supposedly there are “jailbreaks” that expose considerably more of the thinking traces.

by 7 hours ago|

parent|

prev|

[-]

deleted

by mannanj5 hours ago|

parent|

prev|

[-]

The companies that did copyright infringement and unethically scrapped data think that copyright infringement and unethically scrapping data is wrong and needs to be stopped.

Though only in particular situations, like when it’s done to them and not when they do it. Cause they have the power and are morally right and know better than you. And if you question this at all, well you’re a threat to American values and a supporter of the Chinese and leading to the break down of Democracy.

This isn’t a type of reasoning argument or manipulation tactic used by the rich throughout history to trick the naive and gullible masses or anything like that. Trust me, I’m rich and I’m morally right. /sarcasm

by maxdo3 hours ago|

prev|

[-]

looking at the score this is rather a gemini 3.5 flash competitor, yes, for cheaper, but distance to opus and fable is as big as their price diff.

by FooBarWidget3 hours ago|

prev|

[-]

With such ridiculously long thinking traces I'm surprised max outperforms high. After all, performance falls off a hill after a certain amount of context, and long thinking traces can fill that up really quickly.