undefined

points

[-]

Likely an improvement on:

> We study a novel language model architecture that is capable of scaling test-time computation by implicitly reasoning in latent space. Our model works by iterating a recurrent block, thereby unrolling to arbitrary depth at test-time. This stands in contrast to mainstream reasoning models that scale up compute by producing more tokens. Unlike approaches based on chain-of-thought, our approach does not require any specialized training data, can work with small context windows, and can capture types of reasoning that are not easily represented in words. We scale a proof-of-concept model to 3.5 billion parameters and 800 billion tokens. We show that the resulting model can improve its performance on reasoning benchmarks, sometimes dramatically, up to a computation load equivalent to 50 billion parameters.

<https://arxiv.org/abs/2502.05171>

by tyre2 hours ago|

prev|

[-]

From the recent New Yorker piece on Sam:

“My vibes don’t match a lot of the traditional A.I.-safety stuff,” Altman said. He insisted that he continued to prioritize these matters, but when pressed for specifics he was vague: “We still will run safety projects, or at least safety-adjacent projects.” When we asked to interview researchers at the company who were working on existential safety—the kinds of issues that could mean, as Altman once put it, “lights-out for all of us”—an OpenAI representative seemed confused. “What do you mean by ‘existential safety’?” he replied. “That’s not, like, a thing.”

by actionfromafar1 hours ago|

parent|

[-]

Amusing! Even if they believe that, they should know the company communicated the opposite earlier.

by levocardia2 hours ago|

prev|

[-]

Oh you mean literally the thing in AI2027 that gets everyone killed? Wonderful.

by notrealyme1232 hours ago|

prev|

[-]

That's sounds really interesting. Do you have some hints where to read more?

by arm322 hours ago|

prev|

[-]

Oh, of course they will /s