upvote
That assumes scaling laws still hold up. A bigger model might end up only incrementally more intelligent.
reply
They do. Mythos kicked ass while it lasted. And what we know of the scaling law curves promises us even more gains in the future.

"The future" being "whenever training and inference at increased scale becomes economical". Which is probably bounded by new generations of hardware, but might also be pushed forward by algorithmic advances.

reply
I think they're out of training data though...
reply
Synthetics are often used for "data amplification" nowadays. Extra compute covers a multitude of sins.
reply
Not only you could: you would also want to.

The likes of Mythos show that the scaling laws are real, and you can x5/x2 the total/active params and get meaningful gains. If "inference per param" gets cheaper? Up the params and get more intelligence for the same price.

reply
Quite true
reply