upvote
> They've published a fair amount about their architecture - enough that I imagine frontier labs could implement.

i think the real ones know this is the tip of the iceberg? hparam tuning, data recipes, data collection, custom kernels, rl/eval infra, all immensely deep topics that would condense multiple decades of phd lifetimes to produce SOTA performance (in both senses of the word) like this.

i would also calibrate what you are impressed by. simply waiting is a posttrain thing - the fact that gemini and oai have not prioritized it is not something you should overindex on as hard. what they showed with full duplex is technically far far harder to achieve

reply
In China it's become well known that promising new companies will get an offer from either Alibaba or Tencent. In the US, it's probably simmilar. Everything that's out in the open can get acquired or simply copied. Maybe that is what Thinking Machines is hoping as well?
reply
they hire leading researchers, and leading researchers won't work for you unless they're able to publish
reply
That was true 10 years ago. It’s most definitely not true now. The arms race is very real.
reply
> leading researchers won't work for you unless they're able to publish

oh, honey.

reply
Do we want the whole humanity to get richer, or few individuals (company owners)?
reply
Which seems bizarre. Companies can’t afford to just give things away right?
reply
Yes they can. Your research papers are not the whole story. It’s like google could open source their entire monorepo and very little would change. No one else could operate it.
reply