upvote
Yeah I don’t think the models are meaningfully differentiated outside of very specific edge cases. I suspect this was the thinking behind OpenAI and Facebook and all trying to lean hard into presenting their chatbots as friends and romantic partners. If they can’t maintain a technical moat they can try to cultivate an emotional one.
reply
> very specific edge cases

Mathematics is hardly an edge case, but SOTA models differ wildly in their ability to write proofs for unsolved problems.

Models also differ wildly in tasks like decompilation for reverse engineering.

Also, so far, the only model I've found which can competently write PTX for SM100 CUDA devices is GPT-5.4pro, but I'm willing to admit that this is more of an edge case than the aforementioned.

AFAICT, the extent to which someone finds models interchangeable is inversely proportional to the novelty of their work.

reply
Saw a comment here yesterday referencing the Attention Is All You Need paper title in a tongue in cheek way. Kinda fun to imagine the friend/romance angle is just a bunch of socially awkward folk at OpenAI misinterpreting the original paper
reply