> very specific edge cases

Mathematics is hardly an edge case, but SOTA models differ wildly in their ability to write proofs for unsolved problems.

Models also differ wildly in tasks like decompilation for reverse engineering.

Also, so far the only model I've found that can competently write PTX for SM100 CUDA devices is GPT-5.4pro, though I'll admit this is more of an edge case than the two examples above.

AFAICT, the extent to which someone finds models interchangeable is inversely proportional to the novelty of their work.

Saw a comment here yesterday referencing the "Attention Is All You Need" paper title in a tongue-in-cheek way. Kinda fun to imagine the friend/romance angle is just a bunch of socially awkward folk at OpenAI misinterpreting the original paper.