Qwen 3.6 shipped with working MTP first, and had working MTP in llama.cpp first.
Ultimately though the real explanation, I think, is Google doesn't care since for their own purposes (in LiteRT-LM), they do bundle them. As far as I know, anyway.
They are more like a single model that has two separate attention head mechanisms.