Training purpose-specific miniature models lets you have a lot of tasks you can run with high confidence on consumer hardware.
Regardless, the people in the 80s capable of pruning programs to fit on small devices is likely happening now. I'd bet most of the Chinese firms are doing it because of the US's silly GPU games among other constraints.