And yet: LLMs write entirely on the basis of human input. Presumably there exists a great quantity of median, representative text, some lowest common denominator, from humans whose writing matches these heuristics.
(In particular: why are LLMs so fond of em-dashes, when I'm not sure I've ever seen them used in the wilds of the internet?)