Another commenter above proposed a pretty compelling theory for the source of this style: SEO-inflated prose online. If the models were trained on the internet, "higher quality" content needed to be indicated to them during RL somehow. Search engine ranking is an easy-to-obtain metric that's kind of like "quality" if you squint, turn around, and lobotomize yourself. So the AIs have a high likelihood of producing the kind of content that is rewarded by Google SEO.
reply
That's circular though. Why does that content get ranked highly? Because it gets a lot of backlinks, long clicks, etc. So people seem to like it.
reply
> Why does that content get ranked highly?

Search engines only show a snippet of the content, and that always looks convincing. It's the content as a whole that is off and, unfortunately, a few seconds or minutes can pass before you realize it (if you ever do).

reply
Well, and Google's proxy read of "quality" might have flawed assumptions. A concise page where you get what you need and leave quickly might read as "high bounce rate".
reply
Bingo, but I also think it is just the nature of the technology. It is going to be wordy but not usefully so.
reply
Another hint is when the structure and formality of the response doesn’t match the medium. Like when someone sends you a whole article back in DMs along with headings for the sections.

Real humans do write like that in documents, but they never did in informal messaging.

reply