upvote
> is so absurd that it will probably genuinely burn down the entire global economy if paid.

Where did you get that idea. Global economy is ~200T/year PPP. 0.1% of that split across every artist you want the training data from would be insanely difficult for the vast majority of them to turn down. Which makes sense as art isn’t that big a percentage of the global economy compared to say housing, food, medical care, infrastructure, military spending etc.

Obviously the incentive to take without compensation is far more appealing, but that doesn’t mean it was impossible to make a reasonable offer.

reply
For all the people represented in the training data to receive royalties would be an incredible wealth transfer to the Extremely Online. My forum posts, StackOverflow answers etc are also contributing to the model outputs. The training data, by volume, mostly belongs to blog authors, redditors, Wikipedia editors, to us!
reply
The people in that counting to infinity subreddit would get compensated a lot if this were fully automated - their posts were so overrepresented in the training set that many of their usernames became complete tokens (e.g. SolidGoldMagikarp).
reply
Hey finally my reddit and hn habit can be lucrative!
reply
> The cumulative license fees required to properly compensate all artists is so absurd that it will probably genuinely burn down the entirety of global economy if paid.

That's kind of an interesting concept: "since the scale of my transgression was so big, I should get away with it scot-free."

reply
That’s how eminent domain and regulatory takings work in most countries.
reply
"If it took all the fossil fuel on Earth" What do you mean? To TRAIN an LLM model it takes roughly the same amount of energy as to raise a person, so it's not even really expensive in energy costs.
reply