According to US AI labs, training on other people's output is fair use. So that's how.
reply
AI training is considered transformative. That's how AI training gets around copyright, and it's probably consistent with copyright precedent. For example, indexing the web is considered transformative, even though you can recover the full text of everything in an inverted index.
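To make the inverted-index point concrete, here is a minimal sketch, assuming the index stores token positions (a plain term-to-document index without positions would not allow this). The function names and the two sample documents are made up for illustration, and this is not how any particular search engine stores its data.

```python
from collections import defaultdict


def build_positional_index(docs):
    """Map each token to a list of (doc_id, position) postings."""
    index = defaultdict(list)
    for doc_id, text in docs.items():
        for pos, token in enumerate(text.split()):
            index[token].append((doc_id, pos))
    return index


def reconstruct(index, doc_id):
    """Rebuild one document's token sequence from the postings alone."""
    slots = {}
    for token, postings in index.items():
        for posting_doc, pos in postings:
            if posting_doc == doc_id:
                slots[pos] = token
    # Positions are contiguous from 0, so walk them in order.
    return " ".join(slots[i] for i in range(len(slots)))


# Hypothetical sample documents, for illustration only.
docs = {"a": "the quick brown fox", "b": "the lazy dog sleeps"}
index = build_positional_index(docs)
```

Calling `reconstruct(index, "a")` gives back the token sequence of document `a` without touching `docs`. Original whitespace is not preserved, since tokens are rejoined with single spaces.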
reply
Machine-extruded text is not copyrightable, since there was no human creativity involved in producing it.

(and if you argue the US models do produce copyrighted works, then oooops - whose copyright is it huh?)

reply
Ow my head.
reply
When I pay for a model, the copyright in the output belongs to me. This is as work-for-hire as it gets.
reply