upvote
> LLMs don’t output full copyrighted works word for word

Apparently they do, as per the evidence in the NYT vs OpenAI suit.

reply
Isn’t the output of LLMs completely copyright-free in the US?
reply
One lower court has said that the output of AI models is uncopyrightable.

But the real unsettled issue is if model training is fair use, and where copyright infringement might creep in to model output.

reply
The copyright office itself also says this when it talks about determining authorship.
reply
> Anthropic and others argue that because LLMs don’t output full copyrighted works word for word - hence their LLMs aren’t infringing on copyright laws.

That surely can't be what they argue, because I'm sure I can't translate a copyrighted book into a different language and say "that's fine, it's not word-for-word".

reply