Your point is circular, so let me bring it all around. Say I use an LLM to make a 'clean-room' implementation of software that has a GPL license. How does a court verify that my black box didn't use the original software in any way if there's no way to know? Does having that software as part of its training corpus automatically make all of its output subject to the GPL? This is essentially the question some courts are attempting to answer right now.