In cases like this it is usually incumbent on the entity claiming the clean-room situation was pure to show their working. For instance how Compaq clean-room cloned the IBM BIOS chip¹ was well documented (the procedures used, records of comms by the teams involved) where some other manufacturers did face costly legal troubles from IBM.
So the question is “is the clean-room claim sufficiently backed up to stand legal tests?” [and moral tests, though the AI world generally doesn't care about failing those]
--------
[1] the one part of their PCs that was not essentially off-the-shelf, so once it could be reliably legally mimicked this created an open IBM PC clone market
> *Context:* The registry maps every supported encoding to its metadata. Era assignments MUST match chardet 6.0.0's `chardet/metadata/charsets.py` at https://raw.githubusercontent.com/chardet/chardet/f0676c0d6a...
> Fetch that file and use it as the authoritative reference for which encodings belong to which era. Do not invent era assignments.
[0] https://github.com/chardet/chardet/issues/327#issuecomment-4...
a2mark has to demonstrate that v7 is "a work containing the v6 or a portion of it, either verbatim or with modifications and/or translated straightforwardly into another language", which is different from demanding a clean-room reimplementation.
Theoretically, the existence of a publicly available commit that is half v6 code and half v7 can be used to show that this part of v7 code has been infected by LGPL and must thus infect the rest of v7, but that's IMO going against the spirit of the [L]GPL.
which, minimally instructs it to directly examine the test suite: `4. High encoding accuracy on the chardet test suite`
You can't just dismiss it then say the claimant has to provide proof.
Plus the argument put forth is that they can re-license the project. It's not a new one made from scratch.
Now if you had 2 entirely distinct humans involved in the process that might work though.