undefined

points

by iepathos4 hours ago |

comments

by john_strinlai4 hours ago|

[-]

>The 'Firewall' they describe is an illusion because [...]

it is an illusion because this is a satire site.

by melvinram3 hours ago|

parent|

[-]

This service is provided "as is" without warranty. MalusCorp is not responsible for any legal consequences, moral implications, or late-night guilt spirals resulting from use of our services.

by lofaszvanitt3 hours ago|

parent|

prev|

[-]

"Our lawyers estimated $4M in compliance costs. MalusCorp's Total Liberation package was $50K. The board was thrilled. The open source maintainers were not, but who cares?"

by gwern4 hours ago|

prev|

[-]

The solution here seems to be to impose some constraint or requirement which means that literal copying is impossible (remember, copyright governs copies, it doesn't govern ideas or algorithms - that would be 'patents', which essentially no open source software has) or where any 'copying' from vaguely remembered pretraining code is on such an abstract indirect level that it is 'transformative' and thus safe.

For example, the Anthropic Rust C compiler could hardly have copied GCC or any of the many C compilers it surely trained on, because then it wouldn't have spat out reasonably idiomatic and natural looking Rust in a differently organized codebase.

Good news for Rust and Lean, I guess, as it seems like everyone these days is looking for an excuse to rewrite everything into those for either speed or safety or both.

by pron4 hours ago|

parent|

[-]

> copyright governs copies, it doesn't govern ideas or algorithms

The second part is true. The first is a little trickier. The copyright applies to some fixed media (text in this case) rather than the idea expressed, but the protections extend well beyond copies. For example, in fiction, the narrative arc and "arrangement" is also protected, as are adaptations and translations.

If you were to try and write The Catcher in the Rye in Italian completely from memory (however well you remember it) I believe that would be protected by copyright even if not a single sentence were copied verbatim.

by briandw3 hours ago|

prev|

[-]

Obviously satire, but it will clearly be what happens in the future (predicting here, I'm not endorsing this practice). We can scratch train a new LLM on code generated from "contaminated" LLMs. We can then audit all the training data used and demonstrate that the original source wasn't in the training data. Therefore the cleanroom implementation holds. Current LLM training is relying less and less on human generated code. Just look at the open source models from China. They rely heavily on distilling from other models. One additional point. Exposure to the original source isn't enough to show infringement. Linus looked at UNIX source before writing linux.

by neilv4 hours ago|

prev|

[-]

I think this site is either satire, or serious but with a certain kind of humor in which both they and the reader know they're lying (but it's in everyone's interest to play along).

They do say this:

> Is this legal? / our clean room process is based on well-established legal precedent. The robots performing reconstruction have provably never accessed the original source code. We maintain detailed audit logs that definitely exist and are available upon request to courts in select jurisdictions.

Unless they're rejecting almost all of open source packages submitted by the customer, due to those packages being in the training set of the foundation model that they use, this is really the opposite of cleanroom.

by littlestymaar3 hours ago|

prev|

[-]

This is definitely a parody though, not a real service.

by superxpro123 hours ago|

parent|

[-]

This site is an obvious parody, but like most comedy these days it betrays the severity of the issues happening today.

by ActivePattern4 hours ago|

prev|

[-]

[flagged]

by badrequest4 hours ago|

parent|

[-]

Ah, it really wouldn't be HN without baselessly accusing other posters you disagree with.

by john_strinlai4 hours ago|

parent|

[-]

i mean... the site is very clearly satire and the comment is clearly responding as if it is a real service.

i do not necessarily agree with the phrasing of ActivePatterns comment, but i also raised an eyebrow at iepathos' comment.

by svnt4 hours ago|

parent|

[-]

The pathos paradox: the more times a person introduces the word pathos in casual conversation the less likely they are to recognize humor/satire.