The horse already left the barn. Every major AI lab scraped the entire internet years ago. Asking archive sites to "take a harder stance" now is just performative. The training data is baked in. The only real question left is whether we want the knowledge accessible to individuals too, or only locked inside corporate models.
reply
That is just not true. These AI scrapers are hammering all types of sites and causing their bills to explode.

https://www.pcmag.com/news/wikipedia-faces-flood-of-ai-bots-...

The nature of archives is that they are constantly updated.

reply
That's a good point I suppose.

I guess I'm just kind of sad. LLMs appropriately sourcing material could have been such a boon for artists in a way. I guess I feel like it was a missed opportunity for some mutual benefit.

Would have been really interesting at least.
