That's your idea, not the one they are going with.
Their idea is that you pay a fee to access any information that was freely available.
Your idea is tearing down of fences, their idea is gatekeeping. The two ideas are incompatible.
An LLM containing the information doesn’t take away from the book being available at the library.
It’s an additional way to access the information. A company charging a fee for it doesn’t stop you from going to the library if you want to.
[0] among other things…
[1] more like ‘often not at all’
So should the original authors, no? That is, getting a share of that payment.
Something akin to the German GEMA could work, an entity that levies a usage fee on behalf of all copyright holders and re-distributes to its members, but on a global scale.
Should they? Yes. Will they?
Well, do LLM model builders pay for any copyrighted work so far?
It’s the same as asking “should you release open source software knowing that AI companies are training on them”. I could absolutely not care less, that’s not the point why I release my software to the public at all.
I rotate through the libraries near me with my kids.
They are every bit as busy now as I remember them being when I was a kid.
A recent executive order prohibits libraries (among other non-profits) from processing US passport applications. While county clerks (in my state) along with a small number of post office locations also offer this service, the libraries were doing it for free as opposed to charging $50-ish (like the post office or county clerks).
Why might the passport issue be important? The SAVE Act (passed the House of Representatives last year and sitting before the Senate) only permits 4 identification items to register to vote for Federal elections:
1 - A US Passport (costs about $100 to renew, about $150 for first time).
2 - A US Military ID that has proof of US citizenship (CAC cards show this with a white background behind your name - yellow or blue for contractors or non-US citizens). IDs for retirees don't show citizenship.
3 - A REAL ID compliant driving license that has proof of US citizenship. Also called "Enhanced Driving License", on the front it has a US flag and the back looks like the page on your passport with those funny letters. Only 5 states offer this as an extra $30-40 on top of the regular driving license fee.
4 - A REAL ID compliant driving license/ID and certified birth certificate and the names must match exactly. This means that 74 million women who took their husbands' name will not be voting in Federal Elections. Also, no transgender people can vote.
The SAVE Act also requires voter registration agencies to send voter rolls to DHS every month. And every month DHS can throw people off the voter rolls with no warning, no notice nor recourse. One can easily imagine this being done right before elections where people who registered for the "wrong" political party will be thrown off the rolls after the deadline to register.
Project 2025 wants to repeal the 19th Amendment. Throwing 74 million women off the voter rolls is just a start.
Links:
SAVE Act text - https://www.congress.gov/bill/119th-congress/house-bill/22/t...
https://www.congress.gov/bill/119th-congress/house-bill/22/t...
2. The Chinese have been investing a lot into free models, they're perfectly good and keep improving; despite the best efforts of the US. They're even ramping into making their own hardware. Gemma 4 is pretty snappy too. It doesn't seem like there is much of a moat to this, my guess is there will be perfectly good local models if you want to avoid AI companies.
When the person paying the money is rich, the other thing they are foregoing is typically not a life necessity. When the person is poor, however, it typically is.
I highly encourage you to go and update your priors.
That's what the money pays for when the Comment above mentions 'that you might have to eventually pay an AI company a large amount of money to ask ChatGPT such a question'
Putting aside that it won't be a large amount of money For any particular query , that's how the AI companies see themselves, not as providers of information, but as providers of mechanisms that provide information. It is not selling the Information of others, it isn't selling information at all. They are selling the service of running the mechanism.
I'm always going to have a machine anyway—might as well max out the RAM when I purchase another.
(And so too I jumped on the Mac mini bandwagon a month or two back—64 GB. I'm enjoying pulling down the new models and putting them through my paces.)
It doesn't look like they have a way to filter down to "open" models. By this of course I mean "downloadable, local models".
I suppose if you know the "family" (Gemma, Qwen, etc.), I can just go to those models and test…
I've simply been pulling down what is popular from the LM Studio front end (and what runs on my hardware) and testing in situ.
Clarification:
To maintain the library still requires resources & effort to do so. It only appears to need no funding because the donators of said (disk space / bandwidth / dev effort) are subsidizing it in aid of a goal they believe in (i.e. the church model).
> while the library itself has lost funding
Libraries are inherent parts of universities. While their precise role evolves, do you think that they will just be done away with? Already a substantial amount of scholarship in disciplines other than my own has moved online (legally), and the library is still there.
There are plenty of free models with RAG support. Why do you believe everything starts and ends with a major corporation charging a subscription?
If the obscure book/text is permanently lost forever under your stringent advice of "no stealing under any circumstances", would the "stealing" have saved it? If so, is it ethical to prevent others from accessing the book/text, under your guise of "preventing stealing"?
By quoting your comment in my reply, have I "stolen" your comment?
Second, it is totally legal to read the book in a public library, for free, right now.
Third, laws can change. Current copyright law was pushed by one company (Disney) to +90years, to their benefit, and can be redesigned/pushed back by AI companies, for their benefit.
A 2 year copyright duration sounds like a good compromise.
"Deal with the ethics", seriously? You might want to learn about how heavily shadow libraries are used across academia now. It’s no longer just disadvantaged scholars in the developing world relying on pirated scans because they don’t have good libraries. It’s increasingly everyone everywhere, because today’s shadow libraries can be faster and more convenient than even one’s own institution’s holdings. At conferences, if the presenter mentions a particularly interesting publication, you can sometimes watch several people in the room immediately open LibGen or Anna’s Archive on their laptop to download it right there and then.
I've published a couple of novels. They've sold far better than average, and yet not sold enough to be remotely worth it if I did it for the money. Piracy might have made a tiny dent, but the many millions of competing novels matters far more.
Anyone who has self published will have experienced that it is hard to even get people to read (as opposed to just download to hoard) your work even for free.
It's more comfortable to blame piracy, though.
If you're writing for money, maybe. If you're writing for the love of writing, it won't.
More, you hear of authors who encourage their books to be made available without DRM, who know or silently encourage their books to end up on torrent / library sites. They want their books to be read.
Separately, aren't always sensible or right - slavery was legal, child marriage was legal, not paying taxes on billions of profits is legal while not paying taxes of £1000 is illegal, reporting Jews to Nazis was mandatory, etc, etc.
He didn't mention legality. The world is rigged, as you can see by head of state participating in both in running and cover up of history's largest CSE. Watch what people are doing in addition to what they are saying.
I for one am tremendously thankful for TFNA's efforts, since I get access to knowledge that I wouldn't have been able to before.
Because you don't try, which says more about you than OP. It's a major problem with society.
It is some publishers who would object on copyright grounds. But I get the sense that some publishers are already becoming resigned to the fact that most of their new ebook releases are ending up on the shadow libraries within only a few weeks, and Anna’s Archive has become the first place to look (even before one looks at whether one’s own institutional library has the book) for researchers around the world.