upvote
Not exactly automatically.

Seeding is opt-out, not opt-in… but it is usually a default that has to actively manually overridden. Most users never touch those settings. The average pirate downloading a torrent is seeding whether they know it or not.

The protocol absolutely does not enforce seeding. A client can lie to the tracker, cap upload to 0k. BitTorrent has no mechanism to compel one to share. Leeching a file, downloading and sharing no forward packets is possible. While the "social contract" of seeding is entirely a norm enforced by private trackers and community shame. It is not the protocol itself.

reply
deleted
reply
seeding is not the only way you actually upload

you're uploading before seeding, and i'm willing to bet Meta weren't seeding but, as they correctly stated in that regard, they're sharing even when they try their best not to because of the way the protocol works as zero-upload is typically impractical for any significant size files

some trackers will additionally penalise you for not sharing file parts, but this depends on the tracker

reply
and the protocol doesn't enforce you upload anything.

The original design called for some kind of tit-for-tat algorithm, but it's long obsolete and you get whatever bandwidth the seeder has.

reply
If you try to download any significant file with zero-upload, you will run out of peers that will share with you much earlier than you will download the file. It's not practical.

Most people that speak of leeching or not seeding really are talking about not seeding at all after they've completed. In fact, most clients will let you set upload speeds to a trickle but not zero (zero means unlimited in most clients). From a legal standpoint, that already means you uploaded.

reply
It’s true that most clients do not support a zero upload configuration, but it’s not inherent to the protocol, and modified clients exist.

I’m not aware of any clients that will refuse to share data with clients that are configured to not upload. I don’t even see how they could determine that, especially in situations where there are no other peers to upload to, and given that stats are entirely self-reported and clients that send bogus numbers exist.

You would need a central tracker that cares, which is what private torrent communities rely on, but not public/DHT torrents such as those discussed here.

reply
You’re correct about seeds, but peers who are also downloading will often stop sharing with you if you stop sharing with them. Seeds generally are configured to try to give different pieces to different peers so that they can send them to each other and reduce load on the seed; they don’t want to give you the entire file directly unless you’re the only person downloading. And peers prioritize and filter which other ones they’ll send pieces to based on reciprocity.

You will probably get the data eventually, and it really depends on the composition and configuration of the swarm, but generally, you do need to upload if you want to ensure the fastest and most reliable download.

reply
Long-running torrents are mostly populated by seeders. Bit torrent was originally designed for a lot of downloaders to get a file at the same time with limited seeding bandwidth, so leechers would need to trade with each other a lot, but that's not really the situation most torrents are in today.
reply
It depends what you’re downloading. Peer prioritization mechanics have been relevant for a lot of my recent downloads.
reply
You can, but you will slow down your own downloads dramatically by doing so. In some cases you will fail to finish them.

The case for doing this would be just so you can have this ridiculous legal defence Meta seem to be trying to pull out. Really no other good reason. Even for the most parasitic leeches, zero upload is a bad strategy.

reply
Seeders don't know how much data you shared with other leechers.
reply
Yes. So?
reply
So you can download from seeders as fast as they can upload.
reply
This is entirely dependent on the client on the other end of the connection.
reply
"tit-for-tat" trading of chunks only happens between peers that both are actively downloading. Seeding nodes just let anybody leech.

You totally CAN disable all uploads in the torrent protocol. Just set the "upload budget" to zero in most clients. Just nobody realizes they can do that.

Bittorrent is wildly successful in part because every popular client makes it nontrivial to "opt out" of it's more socialist components (chunk trading, DHT participation, seeding by default).

Making an "leech behavior only" torrent client is straightforward and viable.

reply
Tit-for-tat kicks in. It's fine for smaller files to just jump peers with zero upload, but i reckon Meta would have found it challenging to download very large files without sharing. It's certainly much faster if you don't get throttled or banned by many peers.
reply
Would you say that generally books would be considered a small file or a BIG file?
reply
they'd most certainly go for very large curated collections like those of Anna's Archives, we're talking about 10s or 100s of TBs per archive

going 1 by 1 would be quite the exercise in itself considering just how much variety of formats, styles, crap added in the files, random password crapware, etc etc you find for anything other than the most trendy stuff

reply
I can't believe that no one has ever tried that one before... So do we now roll back all of the previous copyright cases where downloading music with bittorrent has been prosecuted?
reply
> So do we now roll back all of the previous copyright cases where downloading music with bittorrent has been prosecuted

No, because those cases were pirating-while-poor. This is pirating-while-trillion-dollar-corporation, which falls under a completely different section of the law.

reply
At this stage, you are going to far in claiming that. So far, all that happened is that Meta's lawyers claimed it was fair use. They are paid to try every argument they can think of that might work. Just because they make the argument doesn't mean the court will find it has any merit.
reply
Meta has so much money, even if they end up paying they’ll probably barely be affected. In that case, actually GP is wrong and it’s the same law, but still different outcomes (like “neither poor nor rich may sleep on public benches…”)
reply
While you are correct that a decision on this specific case is still pending, your parent comment does have a point that breaking the law while rich and while poor have very different outcomes. Also, no way they’re going to roll back all previous cases. So the joke works now, no need to wait.
reply
From my understanding, Meta's use of the pirated book was accepted as fair use and the plaintiffs admitted to no harm. In the case of pirated music and films, neither of those points are made. Copyright holders assume people who pirate would have bought the content, usually even assuming that one download is one lost sale. And I am not aware of a single case where watching or listening to pirated content was accepted as fair use.

It is interesting to follow how this plays out for Meta and how that will impact future cases.

reply
One of the underlying issues is that punitive damages seem to be the norm in US courts.

In the UK you can only claim for the actual damages incurred, which at most will be the profit you would've made on the sale of that book. Which makes most claims for private infringement uneconomical for corporations.

reply
Note though that the court can award more than this in some circumstances. From the Copyright, Designs and Patents Act of 1988, section 97 [1]:

  (2) The court may in an action for infringement of copyright having regard to all the circumstances, and in particular to—

    (a)the flagrancy of the infringement, and

    (b)any benefit accruing to the defendant by reason of the infringement, award such additional damages as the justice of the case may require.
I think most copyright systems have some provision for damages beyond lost profits, because if they did not what incentive would there be to not infringe?

[1] https://www.legislation.gov.uk/ukpga/1988/48/section/97

reply
I don’t get that, the use of these books was instrumental and necessary for the success of the training run. The expected value of these training runs is high as the build out of 100 billion+ infrastructure demonstrates, so the book publishers should at a minimum be paid a licensing fee, a small fraction of every inference run revenue or whatever they decide. The fact that authors and publishers didn’t get any say under what conditions their intellectual property can be used is pretty outrageous.
reply
The conclusion was they suffered no legal harm, in that their interests such as their continued publishing of books was not affected by LLMs; no one is using AI to compete with publishers, if anything "authors" might very well use those same publishers to get their generated books on shelves.

If it's fair use, no licensing fee is needed.

reply
So pretty much the same as the Authors Guild, Inc. v. Google, Inc. case ruling it as fair use as a transformative work. I mean if indexing the worlds books is transformative then a neural net run on them certainly is a transformative work and fair use.
reply
Yes, that's how the argument is going for these lawsuits so far.
reply
We consumers just need BiTorrent clients that come with LLM training code incorporated, as that transforms the downloads into fair use (according to the very expensive Meta legal team).
reply
The use of the pirated book is a totally separate action than acquiring the pirated book.
reply
When I pull the trigger and the bullet kills an another person, it is just how technology works. Why would I be responsible if I choose to use it or not?
reply
I'm going to need a copy of your latest bank statement before i can accurately answer that.
reply
Even if the court accepts the argument, it can be undermined by pointing out that they knew it in advance, or could have known, and thus accepted it.
reply
I agree, that people used to be called "leechers". Somewhat related xkcd https://xkcd.com/553/
reply
deleted
reply
This. You can set upload speed to zero, and download entire dataset without uploading anything. Slower but doable.
reply
As far as I know, setting upload speed to zero disables the limit. You can set it to be very low but not zero.
reply
That is client dependent. On rtorrent, there is a separate "off" setting for the speed throttle that means "no throttle" with the result that "zero" actually means "no uploading".
reply
You can patch it so zero means zero.
reply
I think it's a fair argument in the context of big corporations using the technology.
reply
Lawyers are paid to defend a position. They are intellectual prostitutes.
reply
My client didn't "buy" illegal drugs. He received illegal drugs. But anyone who makes a drug deal automatically sends money to the drug dealer, as inherent of the protocol. In other words, "giving money for drugs" wasn't a choice, it was simply how drug deals work.
reply