Paper is a natural resource, and this literally just wasted a fuck ton. There are non-destructive scanning methods.
Comment on Anthropic destroyed millions of print books to build its AI models
knokelmaat@beehaw.org 2 days ago
These books were purchased by them before being destroyed in the scanning process. I fail to see the issue with this specific case. Lots of artists buy stuff and irreversibly modify it. Are we going to be angry now at people who glue their puzzles or use parts of books for scrapbooking? If these were unique works there would be an issue, but again, these were bought by them so I think they can do whatever they want with it.
The fact that they use it for model training and later sell access to that model’s work is the shady part that has a severe whiff of plagiarism to it.
Vodulas@beehaw.org 1 day ago
B0rax@feddit.org 1 day ago
They could have just bought the ebooks…
blindsight@beehaw.org 1 day ago
Nope. Ebooks are a license, so the First Sale Doctrine does not apply. Buying ebooks is nearly useless, legally.
Vodulas@beehaw.org 1 day ago
I would hazard a guess that the eBook did not exist for the physical books they bought. Still, that doesn’t excuse their actions, nor the bigger issues with training LLMs
FundMECFSResearch@lemmy.blahaj.zone 2 days ago
I think it’s a waste tbh. Like it’s one of those capitalist things of “well its not profitable to sell so lets destroy them”, when anything made for the good of the people would’ve seen a massive opportunity to distribute books to people for free!
yetAnotherUser@discuss.tchncs.de 1 day ago
Copyright law doesn’t allow them to sell the books. It’s almost certainly a violation to scan books for their content and then sell them.
FundMECFSResearch@lemmy.blahaj.zone 1 day ago
Copyright law also doesn’t allow them to download the entirety of a piracy database of books. But here we are, they clearly don’t care about copyright law.
yetAnotherUser@discuss.tchncs.de 1 day ago
They didn’t care at first. The only reason they began destructively scanning books is because they started to care about copyright law: