Description: Meta and Bloomberg allegedly used Books3, a dataset containing 191,000 pirated books, to train their AI models, including LLaMA and BloombergGPT, without author consent. Lawsuits from authors such as Sarah Silverman and Michael Chabon claim this constitutes copyright infringement. Books3 includes works from major publishers like Penguin Random House and HarperCollins. Meta argues its AI outputs are not "substantially similar" to the original books, but legal challenges continue.
Entities
View all entitiesAlleged: Various generative AI developers , Meta , EleutherAI , Bloomberg , The Pile and Shawn Presser developed an AI system deployed by Various generative AI developers , Meta , EleutherAI and Bloomberg, which harmed Zadie Smith , Writers , Verso , Stephen King , Sarah Silverman , Richard Kadrey , Publishers found in Books3 , Penguin Random House , Oxford University Press , Over 170,000 authors found in Books3 , Michael Pollan , Margaret Atwood , Macmillan , HarperCollins , General public , Creative industries , Christopher Golden and Authors.
Alleged implicated AI systems: The Pile , LLaMA , hugging face , GPT-J , Books3 , BloombergGPT and Bibliotik
Incident Stats
Incident ID
996
Report Count
2
Incident Date
2020-10-25
Editors
Daniel Atherton
Incident Reports
Reports Timeline
/cdn.vox-cdn.com/uploads/chorus_asset/file/24778390/668894138.jpg)
Comedian and author Sarah Silverman, as well as authors Christopher Golden and Richard Kadrey — are suing OpenAI and Meta each in a US District Court over dual claims of copyright infringement.
The suits alleges, among other things, that Op…
Updated at 1:40 p.m. ET on September 25, 2023
Editor's note: This article is part of The Atlantic's series on Books3. Check out our searchable Books3 database to find specific authors and titles. A deeper analysis of what is in the database…
Variants
A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.
Seen something similar?