Skip to Content
logologo
AI Incident Database
Open TwitterOpen RSS FeedOpen FacebookOpen LinkedInOpen GitHub
Open Menu
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Entities

Sarah Silverman

Incidents Harmed By

Incident 9962 Report
Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

2020-10-25

Meta and Bloomberg allegedly used Books3, a dataset containing 191,000 pirated books, to train their AI models, including LLaMA and BloombergGPT, without author consent. Lawsuits from authors such as Sarah Silverman and Michael Chabon claim this constitutes copyright infringement. Books3 includes works from major publishers like Penguin Random House and HarperCollins. Meta argues its AI outputs are not "substantially similar" to the original books, but legal challenges continue.

More

Related Entities
Other entities that are related to the same incident. For example, if the developer of an incident is this entity but the deployer is another entity, they are marked as related entities.
 

Entity

Various generative AI developers

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Meta

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

EleutherAI

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Bloomberg

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

The Pile

Incidents involved as Developer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Shawn Presser

Incidents involved as Developer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Zadie Smith

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Writers

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Verso

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Stephen King

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Richard Kadrey

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Publishers found in Books3

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Penguin Random House

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Oxford University Press

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Over 170,000 authors found in Books3

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Michael Pollan

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Margaret Atwood

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Macmillan

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

HarperCollins

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

General public

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Creative industries

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Christopher Golden

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Authors

Incidents Harmed By
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

LLaMA

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

hugging face

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

GPT-J

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Books3

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

BloombergGPT

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More
Entity

Bibliotik

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

More

Research

  • Defining an “AI Incident”
  • Defining an “AI Incident Response”
  • Database Roadmap
  • Related Work
  • Download Complete Database

Project and Community

  • About
  • Contact and Follow
  • Apps and Summaries
  • Editor’s Guide

Incidents

  • All Incidents in List Form
  • Flagged Incidents
  • Submission Queue
  • Classifications View
  • Taxonomies

2024 - AI Incident Database

  • Terms of use
  • Privacy Policy
  • Open twitterOpen githubOpen rssOpen facebookOpen linkedin
  • ecd56df