Skip to Content
logologo
AI Incident Database
Open TwitterOpen RSS FeedOpen FacebookOpen LinkedInOpen GitHub
Open Menu
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Entities

BitTorrent

Incidents implicated systems

Incident 9974 Report
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

2023-02-28

Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.

More

Related Entities
Other entities that are related to the same incident. For example, if the developer of an incident is this entity but the deployer is another entity, they are marked as related entities.
 

Entity

OpenAI

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Meta

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Writers

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

publishers

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Journalists

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Authors

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Academic researchers

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

OpenAI models

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Llama 3

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

Library Genesis (LibGen)

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More
Entity

GPT-4

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

More

Research

  • Defining an “AI Incident”
  • Defining an “AI Incident Response”
  • Database Roadmap
  • Related Work
  • Download Complete Database

Project and Community

  • About
  • Contact and Follow
  • Apps and Summaries
  • Editor’s Guide

Incidents

  • All Incidents in List Form
  • Flagged Incidents
  • Submission Queue
  • Classifications View
  • Taxonomies

2024 - AI Incident Database

  • Terms of use
  • Privacy Policy
  • Open twitterOpen githubOpen rssOpen facebookOpen linkedin
  • ecd56df