Skip to Content
logologo
AI Incident Database
Open TwitterOpen RSS FeedOpen FacebookOpen LinkedInOpen GitHub
Open Menu
Découvrir
Envoyer
  • Bienvenue sur AIID
  • Découvrir les incidents
  • Vue spatiale
  • Vue de tableau
  • Vue de liste
  • Entités
  • Taxonomies
  • Soumettre des rapports d'incident
  • Classement des reporters
  • Blog
  • Résumé de l’Actualité sur l’IA
  • Contrôle des risques
  • Incident au hasard
  • S'inscrire
Fermer
Découvrir
Envoyer
  • Bienvenue sur AIID
  • Découvrir les incidents
  • Vue spatiale
  • Vue de tableau
  • Vue de liste
  • Entités
  • Taxonomies
  • Soumettre des rapports d'incident
  • Classement des reporters
  • Blog
  • Résumé de l’Actualité sur l’IA
  • Contrôle des risques
  • Incident au hasard
  • S'inscrire
Fermer
Entités

Writers

Affecté par des incidents

Incident 9974 Rapports
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

2023-02-28

Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.

Plus

Incident 9952 Rapports
The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

2023-12-27

The New York Times alleges that OpenAI and Microsoft used millions of its articles without permission to train AI models, including ChatGPT. The lawsuit claims the companies scraped and reproduced copyrighted content without compensation, in turn undermining the Times’s business and competing with its journalism. Some AI outputs allegedly regurgitate Times articles verbatim. The lawsuit seeks damages and demands the destruction of AI models trained on its content.

Plus

Incident 9962 Rapports
Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

2020-10-25

Meta and Bloomberg allegedly used Books3, a dataset containing 191,000 pirated books, to train their AI models, including LLaMA and BloombergGPT, without author consent. Lawsuits from authors such as Sarah Silverman and Michael Chabon claim this constitutes copyright infringement. Books3 includes works from major publishers like Penguin Random House and HarperCollins. Meta argues its AI outputs are not "substantially similar" to the original books, but legal challenges continue.

Plus

Entités liées
Autres entités liées au même incident. Par exemple, si le développeur d'un incident est cette entité mais que le responsable de la mise en œuvre est une autre entité, ils sont marqués comme entités liées.
 

Entity

OpenAI

Incidents impliqués en tant que développeur et déployeur
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Microsoft

Incidents impliqués en tant que développeur et déployeur
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

The New York Times

Affecté par des incidents
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Journalists

Affecté par des incidents
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Journalism

Affecté par des incidents
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Media organizations

Affecté par des incidents
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

publishers

Affecté par des incidents
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

ChatGPT

Incidents implicated systems
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

GPT-4

Incidents implicated systems
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Microsoft Bing Chat

Incidents implicated systems
  • Incident 995
    2 Report

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Plus
Entity

Various generative AI developers

Incidents impliqués en tant que développeur et déployeur
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Meta

Incidents impliqués en tant que développeur et déployeur
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

EleutherAI

Incidents impliqués en tant que développeur et déployeur
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Bloomberg

Incidents impliqués en tant que développeur et déployeur
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

The Pile

Incidents involved as Developer
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Shawn Presser

Incidents involved as Developer
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Zadie Smith

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Verso

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Stephen King

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Sarah Silverman

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Richard Kadrey

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Publishers found in Books3

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Penguin Random House

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Oxford University Press

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Over 170,000 authors found in Books3

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Michael Pollan

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Margaret Atwood

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Macmillan

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

HarperCollins

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

General public

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Creative industries

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Christopher Golden

Affecté par des incidents
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Authors

Affecté par des incidents
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

LLaMA

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

hugging face

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

GPT-J

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Books3

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

BloombergGPT

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Bibliotik

Incidents implicated systems
  • Incident 996
    2 Report

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Plus
Entity

Academic researchers

Affecté par des incidents
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Plus
Entity

OpenAI models

Incidents implicated systems
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Plus
Entity

Llama 3

Incidents implicated systems
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Plus
Entity

Library Genesis (LibGen)

Incidents implicated systems
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Plus
Entity

BitTorrent

Incidents implicated systems
  • Incident 997
    4 Report

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Plus

Recherche

  • Définition d'un « incident d'IA »
  • Définir une « réponse aux incidents d'IA »
  • Feuille de route de la base de données
  • Travaux connexes
  • Télécharger la base de données complète

Projet et communauté

  • À propos de
  • Contacter et suivre
  • Applications et résumés
  • Guide de l'éditeur

Incidents

  • Tous les incidents sous forme de liste
  • Incidents signalés
  • File d'attente de soumission
  • Affichage des classifications
  • Taxonomies

2024 - AI Incident Database

  • Conditions d'utilisation
  • Politique de confidentialité
  • Open twitterOpen githubOpen rssOpen facebookOpen linkedin
  • 1420c8e