AI Incident Database
Entities

Writers

Affected by Incidents

Incident 997 (4 Reports)
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

2023-02-28

Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.

Incident 995 (2 Reports)
The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

2023-12-27

The New York Times alleges that OpenAI and Microsoft used millions of its articles without permission to train AI models, including ChatGPT. The lawsuit claims the companies scraped and reproduced copyrighted content without compensation, in turn undermining the Times’s business and competing with its journalism. Some AI outputs allegedly regurgitate Times articles verbatim. The lawsuit seeks damages and demands the destruction of AI models trained on its content.

Incident 996 (2 Reports)
Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

2020-10-25

Meta and Bloomberg allegedly used Books3, a dataset containing 191,000 pirated books, to train their AI models, including LLaMA and BloombergGPT, without author consent. Lawsuits from authors such as Sarah Silverman and Michael Chabon claim this constitutes copyright infringement. Books3 includes works from major publishers like Penguin Random House and HarperCollins. Meta argues its AI outputs are not "substantially similar" to the original books, but legal challenges continue.

Related Entities
Other entities that are related to the same incident. For example, if the developer of an incident is this entity but the deployer is another entity, they are marked as related entities.
 

Entity

OpenAI

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Microsoft

Incidents involved as both Developer and Deployer
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

The New York Times

Affected by Incidents
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Journalists

Affected by Incidents
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Journalism

Affected by Incidents
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Media organizations

Affected by Incidents
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Publishers

Affected by Incidents
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

ChatGPT

Incidents implicated systems
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

GPT-4

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Microsoft Bing Chat

Incidents implicated systems
  • Incident 995
    2 Reports

    The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content

Entity

Various generative AI developers

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Meta

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

EleutherAI

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Bloomberg

Incidents involved as both Developer and Deployer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

The Pile

Incidents involved as Developer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Shawn Presser

Incidents involved as Developer
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Zadie Smith

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Verso

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Stephen King

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Sarah Silverman

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Richard Kadrey

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Publishers found in Books3

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Penguin Random House

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Oxford University Press

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Over 170,000 authors found in Books3

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Michael Pollan

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Margaret Atwood

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Macmillan

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

HarperCollins

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

General public

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Creative industries

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Christopher Golden

Affected by Incidents
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Authors

Affected by Incidents
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

LLaMA

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Hugging Face

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

GPT-J

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Books3

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

BloombergGPT

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Bibliotik

Incidents implicated systems
  • Incident 996
    2 Reports

    Meta Allegedly Used Books3, a Dataset of 191,000 Pirated Books, to Train LLaMA AI

Entity

Academic researchers

Affected by Incidents
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

OpenAI models

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Llama 3

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Library Genesis (LibGen)

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

BitTorrent

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

2024 - AI Incident Database