AI Incident Database
Entities

Academic researchers

Incidents Harmed By

Incident 997
4 Reports
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

2023-02-28

Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.

Incident 1308
1 Report
Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

2025-04-18

"Mastering Machine Learning: From Basics to Advanced," published by Springer Nature in April 2025, reportedly contained numerous purportedly nonexistent or materially incorrect academic citations. Independent checks allegedly found that many referenced works did not exist or were misattributed, with multiple named researchers reportedly confirming they did not author the cited material. The pattern of errors has been described as consistent with known LLM citation hallucinations.

Incident 1309
1 Report
Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

2025-06-17

'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems,' published by Springer Nature in June 2025, reportedly contains numerous purportedly untraceable academic citations. Independent analyses by multiple researchers allegedly found that a substantial share of references in certain chapters could not be verified, including citations to journals that do not exist. Citation patterns reportedly appear consistent with known large language model hallucination behaviors.

Related Entities
Other entities that are related to the same incidents. For example, if this entity is the developer of an incident and another entity is the deployer, the two are marked as related entities.
 

Entity

OpenAI

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Meta

Incidents involved as both Developer and Deployer
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Writers

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

publishers

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Journalists

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Authors

Incidents Harmed By
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

OpenAI models

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Llama 3

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Library Genesis (LibGen)

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

GPT-4

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

BitTorrent

Incidents implicated systems
  • Incident 997
    4 Reports

    Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models

Entity

Springer Nature

Incidents involved as Deployer
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Govindakumar Madhavan

Incidents involved as Deployer
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

Entity

Unknown large language model developers

Incidents involved as Developer
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Unknown generative AI developers

Incidents involved as Developer
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

students

Incidents Harmed By
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Readers of academic and technical books

Incidents Harmed By
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

Entity

Epistemic integrity

Incidents Harmed By
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Unknown large language models

Incidents implicated systems
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Unknown generative AI systems

Incidents implicated systems
  • Incident 1308
    1 Report

    Springer Nature Book 'Mastering Machine Learning: From Basics to Advanced' Reportedly Published With Numerous Purportedly Nonexistent or Incorrect Citations

  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Srikanta Patnaik

Incidents involved as Deployer
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Jair Minoro Abe

Incidents involved as Deployer
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Kazumi Nakamatsu

Incidents involved as Deployer
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Francesco Vigliarolo

Incidents involved as Deployer
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Unnamed chapter authors

Incidents involved as Deployer
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

AI researchers

Incidents Harmed By
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations

Entity

Readers of academic and technical publications

Incidents Harmed By
  • Incident 1309
    1 Report

    Springer Nature Book 'Social, Ethical and Legal Aspects of Generative AI: Tools, Techniques and Systems' Reportedly Published With Numerous Purportedly Fabricated or Unverifiable Citations


2024 - AI Incident Database