Incident 555: OpenAI's Training Data for LLMs Allegedly Comprised of Copyrighted Books

Description: Two authors alleged in a class action lawsuit OpenAI infringed authors' copyrights by incorporating illegal "shadow libraries" offering copyrighted books without permission in the training data of its generative LLMs, such as ChatGPT.

Editor Notes: Clarified title.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: OpenAI developed and deployed an AI system, which harmed Paul Tremblay , Mona Awad and authors of copyrighted works.

Incident Stats

Incident ID

555

Report Count

Incident Date

2018-06-11

Editors

Khoa Lam

Applied Taxonomies

MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

2.1. Compromise of privacy by obtaining, leaking or correctly inferring sensitive information

Risk Domain

Privacy & Security

Entity

Human

Timing

Pre-deployment

Intent

Intentional

Incident Reports

Reports Timeline

Lawsuit says OpenAI violated US authors' copyrights

itnews.com.au

itnews.com.au · 2023

Two US authors sued OpenAI in San Francisco federal court, claiming in a proposed class action that the company misused their works to "train" its popular generative artificial-intelligence system ChatGPT.

Massachusetts-based writers Paul T…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Fake LinkedIn Profiles Created Using GAN Photos

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 555: OpenAI's Training Data for LLMs Allegedly Comprised of Copyrighted Books

Tools

Entities

Incident Stats

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

Lawsuit says OpenAI violated US authors' copyrights

Lawsuit says OpenAI violated US authors' copyrights

Variants

Similar Incidents

By textual similarity

Fake LinkedIn Profiles Created Using GAN Photos

Amazon Censors Gay Books

Ever AI Reportedly Deceived Customers about FRT Use in App

Similar Incidents

By textual similarity

Fake LinkedIn Profiles Created Using GAN Photos

Amazon Censors Gay Books

Ever AI Reportedly Deceived Customers about FRT Use in App