Incident 465: Generative Models Trained on Dataset Containing Private Medical Photos

Description: Text-to-image models trained on the LAION-5B dataset, such as Stable Diffusion and Imagen, were able to regurgitate private medical record photos that had been used as training data without consent or recourse for removal.

Entities

Alleged: Stability AI, Google, and LAION developed an AI system deployed by Stability AI and Google, which harmed people with medical photos online.

Incident Stats

Incident ID: 465

Report Count

Incident Date: 2022-03-03

Editors: Khoa Lam

Applied Taxonomies

MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain: 2.1. Compromise of privacy by obtaining, leaking or correctly inferring sensitive information

Risk Domain: Privacy & Security

Entity: Human

Timing: Pre-deployment

Intent: Unintentional

Incident Reports

Artist finds private medical record photos in popular AI training data set

arstechnica.com · 2022

Late last week, a California-based AI artist who goes by the name Lapine discovered private medical record photos taken by her doctor in 2013 referenced in the LAION-5B image set, which is a scrape of publicly available images on the web. A…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Similar Incidents

By textual similarity

Sexist and Racist Google Adsense Advertisements

All Image Captions Produced are Violent

AI-Designed Phone Cases Are Unexpected
