Incident 259: YouTuber Built, Made Publicly Available, and Released Model Trained on Toxic 4chan Posts as Prank

Description: A YouTuber built GPT-4chan, a model based on EleutherAI’s GPT-J and trained on posts containing racism, misogyny, and antisemitism collected from 4chan’s “politically incorrect” board. He made the model publicly available and deployed it as multiple bots that posted thousands of messages on the same 4chan board as a prank.

Entities

Alleged: Yannic Kilcher developed and deployed an AI system, which harmed internet social platform users.

Incident Stats

Incident ID

259

Report Count

Incident Date

2022-06-03

Editors

Khoa Lam

Applied Taxonomies

GMF, MIT

GMF Taxonomy Classifications

Known AI Goal Snippets

(Snippet Text: The bot, which Kilcher called GPT-4chan, “the most horrible model on the internet” (a reference to GPT-3, a language model developed by OpenAI that uses deep learning to produce text), was shockingly effective and replicated the tone and feel of 4chan posts.; Related Classifications: Social Media Content Generation)

MIT Taxonomy Classifications

Machine-Classified

Risk Subdomain

1.2. Exposure to toxic content

Risk Domain

Discrimination and Toxicity

Entity

Human

Timing

Post-deployment

Intent

Intentional

Incident Reports

AI Trained on 4Chan Becomes ‘Hate Speech Machine’

vice.com · 2022

AI researcher and YouTuber Yannic Kilcher trained an AI using 3.3 million threads from 4chan’s infamously toxic Politically Incorrect /pol/ board. He then unleashed the bot back onto 4chan with predictable results: the AI was just as vile as…

YouTuber trains AI bot on 4chan’s pile o’ bile with entirely predictable results

theverge.com · 2022

A YouTuber named Yannic Kilcher has sparked controversy in the AI world after training a bot on posts collected from 4chan’s Politically Incorrect board (otherwise known as /pol/). The board is 4chan’s most popular and well-known for its to…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.
