Incident 129: Facebook's Automated Tools Failed to Adequately Remove Hate Speech, Violence, and Incitement

Description: Internal documents showed that Facebook's automated moderation tools performed far worse than human moderators, accounting for only a small fraction of removals of hate speech, violence, and incitement content.

Alleged: Facebook developed and deployed an AI system, which harmed Facebook users.

Incident Stats

Incident ID: 129
Report Count: 1
Incident Date: 2021-03-01
Editors: Sean McGregor, Khoa Lam

Incident Reports

Facebook AI moderator confused videos of mass shootings and car washes
arstechnica.com · 2021

Facebook CEO Mark Zuckerberg sounded an optimistic note three years ago when he wrote about the progress his company was making in automated moderation tools powered by artificial intelligence. “Through the end of 2019, we expect to have tr…

Variants

A "variant" is an incident that shares the same causative factors, produces similar harms, and involves the same intelligent systems as a known AI incident. Rather than index variants as entirely separate incidents, we list variations of incidents under the first similar incident submitted to the database. Unlike other submission types to the incident database, variants are not required to have reporting in evidence external to the Incident Database. Learn more from the research paper.