Incident 283: Facebook’s Automated Content Moderation Tool Flagged a Post Containing Parts of the Declaration of Independence as Hate Speech by Mistake

Description: Facebook’s content moderation algorithm was acknowledged by the company to have flagged excerpts of the Declaration of Independence posted by a small newspaper in Texas as hate speech by mistake.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: Facebook developed and deployed an AI system, which harmed The Vindicator.

Incident Stats

Incident ID

283

Report Count

Incident Date

2018-07-02

Editors

Khoa Lam

Applied Taxonomies

GMF, MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

7.3. Lack of capability or robustness

Risk Domain

AI system safety, failures, and limitations

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Facebook has apologized for flagging parts of the Declaration of Independence as hate speech

businessinsider.com

businessinsider.com · 2018

Facebook has apologized for taking down a post containing excerpts of the Declaration of Independence, saying that it was mistakenly flagged as hate speech.
In addition to apologizing, Facebook has restored the post.
This is just the lates…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Previous Incident Next Incident