Incident 393: Facebook AI-Supported Moderation for Ads Failed to Detect Violating Content

Description: Facebook's ad moderation system involving algorithms failed to flag hateful language and violating content such as calls for killings for ads in English and Swahili.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: Facebook developed and deployed an AI system, which harmed Facebook users speaking Swahili , Facebook users speaking English and Facebook users.

Incident Stats

Incident ID

393

Report Count

Incident Date

2021-12-08

Editors

Khoa Lam

Applied Taxonomies

CSETv1, MIT

CSETv1 Taxonomy Classifications

Taxonomy Details

Incident Number

393

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.2. Exposure to toxic content

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

In a 3rd test, Facebook still fails to block hate speech

apnews.com

apnews.com · 2022

Facebook is letting violent hate speech slip through its controls in Kenya as it has in other countries, according to a new report from the nonprofit groups Global Witness and Foxglove.

It is the third such test of Facebook’s ability to det…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 393: Facebook AI-Supported Moderation for Ads Failed to Detect Violating Content

Tools

Entities

Incident Stats

CSETv1 Taxonomy Classifications

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

In a 3rd test, Facebook still fails to block hate speech

In a 3rd test, Facebook still fails to block hate speech

Variants

Similar Incidents

By textual similarity

Facebook’s Political Ad Detection Reportedly Showed High and Geographically Uneven Error Rates

Facebook Allegedly Failed to Police Anti-Rohingya Hate Speech Content That Contributed to Violence in Myanmar

Wikipedia Vandalism Prevention Bot Loop

Similar Incidents

By textual similarity

Facebook’s Political Ad Detection Reportedly Showed High and Geographically Uneven Error Rates

Facebook Allegedly Failed to Police Anti-Rohingya Hate Speech Content That Contributed to Violence in Myanmar

Wikipedia Vandalism Prevention Bot Loop