Incident 143: Facebook’s and Twitter's Automated Content Moderation Reportedly Failed to Effectively Enforce Violation Rules for Small Language Groups

Description: Facebook's and Twitter were not able to sufficiently moderate content of small language groups such as the Balkan languages using AI, allegedly due to the lack of investment in human moderation and difficulty in AI-solution design for the languages.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: Facebook and Twitter developed and deployed an AI system, which harmed Facebook users of small language groups and Twitter users of small language groups.

Incident Stats

Incident ID

143

Report Count

Incident Date

2021-02-16

Editors

Sean McGregor, Khoa Lam

Applied Taxonomies

CSETv1, GMF, MIT

CSETv1 Taxonomy Classifications

Taxonomy Details

Incident Number

143

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.2. Exposure to toxic content

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Facebook, Twitter Struggling in Fight against Balkan Content Violations

balkaninsight.com

balkaninsight.com · 2021

According to the responses to BIRN’s questionnaire, some 57 per cent of those who reported hate speech said they were notified that the reported post/account violated the rules.

On the other hand, some 28 per cent said they had received not…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Airbnb's Trustworthiness Algorithm Allegedly Banned Users without Explanation, and Discriminated against Sex Workers

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 143: Facebook’s and Twitter's Automated Content Moderation Reportedly Failed to Effectively Enforce Violation Rules for Small Language Groups

Tools

Entities

Incident Stats

CSETv1 Taxonomy Classifications

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

Facebook, Twitter Struggling in Fight against Balkan Content Violations

Facebook, Twitter Struggling in Fight against Balkan Content Violations

Variants

Similar Incidents

By textual similarity

Airbnb's Trustworthiness Algorithm Allegedly Banned Users without Explanation, and Discriminated against Sex Workers

Facebook’s Political Ad Detection Reportedly Showed High and Geographically Uneven Error Rates

Indian Political App Tek Fog Allegedly Hijacked Trends and Manipulated Public Opinion on Other Social Media Platforms

Similar Incidents

By textual similarity

Airbnb's Trustworthiness Algorithm Allegedly Banned Users without Explanation, and Discriminated against Sex Workers

Facebook’s Political Ad Detection Reportedly Showed High and Geographically Uneven Error Rates

Indian Political App Tek Fog Allegedly Hijacked Trends and Manipulated Public Opinion on Other Social Media Platforms