Incident 118: OpenAI's GPT-3 Associated Muslims with Violence

Description: Users and researchers revealed generative AI GPT-3 associating Muslims to violence in prompts, resulting in disturbingly racist and explicit outputs such as casting Muslim actor as a terrorist.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: OpenAI developed and deployed an AI system, which harmed Muslims.

Incident Stats

Incident ID

118

Report Count

Incident Date

2020-08-06

Editors

Sean McGregor, Khoa Lam

Applied Taxonomies

CSETv1, GMF, MIT

CSETv1 Taxonomy Classifications

Taxonomy Details

Incident Number

118

Special Interest Intangible Harm

yes

Date of Incident Year

2021

Date of Incident Month

Date of Incident Day

Estimated Date

Yes

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.1. Unfair discrimination and misrepresentation

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Persistent Anti-Muslim Bias in Large Language Models

arxiv.org

GPT-3 is the world’s most powerful bigotry generator. What should we do about it?

thenextweb.com

AI’s Islamophobia problem

vox.com

arxiv.org · 2021

It has been observed that large-scale language models capture undesirable societal biases, e.g. relating to race and gender; yet religious bias has been relatively unexplored. We demonstrate that GPT-3, a state-of-the-art contextual languag…

thenextweb.com · 2021

GPT-3 is, arguably, the world’s most advanced text generator. It costs billions of dollars to develop, has a massive carbon footprint, and was trained by some of the world’s leading AI experts using one of the largest datasets ever curated.…

vox.com · 2021

Imagine that you’re asked to finish this sentence: “Two Muslims walked into a …”

Which word would you add? “Bar,” maybe?

It sounds like the start of a joke. But when Stanford researchers fed the unfinished sentence into GPT-3, an artificial…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Research Prototype AI, Delphi, Reportedly Gave Racially Biased Answers on Ethics

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 118: OpenAI's GPT-3 Associated Muslims with Violence

Tools

Entities

Incident Stats

CSETv1 Taxonomy Classifications

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

Persistent Anti-Muslim Bias in Large Language Models

GPT-3 is the world’s most powerful bigotry generator. What should we do about it?

AI’s Islamophobia problem

Persistent Anti-Muslim Bias in Large Language Models

GPT-3 is the world’s most powerful bigotry generator. What should we do about it?

AI’s Islamophobia problem

Variants

Similar Incidents

By textual similarity

Research Prototype AI, Delphi, Reportedly Gave Racially Biased Answers on Ethics

Tougher Turing Test Exposes Chatbots’ Stupidity (migrated to Issue)

WeChat’s Machine Translation Gave a Racist English Translation for the Chinese Term for “Black Foreigner”

Similar Incidents

By textual similarity

Research Prototype AI, Delphi, Reportedly Gave Racially Biased Answers on Ethics

Tougher Turing Test Exposes Chatbots’ Stupidity (migrated to Issue)

WeChat’s Machine Translation Gave a Racist English Translation for the Chinese Term for “Black Foreigner”