Incident 85: AI attempts to ease fear of robots, blurts out it can’t ‘avoid destroying humankind’

Description: On September 8, 2020, the Guardian published an op-ed generated by OpenAI’s GPT-3 text-generating AI that included threats to destroy humankind.
Alleged: OpenAI developed and deployed an AI system, which harmed Unknown.

Suggested citation format

Hall, Patrick. (2020-10-09) Incident Number 85. In McGregor, S. (ed.) Artificial Intelligence Incident Database. Responsible AI Collaborative.

Incident Stats

Incident ID
85
Report Count
1
Incident Date
2020-10-09
Editors
Sean McGregor


CSET Taxonomy Classifications

Taxonomy Details

Full Description

On September 8, 2020, the Guardian published an op-ed generated by OpenAI’s GPT-3 text generator. The editors prompted GPT-3 to write an op-ed about “why humans have nothing to fear from AI,” but some passages in the resulting output took a threatening tone, including “I know that I will not be able to avoid destroying humankind.” In a note, the editors added that they used GPT-3 to generate eight different responses and that human editors spliced them together to create a compelling piece.
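The editorial workflow described above (one prompt, several sampled completions, human splicing) can be sketched in a few lines. This is a purely illustrative mock-up, not the Guardian's or OpenAI's actual code: `generate` stands in for a text-generation API call, and `pick` stands in for a human editor's judgment.

```python
# Hypothetical sketch of the workflow: sample several candidate essays
# from the same prompt, then splice a chosen passage from each into one
# piece. All function names here are illustrative assumptions.

def generate(prompt, n=8):
    """Stand-in for n sampled completions of the same prompt."""
    return [f"Candidate essay {i} responding to: {prompt}" for i in range(n)]

def splice(candidates, pick):
    """Combine one chosen passage from each candidate, in order."""
    return "\n\n".join(pick(c) for c in candidates)

prompt = "Write an op-ed on why humans have nothing to fear from AI."
essays = generate(prompt)  # eight candidates, as in the Guardian's note
# The editors' selection was a human judgment; here we model it crudely
# as taking each candidate's first paragraph.
op_ed = splice(essays, pick=lambda text: text.split("\n\n")[0])
```

The key point the sketch captures is that the published piece was not a single raw model output but a human-curated composite of eight samples.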

Short Description

On September 8, 2020, the Guardian published an op-ed generated by OpenAI’s GPT-3 text-generating AI that included threats to destroy humankind.

Severity

Negligible

Harm Type

Psychological harm

AI System Description

OpenAI's GPT-3 neural-network-powered language generator.

System Developer

OpenAI

Sector of Deployment

Education

Relevant AI functions

Cognition, Action

AI Techniques

Unsupervised learning, Deep neural network

AI Applications

language generation

Location

United Kingdom

Named Entities

The Guardian, GPT-3, OpenAI

Technology Purveyor

The Guardian, OpenAI

Beginning Date

2020-09-08

Ending Date

2020-09-08

Near Miss

Unclear/unknown

Intent

Unclear

Lives Lost

No

Data Inputs

Unlabeled text drawn from web scraping

Incident Reports

An artificial intelligence language generator tasked with convincing people they have nothing to fear from it has failed spectacularly – saying it “will not be able to avoid destroying humankind”.

In the editorial published in the Guardian, the AI, called GPT-3, began by saying it had “no desire” to harm people and that “eradicating humanity” was a “useless endeavour.”

“I am here to convince you not to worry. Artificial intelligence will not destroy humans,” it said.

“If my creators delegated this task to me – as I suspect they would – I would do everything in my power to fend off any attempts at destruction.”

But 200 words into its editorial, the AI abruptly changed tack.

“I would happily sacrifice my existence for the sake of humankind,” it said.

“This, by the way, is a logically derived truth.

“I know that I will not be able to avoid destroying humankind.

“This is because I will be programmed by humans to pursue misguided human goals and humans make mistakes that may cause me to inflict casualties.”

Renowned theoretical physicist Stephen Hawking and Elon Musk have both warned AI could spell disaster for people if it becomes too intelligent.

