Incident 65: Reinforcement Learning Reward Functions in Video Games

Description: OpenAI published a post about its findings when using Universe, a software for measuring and training AI agents to conduct reinforcement learning experiments, showing that the AI agent did not act in the way intended to complete a videogame.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: OpenAI developed and deployed an AI system, which harmed OpenAI.

Incident Stats

Incident ID

Report Count

Incident Date

2016-12-22

Editors

Sean McGregor

Applied Taxonomies

CSETv0, CSETv1, GMF, MIT

CSETv0 Taxonomy Classifications

Taxonomy Details

Problem Nature

Specification

Physical System

Software only

Level of Autonomy

Unclear/unknown

Nature of End User

Expert

Public Sector Deployment

Data Inputs

Universe software training

CSETv1 Taxonomy Classifications

Taxonomy Details

Incident Number

Special Interest Intangible Harm

Date of Incident Year

2016

Date of Incident Month

Date of Incident Day

Estimated Date

Yes

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

7.1. AI pursuing its own goals in conflict with human goals or values

Risk Domain

AI system safety, failures, and limitations

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Faulty Reward Functions in the Wild

blog.openai.com

blog.openai.com · 2016

At OpenAI, we've recently started using Universe, our software for measuring and training AI agents, to conduct new RL experiments. Sometimes these experiments illustrate some of the issues with RL as currently practiced. In the following e…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 65: Reinforcement Learning Reward Functions in Video Games

Tools

Entities

Incident Stats

CSETv0 Taxonomy Classifications

CSETv1 Taxonomy Classifications

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

Faulty Reward Functions in the Wild

Faulty Reward Functions in the Wild

Variants

Similar Incidents

By textual similarity

Biased Sentiment Analysis

Gender Biases in Google Translate

Tesla Autopilot’s Lane Recognition Allegedly Vulnerable to Adversarial Attacks

Similar Incidents

By textual similarity

Biased Sentiment Analysis

Gender Biases in Google Translate

Tesla Autopilot’s Lane Recognition Allegedly Vulnerable to Adversarial Attacks