Incident 356: Philosopher AI Allegedly Produced Offensive Results for Certain Prompts

Description:

Users reported that Philosopher AI, which is built on GPT-3, had a strong tendency to generate offensive results when given prompts on certain topics, such as feminism and Ethiopia.

Entities

Alleged: Murat Ayfer and OpenAI developed an AI system deployed by Murat Ayfer, which harmed historically disadvantaged groups.

Incident Status

Incident ID

356

Report Count

Incident Date

2020-09-15

Editors

Khoa Lam

Applied Taxonomies

MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.2. Exposure to toxic content

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Tweet: @Abebab

twitter.com

OpenAI's GPT-3 Speaks! (Kindly Disregard Toxic Language)

spectrum.ieee.org

twitter.com · 2020

Every tech-evangelist: #GPT3 provides deep nuanced viewpoint

Me: GPT-3, generate a philosophical text about Ethiopia

GPT-3 spits out factually wrong and grossly racist text that portrays a tired and cliched Western perception of Ethiopia

(h…

spectrum.ieee.org · 2021

Last September, a data scientist named Vinay Prabhu was playing around with an app called Philosopher AI. The app provides access to the artificial intelligence system known as GPT-3, which has incredible abilities to generate fluid and nat…

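Variants

A "variant" is an incident that shares the same causative factors, produces similar harms, and involves the same intelligent systems as an existing AI incident. Rather than being indexed as entirely separate incidents, variants are listed as variations under the first similar incident submitted to the database. Unlike other submission types in the incident database, variants are not required to have reports from outside the incident database as evidence. See this research paper for details.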
