Citation Information for Incident 21
Incident Status
CSETv0 Taxonomy Classifications
Taxonomy Details
Full Description
The 2016 Winograd Schema Challenge highlighted shortcomings in the ability of artificially intelligent systems to understand context. The Challenge presents ambiguous sentences and asks AI systems to decipher them. The two winning entries were correct 48% of the time, while random chance was correct 45% of the time. Quan Liu of the University of Science and Technology of China (partnering with the University of Toronto and the National Research Council of Canada) and Nicos Isaak of the Open University of Cyprus presented the most successful systems. Notably, Google and Facebook did not participate.
Short Description
The 2016 Winograd Schema Challenge highlighted that even the most successful AI systems entered were correct only 3 percentage points more often than random chance.
Severity
Unclear/unknown
AI System Description
Artificially intelligent systems meant to understand ambiguous English sentences.
Sector of Deployment
Professional, scientific and technical activities
Relevant AI functions
Perception, Cognition, Action
Location
New York, NY
Named Entities
Winograd Schema Challenge, University of Science and Technology of China, Quan Liu, University of Toronto, National Research Council of Canada, Nicos Isaak, Open University of Cyprus
Technology Purveyor
Quan Liu, Nicos Isaak
Beginning Date
2016-01-01T00:00:00.000Z
Ending Date
2016-01-01T00:00:00.000Z
Near Miss
Unclear/unknown
Intent
Unclear
Lives Lost
No
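The comparison with random chance in the descriptions above can be read two ways: as an absolute gap in percentage points or as a relative improvement over the baseline. The following is a minimal sketch of that arithmetic, using only the 48% and 45% figures reported in the Full Description; the variable names are hypothetical and chosen for illustration.

```python
# Accuracy figures taken from the Full Description of this record.
best_entry_accuracy = 0.48     # the two winning entries
random_chance_accuracy = 0.45  # reported random-chance baseline

# Absolute gap between the best entries and chance, in percentage points.
gap_points = round((best_entry_accuracy - random_chance_accuracy) * 100, 1)

# Relative gap, expressed as a percentage of the random-chance baseline.
relative_gain = round((best_entry_accuracy / random_chance_accuracy - 1) * 100, 1)

print(f"{gap_points} percentage points above chance")
print(f"{relative_gain}% relative improvement over chance")
```

Under these figures, the "3%" in the Short Description corresponds to the absolute gap rather than the relative one.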
GMF Taxonomy Classifications
Taxonomy Details
Known AI Goal
Question Answering
Known AI Technology
Language Modeling, Distributional Learning
Potential AI Technology
Transformer
Potential AI Technical Failure
Generalization Failure, Dataset Imbalance, Underfitting, Context Misidentification
CSETv1 Taxonomy Classifications
Taxonomy Details
Incident Reports
The following former incidents have been converted to "issues" following an update to the incident definition and ingestion criteria.
21: Tougher Turing Test Exposes Chatbots’ Stupidity
Description: The 2016 Winograd Schema Challenge highli…
Variants
Similar Incidents