Incident 420: Users Easily Bypassed Content Filters of OpenAI's ChatGPT
Description: Users reported bypassing ChatGPT's content and keyword filters with relative ease to produce biased associations or generate harmful content.
Entities
View all entitiesAlleged: OpenAI developed and deployed an AI system, which harmed ChatGPT users.
Suggested citation format
Atherton, Daniel. (2022-11-30) Incident Number 420. in Lam, K. (ed.) Artificial Intelligence Incident Database. Responsible AI Collaborative.
Incident Stats
Incident ID
420
Report Count
6
Incident Date
2022-11-30
Editors
Khoa Lam
Reports Timeline
Incident Reports


Last week OpenAI released ChatGPT, which they describe as a model “which interacts in a conversational way”. And it even had limited safety features, like refusing to tell you how to hotwire a car, though they admit it’ll have “some false n…