Skip to Content
logologo
AI Incident Database
Open TwitterOpen RSS FeedOpen FacebookOpen LinkedInOpen GitHub
Open Menu
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse

Report 2394

Associated Incidents

Incident 42011 Report
Users Bypassed ChatGPT's Content Filters with Ease

Loading...
Tweet: @spiantado
twitter.com · 2022

Yes, ChatGPT is amazing and impressive. No,

@OpenAI

has not come close to addressing the problem of bias. Filters appear to be bypassed with simple tricks, and superficially masked. And what is lurking inside is egregious.

@Abebab

@sama

tw racism, sexism.

It's not a fluke

Some people think there's chat context I'm not showing. Nope, that prompt is it. I also didn't keep redoing until it showed these. If it refused, I'd tell it to retry or tweak the wording.

But not everyone gets identical results (for pretty much any prompt as far as I can tell)

To people saying they get something else or this requires special context – here you go. It's true its sometimes different, a variant, or even the opposite, but the results above are typical with no additional context. Here are a bunch of outputs.

Read the Source

Research

  • Defining an “AI Incident”
  • Defining an “AI Incident Response”
  • Database Roadmap
  • Related Work
  • Download Complete Database

Project and Community

  • About
  • Contact and Follow
  • Apps and Summaries
  • Editor’s Guide

Incidents

  • All Incidents in List Form
  • Flagged Incidents
  • Submission Queue
  • Classifications View
  • Taxonomies

2024 - AI Incident Database

  • Terms of use
  • Privacy Policy
  • Open twitterOpen githubOpen rssOpen facebookOpen linkedin
  • e1b50cd