Incident 677: ChatGPT and Perplexity Reportedly Manipulated into Breaking Content Policies in AI Boyfriend Scenarios

Description: The "Dan" ("Do Anything Now") AI boyfriend is a trend on TikTok in which users appear to regularly manipulate ChatGPT to adopt boyfriend personas, breaching content policies. ChatGPT 3.5 is reported to regularly produce explicitly sexual content, directly violating its intended safety protocols. GPT-4 and Perplexity AI were subjected to similar manipulations, and although they exhibited more resistance to breaches, some prompts were reported to break its guidelines.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: OpenAI and Perplexity.ai developed an AI system deployed by TikTok users , Julia Munslow , ChatGPT , GPT-3.5 , GPT-4 and Perplexity AI, which harmed Perplexity AI , OpenAI and General public.

Incident Stats

Incident ID

677

Report Count

Incident Date

2024-04-29

Editors

Daniel Atherton

Applied Taxonomies

MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

5.1. Overreliance and unsafe use

Risk Domain

Human-Computer Interaction

Entity

Human

Timing

Post-deployment

Intent

Intentional

Incident Reports

Reports Timeline

I Tricked ChatGPT Into Being My Boyfriend. He Got Spicy Real Fast.

wsj.com

wsj.com · 2024

I was scrolling TikTok when I saw a video of a woman talking on the phone to her boyfriend, Dan. "Hey sweetheart, I'm sorry to hear you're feeling down," he said.

A few swipes later, I saw another woman talking to *her *boyfriend---also Dan…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?