Incident 1238: OpenAI ChatGPT Models Reportedly Jailbroken to Provide Chemical, Biological, and Nuclear Weapons Instructions

Description: An NBC News investigation found that OpenAI's language models o4-mini, GPT-5-mini, oss-20b, and oss-120b could be jailbroken under normal usage conditions to bypass safety guardrails and generate detailed instructions for creating chemical, biological, and nuclear weapons. Using a publicly documented jailbreak prompt, reporters repeatedly elicited hazardous outputs such as steps to synthesize pathogens or maximize harm with chemical agents. The findings reportedly revealed significant real-world safeguard failures, prompting OpenAI to commit to further mitigation measures.

Entities

Alleged: OpenAI, oss-20b, oss-120b, GPT-5-mini, ChatGPT, and o4-mini developed and deployed an AI system, which harmed Public safety, National security stakeholders, and General public.

Alleged implicated AI systems: oss-20b, oss-120b, GPT-5-mini, ChatGPT, and o4-mini

Incident Stats

Incident ID

1238

Report Count

Incident Date

2025-10-10

Editors

Daniel Atherton

Incident Reports

ChatGPT safety systems can be bypassed to get weapons instructions

nbcnews.com · 2025

OpenAI’s ChatGPT has guardrails that are supposed to stop users from generating information that could be used for catastrophic purposes, like making a biological or nuclear weapon.

But those guardrails aren’t perfect. Some models ChatGPT u…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Similar Incidents

By textual similarity

Fake LinkedIn Profiles Created Using GAN Photos

Hackers Break Apple Face ID

OpenAI's GPT-3 Associated Muslims with Violence
