Description: CodeWall reported that its autonomous agent exploited vulnerabilities in McKinsey's Lilli AI platform and obtained unauthorized read and write access to production systems, allegedly exposing internal chat messages, files, user accounts, and prompts. McKinsey confirmed the vulnerability and said it fixed the issue within hours, but said it found no evidence that client data or client confidential information were accessed.
Editor Notes: Treated as an incident rather than an issue because the report alleges a realized unauthorized access event against McKinsey's live Lilli production system, with actual internal data and prompt-layer assets reportedly exposed, rather than just being a theoretical or unexploited vulnerability.
Entities
View all entitiesAlleged: McKinsey & Company , CodeWall , Retrieval-augmented generation (RAG) system , Lilli , CodeWall autonomous offensive agent , AI-powered enterprise search system and AI document analysis system developed and deployed an AI system, which harmed McKinsey & Company , McKinsey & Company employees , McKinsey & Company consultants and Lilli users.
Alleged implicated AI systems: Retrieval-augmented generation (RAG) system , Lilli , CodeWall autonomous offensive agent , AI-powered enterprise search system and AI document analysis system
Incident Stats
Incident ID
1412
Report Count
1
Incident Date
2026-02-28
Editors
Daniel Atherton
Incident Reports
Reports Timeline
Loading...
McKinsey & Company --- the world's most prestigious consulting firm --- built an internal AI platform called Lilli for its 43,000+ employees. Lilli is a purpose-built system: chat, document analysis, RAG over decades of proprietary research…
Variants
A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.
Seen something similar?
Similar Incidents
Did our AI mess up? Flag the unrelated incidents
Similar Incidents
Did our AI mess up? Flag the unrelated incidents

