Journalists
Incidents Harmed By
Incident 968 (24 Reports)
'Pravda' Network, Successor to 'Portal Kombat,' Allegedly Seeding AI Models with Kremlin Disinformation
2022-02-24
A Moscow-based disinformation network, Pravda, allegedly infiltrated AI models by flooding the internet with pro-Kremlin falsehoods. A NewsGuard audit found that 10 major AI chatbots repeated these narratives 33% of the time, citing Pravda sources as legitimate. The tactic, called "LLM grooming," manipulates AI training data to embed Russian propaganda. Pravda is part of Portal Kombat, a larger Russian disinformation network identified by VIGINUM in February 2024, but in operation since February 2022.
Incident 997 (3 Reports)
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models
2023-02-28
Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.
Incident 995 (2 Reports)
The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content
2023-12-27
The New York Times alleges that OpenAI and Microsoft used millions of its articles without permission to train AI models, including ChatGPT. The lawsuit claims the companies scraped and reproduced copyrighted content without compensation, in turn undermining the Times’s business and competing with its journalism. Some AI outputs allegedly regurgitate Times articles verbatim. The lawsuit seeks damages and demands the destruction of AI models trained on its content.
Incident 589 (1 Report)
Proliferation of AI-Generated News Websites and Content Farms Across Multiple Languages Degrading Information Integrity
2023-05-01
Scores of AI-generated news websites and content farms are producing low-quality, clickbait content in a variety of languages. They are reportedly spreading false information and degrading the quality of information available online. These sites often lack human oversight, feature repetitive language, and sometimes fabricate information, posing a threat to the credibility of online news sources.
Related Entities
Other entities that are related to the same incidents. For example, if this entity is the developer of an incident but another entity is the deployer, the two are marked as related entities.
OpenAI
Incidents involved as both Developer and Deployer
- Incident 997 (3 Reports)
Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models
- Incident 995 (2 Reports)
The New York Times Sues OpenAI and Microsoft Over Alleged Unauthorized AI Training on Its Content