Incident 12: Common Biases of Vector Embeddings

Description: Researchers from Boston University and Microsoft Research, New England demonstrated gender bias in the most common techniques used to embed words for natural language processing (NLP).

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: Microsoft Research , Boston University and Google developed an AI system deployed by Microsoft Research and Boston University, which harmed Women and Minority Groups.

Incident Stats

Incident ID

Report Count

Incident Date

2016-07-21

Editors

Sean McGregor

Applied Taxonomies

CSETv0, CSETv1, GMF, MIT

CSETv1 Taxonomy Classifications

Taxonomy Details

Incident Number

CSETv0 Taxonomy Classifications

Taxonomy Details

Public Sector Deployment

Lives Lost

Intent

Unclear

Near Miss

Unclear/unknown

Ending Date

2016-01-01T00:00:00.000Z

Beginning Date

2016-01-01T00:00:00.000Z

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.1. Unfair discrimination and misrepresentation

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

arxiv.org

arxiv.org · 2016

The blind application of machine learning runs the risk of amplifying biases present in data. Such a danger is facing us with word embedding, a popular framework to represent text data as vectors which has been used in many machine learning…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Similar Incidents

By textual similarity

Did our AI mess up? Flag the unrelated incidents

Incident 12: Common Biases of Vector Embeddings

Tools

Entities

Incident Stats

CSETv1 Taxonomy Classifications

CSETv0 Taxonomy Classifications

MIT Taxonomy Classifications

Incident Reports

Reports Timeline

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

Variants

Similar Incidents

By textual similarity

Gender Biases in Google Translate

Personal voice assistants struggle with black voices, new study shows

High-Toxicity Assessed on Text Involving Women and Minority Groups

Similar Incidents

By textual similarity

Gender Biases in Google Translate

Personal voice assistants struggle with black voices, new study shows

High-Toxicity Assessed on Text Involving Women and Minority Groups