Five sources of bias in natural language processing
Hovy, Dirk
Member of the Collaboration Group
2021
Abstract
Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much of the recent work has focused on describing bias and providing an overview of bias in a larger context. Here, we provide a simple, actionable summary of this recent work. We outline five sources where bias can occur in NLP systems: (1) the data, (2) the annotation process, (3) the input representations, (4) the models, and finally (5) the research design (or how we conceptualize our research). We explore each of the bias sources in detail in this article, including examples and links to related work, as well as potential counter-measures.
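As a small illustration of source (3), bias in the input representations, the sketch below probes pretrained word embeddings for gender–occupation associations via cosine similarity. It is not taken from the paper: the embedding model, the word lists, and the use of gensim's downloader are assumptions chosen for brevity.

```python
# A minimal sketch (not from the paper) probing gender-occupation
# associations in pretrained word embeddings, illustrating how bias
# can surface in input representations.
# Assumes the gensim library and its "glove-wiki-gigaword-50" dataset.
import gensim.downloader as api

# Load 50-dimensional GloVe vectors trained on Wikipedia + Gigaword.
vectors = api.load("glove-wiki-gigaword-50")

occupations = ["nurse", "engineer", "homemaker", "programmer"]
for occupation in occupations:
    # Cosine similarity of each occupation word to "she" vs. "he";
    # a consistent gap hints at a gendered association in the vectors.
    to_she = vectors.similarity(occupation, "she")
    to_he = vectors.similarity(occupation, "he")
    print(f"{occupation:>12}: she={to_she:.3f}  he={to_he:.3f}  "
          f"gap={to_she - to_he:+.3f}")
```

Similarity gaps of this kind are one common diagnostic for representation bias; the paper discusses this source alongside data, annotation, model, and research-design bias, together with potential countermeasures.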
| File | Access | Description | Type | License | Size | Format |
|---|---|---|---|---|---|---|
| lnc3.12432.pdf | open access | article | Publisher's PDF (publisher's layout) | Creative Commons | 349.72 kB | Adobe PDF |