Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

Agathe Balayn; Panagiotis Mavridis; Alessandro Bozzon; Benjamin Timmermans; Zoltán Szlávik

Web Information Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Training machine learning (ML) models for natural language processing usually requires large amounts of data, often acquired through crowdsourcing. The way this data is collected and aggregated can affect the outputs of the trained model, for example by ignoring labels that differ from the majority. In this paper, we investigate how label aggregation can bias ML results towards certain data samples, and we propose a methodology to highlight and mitigate this bias. Although our work is applicable to any kind of label aggregation for data subject to multiple interpretations, we focus on the bias that majority voting introduces into toxicity prediction over sentences. Our preliminary results indicate that we can mitigate the majority bias and increase prediction accuracy for minority opinions if we take the different annotator labels into account when training adapted models, rather than relying on the aggregated labels.
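The effect of majority voting described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy annotations, the function names (`majority_vote`, `discarded_fraction`), and the disaggregated representation are all assumptions made for illustration. It only shows how aggregating to a single label drops minority judgements, and how each annotator's label could be kept as a separate training example instead.

```python
from collections import Counter

# Hypothetical toy data: sentence id -> binary toxicity labels,
# one label per annotator (1 = toxic, 0 = not toxic).
annotations = {
    "s1": [1, 1, 1, 1, 1],
    "s2": [0, 0, 0, 1, 1],
    "s3": [0, 1, 1, 1, 0],
}


def majority_vote(labels):
    """Return the most frequent label among the annotators."""
    return Counter(labels).most_common(1)[0][0]


def discarded_fraction(labels):
    """Fraction of individual labels that disagree with the majority label."""
    winner = majority_vote(labels)
    return sum(1 for label in labels if label != winner) / len(labels)


# Aggregated view: one label per sentence, minority opinions are lost.
aggregated = {sid: majority_vote(labels) for sid, labels in annotations.items()}

# How much annotator signal the aggregation throws away per sentence.
lost = {sid: discarded_fraction(labels) for sid, labels in annotations.items()}

# Disaggregated view (an assumed alternative): keep one training example
# per (sentence, individual label) pair so minority labels stay visible.
disaggregated = [
    (sid, label) for sid, labels in annotations.items() for label in labels
]

print(aggregated)
print(lost)
print(len(disaggregated))
```

In this toy data, two of the three sentences lose 40% of their individual labels under majority voting, which is the kind of signal a model trained only on aggregated labels never sees.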

Original language	English
Title of host publication	Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management
Editors	Lora Aroyo, Anca Dumitrache, Praveen Paritosh, Alex Quinn, Chris Welty, Alessandro Checco, Gianluca Demartini, Ujwal Gadiraju, Cristina Sarasua
Publisher	CEUR-WS
Pages	67-71
Number of pages	5
Volume	2276
Publication status	Published - 2018
Event	1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management, University of Zurich, Zurich, Switzerland. Duration: 5 Jul 2018 → 5 Jul 2018. https://sites.google.com/view/crowdbias

Publication series

Name	CEUR Workshop Proceedings
Volume	2276
ISSN (Electronic)	1613-0073

Conference

Conference	1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management
Abbreviated title	SAD2018 CrowdBias2018
Country/Territory	Switzerland
City	Zurich
Period	5/07/18 → 5/07/18
Internet address	https://sites.google.com/view/crowdbias

Bibliographical note

Accepted Author Manuscript

Keywords

dataset bias
Machine Learning fairness
crowdsourcing
annotation aggregation

Access to Document

Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations (accepted author manuscript, 321 KB)

http://ceur-ws.org/Vol-2276

Cite this

Balayn, A., Mavridis, P., Bozzon, A., Timmermans, B., & Szlávik, Z. (2018). Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations. In L. Aroyo, A. Dumitrache, P. Paritosh, A. Quinn, C. Welty, A. Checco, G. Demartini, U. Gadiraju, & C. Sarasua (Eds.), Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management (Vol. 2276, pp. 67-71). Article 7 (CEUR Workshop Proceedings; Vol. 2276). CEUR-WS. http://ceur-ws.org/Vol-2276

Balayn, Agathe ; Mavridis, Panagiotis ; Bozzon, Alessandro et al. / Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations. Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management. editor / Lora Aroyo ; Anca Dumitrache ; Praveen Paritosh ; Alex Quinn ; Chris Welty ; Alessandro Checco ; Gianluca Demartini ; Ujwal Gadiraju ; Cristina Sarasua. Vol. 2276 CEUR-WS, 2018. pp. 67-71 (CEUR Workshop Proceedings).

@inproceedings{43f84e8d71f7437984a6b6cac86253e5,

title = "Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations",

keywords = "dataset bias, Machine Learning fairness, crowdsourcing, annotation aggregation",

author = "Agathe Balayn and Panagiotis Mavridis and Alessandro Bozzon and Benjamin Timmermans and Zolt{\'a}n Szl{\'a}vik",

note = "Accepted Author Manuscript; 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management, SAD2018 CrowdBias2018 ; Conference date: 05-07-2018 Through 05-07-2018",

year = "2018",

language = "English",

volume = "2276",

series = "CEUR Workshop Proceedings",

publisher = "CEUR-WS",

pages = "67--71",

editor = "Lora Aroyo and Anca Dumitrache and Praveen Paritosh and Alex Quinn and Chris Welty and Alessandro Checco and Gianluca Demartini and Ujwal Gadiraju and Cristina Sarasua",

booktitle = "Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management",

url = "https://sites.google.com/view/crowdbias",

}

Balayn, A, Mavridis, P, Bozzon, A, Timmermans, B & Szlávik, Z 2018, Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations. in L Aroyo, A Dumitrache, P Paritosh, A Quinn, C Welty, A Checco, G Demartini, U Gadiraju & C Sarasua (eds), Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management. vol. 2276, 7, CEUR Workshop Proceedings, vol. 2276, CEUR-WS, pp. 67-71, 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management, Zurich, Switzerland, 5/07/18. <http://ceur-ws.org/Vol-2276>

Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations. / Balayn, Agathe; Mavridis, Panagiotis; Bozzon, Alessandro et al.
Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management. ed. / Lora Aroyo; Anca Dumitrache; Praveen Paritosh; Alex Quinn; Chris Welty; Alessandro Checco; Gianluca Demartini; Ujwal Gadiraju; Cristina Sarasua. Vol. 2276 CEUR-WS, 2018. p. 67-71 7 (CEUR Workshop Proceedings; Vol. 2276).

TY - GEN

T1 - Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

AU - Balayn, Agathe

AU - Mavridis, Panagiotis

AU - Bozzon, Alessandro

AU - Timmermans, Benjamin

AU - Szlávik, Zoltán

N1 - Accepted Author Manuscript

PY - 2018

Y1 - 2018

KW - dataset bias

KW - Machine Learning fairness

KW - crowdsourcing

KW - annotation aggregation

M3 - Conference contribution

VL - 2276

T3 - CEUR Workshop Proceedings

SP - 67

EP - 71

BT - Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management

A2 - Aroyo, Lora

A2 - Dumitrache, Anca

A2 - Paritosh, Praveen

A2 - Quinn, Alex

A2 - Welty, Chris

A2 - Checco, Alessandro

A2 - Demartini, Gianluca

A2 - Gadiraju, Ujwal

A2 - Sarasua, Cristina

PB - CEUR-WS

T2 - 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management

Y2 - 5 July 2018 through 5 July 2018

ER -

Balayn A, Mavridis P, Bozzon A, Timmermans B, Szlávik Z. Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations. In Aroyo L, Dumitrache A, Paritosh P, Quinn A, Welty C, Checco A, Demartini G, Gadiraju U, Sarasua C, editors, Proceedings of the 1st Workshop on Subjectivity, Ambiguity and Disagreement in Crowdsourcing, and Short Paper Proceedings of the 1st Workshop on Disentangling the Relation Between Crowdsourcing and Bias Management. Vol. 2276. CEUR-WS. 2018. p. 67-71. 7. (CEUR Workshop Proceedings).
