Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web: A Survey of Technical Biases Informed by Psychology Literature

Agathe Balayn; Jie Yang; Zoltán Szlávik; Alessandro Bozzon

doi:10.1145/3479158

Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web: A Survey of Technical Biases Informed by Psychology Literature

Agathe Balayn, Jie Yang, Zoltán Szlávik, Alessandro Bozzon

Research output: Contribution to journal › Article › Scientific › peer-review

292 Downloads (Pure)

Abstract

The automatic detection of conflictual languages (harmful, aggressive, abusive, and offensive languages) is essential to provide a healthy conversation environment on the Web. To design and develop detection systems that are capable of achieving satisfactory performance, a thorough understanding of the nature and properties of the targeted type of conflictual language is of great importance. The scientific communities investigating human psychology and social behavior have studied these languages in details, but their insights have only partially reached the computer science community.

In this survey, we aim both at systematically characterizing the conceptual properties of online conflictual languages, and at investigating the extent to which they are reflected in state-of-the-art automatic detection systems. Through an analysis of psychology literature, we provide a reconciled taxonomy that denotes the ensemble of conflictual languages typically studied in computer science. We then characterize the conceptual mismatches that can be observed in the main semantic and contextual properties of these languages and their treatment in computer science works; and systematically uncover resulting technical biases in the design of machine learning classification models and the dataset created for their training. Finally, we discuss diverse research opportunities for the computer science community and reflect on broader technical and structural issues.

Original language	English
Article number	11
Pages (from-to)	11:1 - 11:56
Number of pages	56
Journal	ACM Transactions on Social Computing
Volume	4
Issue number	3
DOIs	https://doi.org/10.1145/3479158
Publication status	Published - 2021

Keywords

Bias
discrimination
cyberbullying
offensive language
abusive language
harassment
toxic language
harmful language

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1145/3479158

3479158Final published version, 2.1 MBLicence: CC BY

Cite this

@article{e7a5bb9692b04d62874b370f8d9adbc8,

title = "Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web: A Survey of Technical Biases Informed by Psychology Literature",

abstract = "The automatic detection of conflictual languages (harmful, aggressive, abusive, and offensive languages) is essential to provide a healthy conversation environment on the Web. To design and develop detection systems that are capable of achieving satisfactory performance, a thorough understanding of the nature and properties of the targeted type of conflictual language is of great importance. The scientific communities investigating human psychology and social behavior have studied these languages in details, but their insights have only partially reached the computer science community.In this survey, we aim both at systematically characterizing the conceptual properties of online conflictual languages, and at investigating the extent to which they are reflected in state-of-the-art automatic detection systems. Through an analysis of psychology literature, we provide a reconciled taxonomy that denotes the ensemble of conflictual languages typically studied in computer science. We then characterize the conceptual mismatches that can be observed in the main semantic and contextual properties of these languages and their treatment in computer science works; and systematically uncover resulting technical biases in the design of machine learning classification models and the dataset created for their training. Finally, we discuss diverse research opportunities for the computer science community and reflect on broader technical and structural issues.",

keywords = "Bias, discrimination, cyberbullying, offensive language, abusive language, harassment, toxic language, harmful language",

author = "Agathe Balayn and Jie Yang and Zolt{\'a}n Szl{\'a}vik and Alessandro Bozzon",

year = "2021",

doi = "10.1145/3479158",

language = "English",

volume = "4",

pages = "11:1 -- 11:56",

journal = "ACM Transactions on Social Computing",

number = "3",

}

Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web: A Survey of Technical Biases Informed by Psychology Literature. / Balayn, Agathe ; Yang, Jie; Szlávik, Zoltán et al.
In: ACM Transactions on Social Computing, Vol. 4, No. 3, 11, 2021, p. 11:1 - 11:56.

Research output: Contribution to journal › Article › Scientific › peer-review

TY - JOUR

T1 - Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web

T2 - A Survey of Technical Biases Informed by Psychology Literature

AU - Balayn, Agathe

AU - Yang, Jie

AU - Szlávik, Zoltán

AU - Bozzon, Alessandro

PY - 2021

Y1 - 2021

N2 - The automatic detection of conflictual languages (harmful, aggressive, abusive, and offensive languages) is essential to provide a healthy conversation environment on the Web. To design and develop detection systems that are capable of achieving satisfactory performance, a thorough understanding of the nature and properties of the targeted type of conflictual language is of great importance. The scientific communities investigating human psychology and social behavior have studied these languages in details, but their insights have only partially reached the computer science community.In this survey, we aim both at systematically characterizing the conceptual properties of online conflictual languages, and at investigating the extent to which they are reflected in state-of-the-art automatic detection systems. Through an analysis of psychology literature, we provide a reconciled taxonomy that denotes the ensemble of conflictual languages typically studied in computer science. We then characterize the conceptual mismatches that can be observed in the main semantic and contextual properties of these languages and their treatment in computer science works; and systematically uncover resulting technical biases in the design of machine learning classification models and the dataset created for their training. Finally, we discuss diverse research opportunities for the computer science community and reflect on broader technical and structural issues.

AB - The automatic detection of conflictual languages (harmful, aggressive, abusive, and offensive languages) is essential to provide a healthy conversation environment on the Web. To design and develop detection systems that are capable of achieving satisfactory performance, a thorough understanding of the nature and properties of the targeted type of conflictual language is of great importance. The scientific communities investigating human psychology and social behavior have studied these languages in details, but their insights have only partially reached the computer science community.In this survey, we aim both at systematically characterizing the conceptual properties of online conflictual languages, and at investigating the extent to which they are reflected in state-of-the-art automatic detection systems. Through an analysis of psychology literature, we provide a reconciled taxonomy that denotes the ensemble of conflictual languages typically studied in computer science. We then characterize the conceptual mismatches that can be observed in the main semantic and contextual properties of these languages and their treatment in computer science works; and systematically uncover resulting technical biases in the design of machine learning classification models and the dataset created for their training. Finally, we discuss diverse research opportunities for the computer science community and reflect on broader technical and structural issues.

KW - Bias

KW - discrimination

KW - cyberbullying

KW - offensive language

KW - abusive language

KW - harassment

KW - toxic language

KW - harmful language

U2 - 10.1145/3479158

DO - 10.1145/3479158

M3 - Article

VL - 4

SP - 11:1 - 11:56

JO - ACM Transactions on Social Computing

JF - ACM Transactions on Social Computing

IS - 3

M1 - 11

ER -

Automatic Identification of Harmful, Aggressive, Abusive, and Offensive Language on the Web: A Survey of Technical Biases Informed by Psychology Literature

Abstract

Keywords

UN SDGs

Access to Document

Fingerprint

Cite this