How do you feel?:  Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection

Philippe Lammerts; Philip Lippmann; Yen Chia Hsu; Fabio Casati; Jie Yang

doi:10.1145/3600211.3604655

How do you feel? Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection

Philippe Lammerts, Philip Lippmann, Yen Chia Hsu, Fabio Casati, Jie Yang

Web Information Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

18 Downloads (Pure)

Abstract

Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the strengths of humans' reliability and the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions regarding machine decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as occurrences where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation can provide a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) determining when to accept or reject machine decisions to obtain the optimal total value a model can deliver and 2) selecting better classification models as compared to the more widely used target of model accuracy.

Original language	English
Title of host publication	AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
Place of Publication	New York
Publisher	Association for Computing Machinery (ACM)
Pages	834-844
Number of pages	11
ISBN (Print)	979-8-4007-0231-0
DOIs	https://doi.org/10.1145/3600211.3604655
Publication status	Published - 2023
Event	2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023 - Montreal, Canada Duration: 8 Aug 2023 → 10 Aug 2023

Conference

Conference	2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023
Country/Territory	Canada
City	Montreal
Period	8/08/23 → 10/08/23

Keywords

crowdsourcing
hate speech
human-in-the-loop
machine confidence
rejection
value-sensitive machine learning

Access to Document

10.1145/3600211.3604655

3600211.3604655Final published version, 799 KBLicence: CC BY-NC

Cite this

@inproceedings{5fc082f509934136a6f9576294c6b275,

title = "How do you feel?: Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection",

abstract = "Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the strengths of humans' reliability and the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions regarding machine decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as occurrences where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation can provide a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) determining when to accept or reject machine decisions to obtain the optimal total value a model can deliver and 2) selecting better classification models as compared to the more widely used target of model accuracy.",

keywords = "crowdsourcing, hate speech, human-in-the-loop, machine confidence, rejection, value-sensitive machine learning",

author = "Philippe Lammerts and Philip Lippmann and Hsu, {Yen Chia} and Fabio Casati and Jie Yang",

year = "2023",

doi = "10.1145/3600211.3604655",

language = "English",

isbn = "979-8-4007-0231-0",

pages = "834--844",

booktitle = "AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society",

publisher = "Association for Computing Machinery (ACM)",

address = "United States",

note = "2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023 ; Conference date: 08-08-2023 Through 10-08-2023",

}

Lammerts, P, Lippmann, P, Hsu, YC, Casati, F & Yang, J 2023, How do you feel? Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection. in AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. Association for Computing Machinery (ACM), New York, pp. 834-844, 2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023, Montreal, Canada, 8/08/23. https://doi.org/10.1145/3600211.3604655

How do you feel? Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection. / Lammerts, Philippe; Lippmann, Philip; Hsu, Yen Chia et al.
AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. New York: Association for Computing Machinery (ACM), 2023. p. 834-844.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - How do you feel?

T2 - 2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023

AU - Lammerts, Philippe

AU - Lippmann, Philip

AU - Hsu, Yen Chia

AU - Casati, Fabio

AU - Yang, Jie

PY - 2023

Y1 - 2023

N2 - Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the strengths of humans' reliability and the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions regarding machine decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as occurrences where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation can provide a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) determining when to accept or reject machine decisions to obtain the optimal total value a model can deliver and 2) selecting better classification models as compared to the more widely used target of model accuracy.

AB - Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the strengths of humans' reliability and the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions regarding machine decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as occurrences where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation can provide a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) determining when to accept or reject machine decisions to obtain the optimal total value a model can deliver and 2) selecting better classification models as compared to the more widely used target of model accuracy.

KW - crowdsourcing

KW - hate speech

KW - human-in-the-loop

KW - machine confidence

KW - rejection

KW - value-sensitive machine learning

UR - http://www.scopus.com/inward/record.url?scp=85173608009&partnerID=8YFLogxK

U2 - 10.1145/3600211.3604655

DO - 10.1145/3600211.3604655

M3 - Conference contribution

AN - SCOPUS:85173608009

SN - 979-8-4007-0231-0

SP - 834

EP - 844

BT - AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society

PB - Association for Computing Machinery (ACM)

CY - New York

Y2 - 8 August 2023 through 10 August 2023

ER -