TY - CPAPER
T1 - Bias in Automated Speaker Recognition
AU - Hutiri, Wiebke Toussaint
AU - Ding, Aaron Yi
PY - 2022
AB - Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment and known sources of bias in related domains like face recognition and natural language processing, bias in automated speaker recognition has not been studied systematically. We present an in-depth empirical and analytical study of bias in the machine learning development workflow of speaker verification, a voice biometric and core task in automated speaker recognition. Drawing on an established framework for understanding sources of harm in machine learning, we show that bias exists at every development stage in the well-known VoxCeleb Speaker Recognition Challenge, including data generation, model building, and implementation. Most affected are female speakers and non-US nationalities, who experience significant performance degradation. Leveraging the insights from our findings, we make practical recommendations for mitigating bias in automated speaker recognition, and outline future research directions.
KW - audit
KW - bias
KW - evaluation
KW - fairness
KW - speaker recognition
KW - speaker verification
UR - http://www.scopus.com/inward/record.url?scp=85132992877&partnerID=8YFLogxK
DO - 10.1145/3531146.3533089
M3 - Conference contribution
AN - SCOPUS:85132992877
T3 - ACM International Conference Proceeding Series
SP - 230
EP - 247
BT - Proceedings of the 2022 5th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2022
PB - ACM
T2 - 5th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2022
Y2 - 21 June 2022 through 24 June 2022
ER -