TY - JOUR
T1 - Tiny, Always-on, and Fragile
T2 - Bias Propagation through Design Choices in On-device Machine Learning Workflows
AU - Hutiri, Wiebke (Toussaint)
AU - Ding, Aaron Yi
AU - Kawsar, Fahim
AU - Mathur, Akhil
PY - 2023
Y1 - 2023
N2 - Billions of distributed, heterogeneous, and resource-constrained IoT devices deploy on-device machine learning (ML) for private, fast, and offline inference on personal data. On-device ML is highly context-dependent and sensitive to user, usage, hardware, and environment attributes. This sensitivity and the propensity toward bias in ML make it important to study bias in on-device settings. Our study is one of the first investigations of bias in this emerging domain and lays important foundations for building fairer on-device ML. We apply a software engineering lens, investigating the propagation of bias through design choices in on-device ML workflows. We first identify reliability bias as a source of unfairness and propose a measure to quantify it. We then conduct empirical experiments for a keyword spotting task to show how complex and interacting technical design choices amplify and propagate reliability bias. Our results validate that design choices made during model training, like the sample rate and input feature type, and choices made to optimize models, like lightweight architectures, the pruning learning rate, and pruning sparsity, can result in disparate predictive performance across male and female groups. Based on our findings, we suggest low-effort strategies for engineers to mitigate bias in on-device ML.
KW - audio keyword spotting
KW - bias
KW - design choices
KW - embedded machine learning
KW - fairness
KW - on-device machine learning
KW - personal data
UR - http://www.scopus.com/inward/record.url?scp=85167808648&partnerID=8YFLogxK
U2 - 10.1145/3591867
DO - 10.1145/3591867
M3 - Article
AN - SCOPUS:85167808648
SN - 1049-331X
VL - 32
JO - ACM Transactions on Software Engineering and Methodology
JF - ACM Transactions on Software Engineering and Methodology
IS - 6
M1 - 155
ER -