Annotation Practices in Societally Impactful Machine Learning Applications: What are Popular Recommender Systems Models Actually Trained On?

Research output: Contribution to journal › Conference article › Scientific › peer-review


Abstract

Machine Learning (ML) models influence all aspects of our lives. They are also commonly integrated into recommender systems, which facilitate users’ decision-making processes in various scenarios, such as e-commerce, social media, news, and online learning. Training on large volumes of data is what ultimately drives such systems to provide meaningful recommendations. However, a lack of standardized practices has been observed in data collection and annotation methods for ML datasets. This research paper systematically identifies and synthesizes the state of standardization with regard to data collection and annotation reporting in the recommender systems domain, through a systematic literature review of the 100 most-cited recommender systems papers from the most impactful venues within the Computing and Information Technology field. Multiple facets of the employed techniques are examined, such as reported human annotations and annotator diversity, label quality, and the public availability of training datasets. Recurrent use of just a few benchmark datasets, poor documentation practices, and reproducibility issues in experiments are some of the most striking findings uncovered by this study. We discuss the necessity of transitioning from pure reliance on algorithmic performance metrics to prioritizing data quality and fit. Finally, concerns are raised regarding biases and socio-psychological factors inherent in the datasets, and further exploration of addressing these early in the design of ML models is suggested.

Original language: English
Number of pages: 21
Journal: CEUR Workshop Proceedings
Volume: 3476
Publication status: Published - 2023
Event: 3rd Workshop Perspectives on the Evaluation of Recommender Systems, PERSPECTIVES 2023 - Singapore, Singapore
Duration: 19 Sept 2023 → …

Keywords

  • annotation practices
  • data collection
  • machine learning
  • recommender systems
  • societal impact

