Abstract
Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as the relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown that collecting high-quality labels from a crowdsourcing platform can be challenging. Existing quality assurance techniques focus on answer aggregation or on the use of gold questions, where ground-truth data allows checking the quality of the responses. In this paper, we present qualitative and quantitative results revealing how different crowd workers adopt different work strategies to complete relevance judgment tasks efficiently, and the consequent impact on quality. We delve into the techniques and tools that highly experienced crowd workers use to be more efficient in completing crowdsourcing micro-tasks. To this end, we use both qualitative results from worker interviews and surveys, and the results of a data-driven study of behavioral log data (i.e., clicks, keystrokes, and keyboard shortcuts) collected from crowd workers performing relevance judgment tasks. Our results highlight the presence of frequently used shortcut patterns that can speed up task completion, thus increasing the hourly wage of efficient workers. We observe how crowd work experience results in different working strategies, productivity levels, and quality and diversity of the crowdsourced judgments.
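To make the kind of behavioral log analysis described in the abstract concrete, the following is a minimal, purely illustrative sketch, not the authors' actual pipeline: it counts modifier-key shortcut combinations (e.g., Ctrl+C) occurring in a sequence of timestamped key events. The event format, the `max_gap_ms` threshold, and the function name `find_shortcut_patterns` are all hypothetical assumptions for this example.

```python
from collections import Counter

# Hypothetical log format: each event is (timestamp_ms, key), as might be
# captured from a worker's browser session during a relevance judgment task.
events = [
    (1000, "ctrl"), (1040, "c"),   # copy
    (2000, "ctrl"), (2035, "v"),   # paste
    (5000, "ctrl"), (5050, "c"),
    (6000, "ctrl"), (6030, "v"),
    (9000, "tab"),                 # plain keypress, not a shortcut
]

def find_shortcut_patterns(events, max_gap_ms=200):
    """Count modifier+key combinations pressed within max_gap_ms of each other."""
    counts = Counter()
    for (t1, k1), (t2, k2) in zip(events, events[1:]):
        if k1 in {"ctrl", "alt", "meta"} and t2 - t1 <= max_gap_ms:
            counts[f"{k1}+{k2}"] += 1
    return counts

print(find_shortcut_patterns(events))
# Counter({'ctrl+c': 2, 'ctrl+v': 2})
```

Frequencies of such patterns per worker could then be compared across experience levels, which is the type of aggregate the paper's data-driven study reports on.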
| Original language | English |
| --- | --- |
| Pages | 241-249 |
| Number of pages | 9 |
| Publication status | Published - 2020 |
| Externally published | Yes |
| Event | 13th ACM International Conference on Web Search and Data Mining (WSDM 2020), 3 Feb 2020 → 7 Feb 2020 |
Conference

| Conference | 13th ACM International Conference on Web Search and Data Mining (WSDM 2020) |
| --- | --- |
| Abbreviated title | WSDM '20 |
| Period | 3/02/20 → 7/02/20 |
Keywords
- Crowdsourcing
- IR evaluation
- Relevance judgment
- User behavior