Using Mixed Incentives to Document Xi’an Guanzhong

Juhong Zhan; Yue Jiang; Christopher Cieri; Mark Liberman; Jiahong Yuan; Yiya Chen; Odette Scharenborg

Using Mixed Incentives to Document Xi’an Guanzhong

Juhong Zhan, Yue Jiang, Christopher Cieri^*, Mark Liberman, Jiahong Yuan, Yiya Chen, Odette Scharenborg

^*Corresponding author for this work

Multimedia Computing

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

9 Downloads (Pure)

Abstract

This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

Original language	English
Title of host publication	2nd Workshop on Novel Incentives in Data Collection from People
Subtitle of host publication	Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference
Editors	James Fiumara, Christopher Cieri, Mark Liberman, Chris Callison-Burch
Publisher	European Language Resources Association (ELRA)
Pages	32-37
Number of pages	6
ISBN (Electronic)	9782493814050
Publication status	Published - 2022
Event	2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Marseille, France Duration: 20 Jun 2022 → 25 Jun 2022

Publication series

Name	2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference

Conference

Conference	2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022
Country/Territory	France
City	Marseille
Period	20/06/22 → 25/06/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

annotation
language resources
linguistic data
novel incentives
under-resourced languages

Access to Document

2022.nidcp-1.6Final published version, 832 KB

Cite this

Zhan, J., Jiang, Y., Cieri, C., Liberman, M., Yuan, J., Chen, Y., & Scharenborg, O. (2022). Using Mixed Incentives to Document Xi’an Guanzhong. In J. Fiumara, C. Cieri, M. Liberman, & C. Callison-Burch (Eds.), 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference (pp. 32-37). (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference). European Language Resources Association (ELRA).

Zhan, Juhong ; Jiang, Yue ; Cieri, Christopher et al. / Using Mixed Incentives to Document Xi’an Guanzhong. 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. editor / James Fiumara ; Christopher Cieri ; Mark Liberman ; Chris Callison-Burch. European Language Resources Association (ELRA), 2022. pp. 32-37 (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference).

@inproceedings{78a8f4d4df464215a807fb1dc5af99e7,

title = "Using Mixed Incentives to Document Xi{\textquoteright}an Guanzhong",

abstract = "This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi{\textquoteright}an.",

keywords = "annotation, language resources, linguistic data, novel incentives, under-resourced languages",

author = "Juhong Zhan and Yue Jiang and Christopher Cieri and Mark Liberman and Jiahong Yuan and Yiya Chen and Odette Scharenborg",

note = "Green Open Access added to TU Delft Institutional Repository {\textquoteleft}You share, we take care!{\textquoteright} – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. ; 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 ; Conference date: 20-06-2022 Through 25-06-2022",

year = "2022",

language = "English",

series = "2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference",

publisher = "European Language Resources Association (ELRA)",

pages = "32--37",

editor = "James Fiumara and Christopher Cieri and Mark Liberman and Chris Callison-Burch",

booktitle = "2nd Workshop on Novel Incentives in Data Collection from People",

}

Zhan, J, Jiang, Y, Cieri, C, Liberman, M, Yuan, J, Chen, Y & Scharenborg, O 2022, Using Mixed Incentives to Document Xi’an Guanzhong. in J Fiumara, C Cieri, M Liberman & C Callison-Burch (eds), 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference, European Language Resources Association (ELRA), pp. 32-37, 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022, Marseille, France, 20/06/22.

Using Mixed Incentives to Document Xi’an Guanzhong. / Zhan, Juhong; Jiang, Yue; Cieri, Christopher et al.
2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. ed. / James Fiumara; Christopher Cieri; Mark Liberman; Chris Callison-Burch. European Language Resources Association (ELRA), 2022. p. 32-37 (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Using Mixed Incentives to Document Xi’an Guanzhong

AU - Zhan, Juhong

AU - Jiang, Yue

AU - Cieri, Christopher

AU - Liberman, Mark

AU - Yuan, Jiahong

AU - Chen, Yiya

AU - Scharenborg, Odette

N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

N2 - This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

AB - This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

KW - annotation

KW - language resources

KW - linguistic data

KW - novel incentives

KW - under-resourced languages

UR - http://www.scopus.com/inward/record.url?scp=85145878905&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85145878905

T3 - 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference

SP - 32

EP - 37

BT - 2nd Workshop on Novel Incentives in Data Collection from People

A2 - Fiumara, James

A2 - Cieri, Christopher

A2 - Liberman, Mark

A2 - Callison-Burch, Chris

PB - European Language Resources Association (ELRA)

T2 - 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022

Y2 - 20 June 2022 through 25 June 2022

ER -

Zhan J, Jiang Y, Cieri C, Liberman M, Yuan J, Chen Y et al. Using Mixed Incentives to Document Xi’an Guanzhong. In Fiumara J, Cieri C, Liberman M, Callison-Burch C, editors, 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. European Language Resources Association (ELRA). 2022. p. 32-37. (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference).

Using Mixed Incentives to Document Xi’an Guanzhong

Abstract

Publication series

Conference

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this