TY - JOUR
T1 - Investigating model performance in language identification
T2 - 24th International Speech Communication Association, Interspeech 2023
AU - Styles, Suzy J.
AU - Chua, Victoria Y.H.
AU - Woon, Fei Ting
AU - Liu, Hexin
AU - Perera, Leibny Paola Garcia
AU - Khudanpur, Sanjeev
AU - Khong, Andy W.H.
AU - Dauwels, Justin
PY - 2023
Y1 - 2023
N2 - Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.
AB - Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.
KW - child-directed speech
KW - code-switching
KW - language diarization
KW - language identification
UR - http://www.scopus.com/inward/record.url?scp=85162743570&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2023-1707
DO - 10.21437/Interspeech.2023-1707
M3 - Conference article
AN - SCOPUS:85162743570
SN - 2308-457X
VL - 2023-August
SP - 4129
EP - 4133
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Y2 - 20 August 2023 through 24 August 2023
ER -