Study of the performance of automatic speech recognition systems in speakers with Parkinson’s Disease

Laureano Moro-Velazquez, JaeJin Cho, Shinji Watanabe, Mark A. Hasegawa-Johnson, Odette Scharenborg, Heejin Kim, Najim Dehak

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

1 Citation (Scopus)
5 Downloads (Pure)

Abstract

Parkinson’s Disease (PD) affects motor capabilities of patients, who in some cases need to use human-computer assistive technologies to regain independence. The objective of this work is to study in detail the differences in error patterns from state-of-the-art Automatic Speech Recognition (ASR) systems on speech from people with and without PD. Two different speech recognizers (attention-based end-to-end and Deep Neural Network - Hidden Markov Models hybrid systems) were trained on a Spanish language corpus and subsequently tested on speech from 43 speakers with PD and 46 without PD. The differences related to error rates, substitutions, insertions and deletions of characters and phonetic units between the two groups were analyzed, showing that the word error rate is 27% higher in speakers with PD than in control speakers, with a moderated correlation between that rate and the developmental stage of the disease. The errors were related to all manner classes, and were more pronounced in the vowel /u/. This study is the first to evaluate ASR systems’ responses to speech from patients at different stages of PD in Spanish. The analyses showed general trends but individual speech deficits must be studied in the future when designing new ASR systems for this population.
Original languageEnglish
Title of host publicationProceedings of Interspeech 2019
EditorsG. Kubin, T. Hain, B. Schuller, D.E. Zarka, P. Hodl
PublisherISCA
Pages3875-3879
Number of pages5
Volume2019-September
DOIs
Publication statusPublished - 2019
EventInterspeech 2019
: Crossroads of Speech and Language
- Graz, Austria
Duration: 15 Sep 201919 Sep 2019

Publication series

NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
ISSN (Print)2308-457X

Conference

ConferenceInterspeech 2019
CountryAustria
CityGraz
Period15/09/1919/09/19

Keywords

  • Automatic speech recognition
  • Deep neural networks
  • Dysarthria
  • Parkinson's disease
  • Word error rate

Fingerprint Dive into the research topics of 'Study of the performance of automatic speech recognition systems in speakers with Parkinson’s Disease'. Together they form a unique fingerprint.

  • Cite this

    Moro-Velazquez, L., Cho, J., Watanabe, S., Hasegawa-Johnson, M. A., Scharenborg, O., Kim, H., & Dehak, N. (2019). Study of the performance of automatic speech recognition systems in speakers with Parkinson’s Disease. In G. Kubin, T. Hain, B. Schuller, D. E. Zarka, & P. Hodl (Eds.), Proceedings of Interspeech 2019 (Vol. 2019-September, pp. 3875-3879). (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH). ISCA. https://doi.org/10.21437/Interspeech.2019-2993