Abstract
Research has shown that automatic speech recognition (ASR) systems exhibit biases against different speaker groups, e.g., based on age or gender. This paper presents an investigation into bias in recent Flemish ASR. Seeing as Belgian Dutch, which is also known as Flemish, is often not included in Dutch ASR systems, a state-of-the-art ASR system for Dutch is trained using the Netherlandic Dutch data from the Spoken Dutch Corpus. Using the Flemish data from the JASMIN-CGN corpus, word error rates for various regional variants of Flemish are then compared. In addition, the most misrecognized phonemes are compared across speaker groups. The evaluation confirms a bias against speakers from West Flanders and Limburg, as well as against children, male speakers, and non-native speakers.
Original language | English |
---|---|
Title of host publication | Proceedings of the ESSV Konferenz Elektronische Sprachsignalverarbeitung |
Number of pages | 8 |
Publication status | Published - 2023 |
Event | ESSV Konferenz Elektronische Sprachsignalverarbeitung - Munich, Germany Duration: 1 Mar 2023 → 3 Mar 2023 Conference number: 34 |
Conference
Conference | ESSV Konferenz Elektronische Sprachsignalverarbeitung |
---|---|
Abbreviated title | ESSV 2023 |
Country/Territory | Germany |
City | Munich |
Period | 1/03/23 → 3/03/23 |