Filter
Conference contribution

Search results

  • 2023

    BIAS in Flemish automatic speech recognition

    Herygers, A., Verkhodanova, V., Coler, M., Scharenborg, O. E. & Georges, M., 2023, Proceedings of the ESSV Konferenz Elektronische Sprachsignalverarbeitung. 8 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    7 Downloads (Pure)
  • DAIS: The Delft Database of EEG Recordings of Dutch Articulated and Imagined Speech

    Dekker, B., Schouten, A. & Scharenborg, O., 2023, Proceedings of the ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    64 Downloads (Pure)
  • Exploring Data Augmentation in Bias Mitigation Against Non-Native-Accented Speech

    Zhang, Y., Herygers, A., Patel, T., Yue, Z. & Scharenborg, O., 2023, Proceedings of the 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 8 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

  • Improving Adaptive Learning Models Using Prosodic Speech Features

    Wilschut, T., Sense, F., Scharenborg, O. & van Rijn, H., 2023, Artificial Intelligence in Education - 24th International Conference, AIED 2023, Proceedings. Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O. C. & Dimitrova, V. (eds.). Springer, p. 255-266 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13916 LNAI).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    9 Downloads (Pure)
  • Improving Whispered Speech Recognition Performance Using Pseudo-Whispered Based Data Augmentation

    Lin, Z., Patel, T. & Scharenborg, O., 2023, Proceedings of the 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 8 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

  • The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition

    Wang, Z., Wu, S., Chen, H., He, M-K., Du, J., Lee, C-H., Chen, J., Watanabe, S., Siniscalchi, S. M., Scharenborg, O., Liu, D. & More Authors, 2023, Proceedings of the ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    22 Downloads (Pure)
  • 2022

    The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results

    Chen, H., Zhou, H., Du, J., Lee, C-H., Chen, J., Watanabe, S., Siniscalchi, S. M., Scharenborg, O., Liu, D-Y. & More Authors, 2022, Proceedings of the ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 9266-9270 5 p. 9746683

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    24 Citations (Scopus)
    318 Downloads (Pure)
  • Towards Identity Preserving Normal to Dysarthric Voice Conversion

    Huang, W-C., Halpern, B. M., Violeta, L. P., Scharenborg, O. & Toda, T., 2022, Proceedings of the ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 6672-6676 5 p. 9747550

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    12 Citations (Scopus)
    5 Downloads (Pure)
  • Using Mixed Incentives to Document Xi’an Guanzhong

    Zhan, J., Jiang, Y., Cieri, C., Liberman, M., Yuan, J., Chen, Y. & Scharenborg, O., 2022, 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. Fiumara, J., Cieri, C., Liberman, M. & Callison-Burch, C. (eds.). European Language Resources Association (ELRA), p. 32-37 6 p. (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    9 Downloads (Pure)
  • 2021

    Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval

    Wang, L., Wang, X., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 7603-7607 5 p. 9414418

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    7 Citations (Scopus)
  • How phonotactics affect multilingual and zero-shot asr performance

    Feng, S., Żelasko, P., Moro-Velázquez, L., Abavisani, A., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 7238-7242 5 p. 9414478

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    13 Citations (Scopus)
    34 Downloads (Pure)
  • Learning fine-grained semantics in spoken language using visual grounding

    Wang, X., Tian, T., Zhu, J. & Scharenborg, O., 2021, 2021 IEEE International Symposium on Circuits and Systems (ISCAS). Piscataway: IEEE, 5 p. 9401232

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    4 Citations (Scopus)
    43 Downloads (Pure)
  • Learning to recognise words using visually grounded speech

    Scholten, S., Merkx, D. & Scharenborg, O., 2021, 2021 IEEE International Symposium on Circuits and Systems (ISCAS). Piscataway: IEEE, 5 p. 9401692

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    9 Citations (Scopus)
    24 Downloads (Pure)
  • Pathological voice adaptation with autoencoder-based voice conversion

    Illa, M., Halpern, B. M., van Son, R., Moro-Velázquez, L. & Scharenborg, O., 2021, Proceedings of the 11th ISCA Speech Synthesis Workshop (SSW 11). ISCA, p. 19-24 6 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    23 Downloads (Pure)
  • Show and speak: Directly synthesize spoken description of images

    Wang, X., Feng, S., Zhu, J., Hasegawa-Johnson, M. & Scharenborg, O., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 4190-4194 5 p. 9414021. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    2 Citations (Scopus)
    22 Downloads (Pure)
  • The effectiveness of self-supervised representation learning in zero-resource subword modeling

    Feng, S. & Scharenborg, O., 2021, 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021: Proceedings. Matthews, M. B. (ed.). IEEE, p. 1414-1418 5 p. 9723168. (Conference Record - Asilomar Conference on Signals, Systems and Computers; vol. 2021-October).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    32 Downloads (Pure)
  • The effects of onset and offset masking on the time course of non-native spoken-word recognition in noise

    Hintz, F., Voeten, C. C., McQueen, J. M. & Scharenborg, O., 2021, Proceedings of the Cognitive Science (CogSci) conference. Cognitive Science Society, p. 133-139 7 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

  • Unsupervised acoustic unit discovery by leveraging a language-independent subword discriminative feature representation

    Feng, S., Zelasko, P., Moro-Velázquez, L. & Scharenborg, O., 2021, 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. International Speech Communication Association, p. 1534-1538 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    29 Downloads (Pure)
  • 2020

    Detecting and analysing spontaneous oral cancer speech in the wild

    Halpern, B. M., van Son, R., van den Brekel, M. W. M. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 4826 - 4830 5 p. (Interspeech 2020).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    6 Citations (Scopus)
  • Evaluating automatically generated phoneme captions for images

    van der Hout, J., D’Haese, Z., Hasegawa-Johnson, M. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2317 - 2321 5 p. (Interspeech 2020).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    3 Citations (Scopus)
    32 Downloads (Pure)
  • S2IGAN: Speech-to-Image Generation via Adversarial Learning

    Wang, X., Qiao, T., Zhu, J., Hanjalic, A. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2292 - 2296 5 p. (Interspeech 2020).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    3 Citations (Scopus)
    33 Downloads (Pure)
  • That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

    Żelasko, P., Moro-Velázquez, L., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2020, Proceedings of Interspeech 2020. ISCA, p. 3705 - 3709 5 p. (Interspeech 2020).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    13 Citations (Scopus)
    33 Downloads (Pure)
  • Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling

    Feng, S. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2732 - 2736 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    3 Citations (Scopus)
    31 Downloads (Pure)
  • 2019

    Study of the performance of automatic speech recognition systems in speakers with Parkinson’s Disease

    Moro-Velazquez, L., Cho, J., Watanabe, S., Hasegawa-Johnson, M. A., Scharenborg, O., Kim, H. & Dehak, N., 2019, Proceedings of Interspeech 2019. Kubin, G., Hain, T., Schuller, B., Zarka, D. E. & Hodl, P. (eds.). ISCA, Vol. 2019-September. p. 3875-3879 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    17 Citations (Scopus)
    97 Downloads (Pure)
  • The neural correlates underlying lexically-guided perceptual learning

    Scharenborg, O., Koemans, J., Smith, C., Hasegawa-Johnson, M. & Federmeier, K. D., 2019, Proceedings of Interspeech 2019. Kubin, G., Hain, T., Schuller, B., Zarka, D. E. & Hodl, P. (eds.). ISCA, p. 1223-1227 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    2 Citations (Scopus)
    105 Downloads (Pure)
  • The representation of speech and its processing in the human brain and deep neural networks

    Scharenborg, O., 2019, Speech and Computer: 21st International Conference, SPECOM 2019, Proceedings. Salah, A. A., Karpov, A. & Potapova, R. (eds.). Cham: Springer, p. 1-8 8 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11658 LNAI).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    40 Downloads (Pure)
  • The representation of speech in deep neural networks

    Scharenborg, O., van der Gouw, N., Larson, M. & Marchiori, E., 2019, MultiMedia Modeling: 25th International Conference, MMM 2019, Proceedings. Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, W-H. & Vrochidis, S. (eds.). Part II ed. Cham: Springer, p. 194-205 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11296 LNCS).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    7 Citations (Scopus)
    124 Downloads (Pure)
  • The Time-Course of Phoneme Category Adaptation in Deep Neural Networks

    Ni, J., Hasegawa-Johnson, M. & Scharenborg, O., 2019, Statistical Language and Speech Processing: 7th International Conference, SLSP 2019. Martín-Vide, C., Purver, M. & Pollak, S. (eds.). Cham: Springer, p. 3-15 13 p. (Part of the Lecture Notes in Computer Science book series, Also part of the Lecture Notes in Artificial Intelligence book sub series ; vol. 11816).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    50 Downloads (Pure)
  • 2018

    Building an ASR System for Mboshi Using A Cross-language Definition of Acoustic Units Approach

    Scharenborg, O., Ebel, P., Ciannella, F., Hasegawa-Johnson, M. & Dehak, N., 2018, Proceedings of the 6th Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU): 29-31 August 2018, Gurugram, India. New Delhi, India: ISCA, p. 167-171 5 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    130 Downloads (Pure)
  • The Conversation Continues: The Effect of Lyrics and Music Complexity of Background Music on Spoken-Word Recognition

    Scharenborg, O. & Larson, M., 2018, Proceedings of Interspeech 2018. Yegnanarayana, B. (ed.). International Speech Communication Association, p. 2280-2284 5 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    2 Citations (Scopus)
    36 Downloads (Pure)
  • The Role of Articulatory Feature Representation Quality in a Computational Model of Human Spoken-Word Recognition

    Scharenborg, O. & Merkx, D., 2018, Proceedings of the Machine Learning in Speech and Language Processing Workshop. Hyderabad, India, p. 1-3 3 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    58 Downloads (Pure)
  • Visualizing Phoneme Category Adaptation in Deep Neural Networks

    Scharenborg, O., Tiesmeyer, S., Hasegawa-Johnson, M. & Dehak, N., 3 Sept 2018, Proceedings of Interspeech 2018. Yegnanarayana, B. (ed.). India: International Speech Communication Association, p. 1482-1486 5 p.

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    File
    7 Citations (Scopus)
    32 Downloads (Pure)