O.E. Scharenborg

2023

BIAS in Flemish automatic speech recognition

Herygers, A., Verkhodanova, V., Coler, M., Scharenborg, O. E. & Georges, M., 2023, Proceedings of the ESSV Konferenz Elektronische Sprachsignalverarbeitung. 8 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

7 Downloads (Pure)

DAIS: The Delft Database of EEG Recordings of Dutch Articulated and Imagined Speech

Dekker, B., Schouten, A. & Scharenborg, O., 2023, Proceedings of the ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

64 Downloads (Pure)

Exploring Data Augmentation in Bias Mitigation Against Non-Native-Accented Speech

Zhang, Y., Herygers, A., Patel, T., Yue, Z. & Scharenborg, O., 2023, Proceedings of the 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 8 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Improving Adaptive Learning Models Using Prosodic Speech Features

Wilschut, T., Sense, F., Scharenborg, O. & van Rijn, H., 2023, Artificial Intelligence in Education - 24th International Conference, AIED 2023, Proceedings. Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O. C. & Dimitrova, V. (eds.). Springer, p. 255-266 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13916 LNAI).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

9 Downloads (Pure)

Improving Whispered Speech Recognition Performance Using Pseudo-Whispered Based Data Augmentation

Lin, Z., Patel, T. & Scharenborg, O., 2023, Proceedings of the 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 8 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition

Wang, Z., Wu, S., Chen, H., He, M-K., Du, J., Lee, C-H., Chen, J., Watanabe, S., Siniscalchi, S. M., Scharenborg, O., Liu, D. & More Authors, 2023, Proceedings of the ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

22 Downloads (Pure)

2022

The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results

Chen, H., Zhou, H., Du, J., Lee, C-H., Chen, J., Watanabe, S., Siniscalchi, S. M., Scharenborg, O., Liu, D-Y. & More Authors, 2022, Proceedings of the ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 9266-9270 5 p. 9746683

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

24 Citations (Scopus)

318 Downloads (Pure)

Towards Identity Preserving Normal to Dysarthric Voice Conversion

Huang, W-C., Halpern, B. M., Violeta, L. P., Scharenborg, O. & Toda, T., 2022, Proceedings of the ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 6672-6676 5 p. 9747550

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

12 Citations (Scopus)

5 Downloads (Pure)

Using Mixed Incentives to Document Xi’an Guanzhong

Zhan, J., Jiang, Y., Cieri, C., Liberman, M., Yuan, J., Chen, Y. & Scharenborg, O., 2022, 2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference. Fiumara, J., Cieri, C., Liberman, M. & Callison-Burch, C. (eds.). European Language Resources Association (ELRA), p. 32-37 6 p. (2nd Workshop on Novel Incentives in Data Collection from People: Models, Implementations, Challenges and Results, NIDCP 2022 - Proceedings at LREC 2022 Workshop - Language Resources and Evaluation Conference).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

9 Downloads (Pure)

2021

Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval

Wang, L., Wang, X., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 7603-7607 5 p. 9414418

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

7 Citations (Scopus)

How phonotactics affect multilingual and zero-shot asr performance

Feng, S., Żelasko, P., Moro-Velázquez, L., Abavisani, A., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 7238-7242 5 p. 9414478

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

13 Citations (Scopus)

34 Downloads (Pure)

Learning fine-grained semantics in spoken language using visual grounding

Wang, X., Tian, T., Zhu, J. & Scharenborg, O., 2021, 2021 IEEE International Symposium on Circuits and Systems (ISCAS). Piscataway: IEEE, 5 p. 9401232

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

4 Citations (Scopus)

43 Downloads (Pure)

Learning to recognise words using visually grounded speech

Scholten, S., Merkx, D. & Scharenborg, O., 2021, 2021 IEEE International Symposium on Circuits and Systems (ISCAS). Piscataway: IEEE, 5 p. 9401692

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

9 Citations (Scopus)

24 Downloads (Pure)

Pathological voice adaptation with autoencoder-based voice conversion

Illa, M., Halpern, B. M., van Son, R., Moro-Velázquez, L. & Scharenborg, O., 2021, Proceedings of the 11th ISCA Speech Synthesis Workshop (SSW 11). ISCA, p. 19-24 6 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

23 Downloads (Pure)

Show and speak: Directly synthesize spoken description of images

Wang, X., Feng, S., Zhu, J., Hasegawa-Johnson, M. & Scharenborg, O., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway: IEEE, p. 4190-4194 5 p. 9414021. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

2 Citations (Scopus)

22 Downloads (Pure)

The effectiveness of self-supervised representation learning in zero-resource subword modeling

Feng, S. & Scharenborg, O., 2021, 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021: Proceedings. Matthews, M. B. (ed.). IEEE, p. 1414-1418 5 p. 9723168. (Conference Record - Asilomar Conference on Signals, Systems and Computers; vol. 2021-October).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

32 Downloads (Pure)

The effects of onset and offset masking on the time course of non-native spoken-word recognition in noise

Hintz, F., Voeten, C. C., McQueen, J. M. & Scharenborg, O., 2021, Proceedings of the Cognitive Science (CogSci) conference. Cognitive Science Society, p. 133-139 7 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Unsupervised acoustic unit discovery by leveraging a language-independent subword discriminative feature representation

Feng, S., Zelasko, P., Moro-Velázquez, L. & Scharenborg, O., 2021, 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. International Speech Communication Association, p. 1534-1538 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

29 Downloads (Pure)

2020

Detecting and analysing spontaneous oral cancer speech in the wild

Halpern, B. M., van Son, R., van den Brekel, M. W. M. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 4826 - 4830 5 p. (Interspeech 2020).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

6 Citations (Scopus)

Evaluating automatically generated phoneme captions for images

van der Hout, J., D’Haese, Z., Hasegawa-Johnson, M. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2317 - 2321 5 p. (Interspeech 2020).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

3 Citations (Scopus)

32 Downloads (Pure)

S2IGAN: Speech-to-Image Generation via Adversarial Learning

Wang, X., Qiao, T., Zhu, J., Hanjalic, A. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2292 - 2296 5 p. (Interspeech 2020).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

3 Citations (Scopus)

33 Downloads (Pure)

That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

Żelasko, P., Moro-Velázquez, L., Hasegawa-Johnson, M., Scharenborg, O. & Dehak, N., 2020, Proceedings of Interspeech 2020. ISCA, p. 3705 - 3709 5 p. (Interspeech 2020).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

13 Citations (Scopus)

33 Downloads (Pure)

Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling

Feng, S. & Scharenborg, O., 2020, Proceedings of Interspeech 2020. ISCA, p. 2732 - 2736 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

3 Citations (Scopus)

31 Downloads (Pure)

2019

Study of the performance of automatic speech recognition systems in speakers with Parkinson’s Disease

Moro-Velazquez, L., Cho, J., Watanabe, S., Hasegawa-Johnson, M. A., Scharenborg, O., Kim, H. & Dehak, N., 2019, Proceedings of Interspeech 2019. Kubin, G., Hain, T., Schuller, B., Zarka, D. E. & Hodl, P. (eds.). ISCA, Vol. 2019-September. p. 3875-3879 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

17 Citations (Scopus)

97 Downloads (Pure)

The neural correlates underlying lexically-guided perceptual learning

Scharenborg, O., Koemans, J., Smith, C., Hasegawa-Johnson, M. & Federmeier, K. D., 2019, Proceedings of Interspeech 2019. Kubin, G., Hain, T., Schuller, B., Zarka, D. E. & Hodl, P. (eds.). ISCA, p. 1223-1227 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

2 Citations (Scopus)

105 Downloads (Pure)

The representation of speech and its processing in the human brain and deep neural networks

Scharenborg, O., 2019, Speech and Computer: 21st International Conference, SPECOM 2019, Proceedings. Salah, A. A., Karpov, A. & Potapova, R. (eds.). Cham: Springer, p. 1-8 8 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11658 LNAI).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

40 Downloads (Pure)

The representation of speech in deep neural networks

Scharenborg, O., van der Gouw, N., Larson, M. & Marchiori, E., 2019, MultiMedia Modeling: 25th International Conference, MMM 2019, Proceedings. Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, W-H. & Vrochidis, S. (eds.). Part II ed. Cham: Springer, p. 194-205 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11296 LNCS).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

7 Citations (Scopus)

124 Downloads (Pure)

The Time-Course of Phoneme Category Adaptation in Deep Neural Networks

Ni, J., Hasegawa-Johnson, M. & Scharenborg, O., 2019, Statistical Language and Speech Processing: 7th International Conference, SLSP 2019. Martín-Vide, C., Purver, M. & Pollak, S. (eds.). Cham: Springer, p. 3-15 13 p. (Part of the Lecture Notes in Computer Science book series, Also part of the Lecture Notes in Artificial Intelligence book sub series ; vol. 11816).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

50 Downloads (Pure)

2018

Building an ASR System for Mboshi Using A Cross-language Definition of Acoustic Units Approach

Scharenborg, O., Ebel, P., Ciannella, F., Hasegawa-Johnson, M. & Dehak, N., 2018, Proceedings of the 6th Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU): 29-31 August 2018, Gurugram, India. New Delhi, India: ISCA, p. 167-171 5 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

130 Downloads (Pure)

The Conversation Continues: The Effect of Lyrics and Music Complexity of Background Music on Spoken-Word Recognition

Scharenborg, O. & Larson, M., 2018, Proceedings of Interspeech 2018. Yegnanarayana, B. (ed.). International Speech Communication Association, p. 2280-2284 5 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

2 Citations (Scopus)

36 Downloads (Pure)

The Role of Articulatory Feature Representation Quality in a Computational Model of Human Spoken-Word Recognition

Scharenborg, O. & Merkx, D., 2018, Proceedings of the Machine Learning in Speech and Language Processing Workshop. Hyderabad, India, p. 1-3 3 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access

File

58 Downloads (Pure)

Visualizing Phoneme Category Adaptation in Deep Neural Networks

Scharenborg, O., Tiesmeyer, S., Hasegawa-Johnson, M. & Dehak, N., 3 Sept 2018, Proceedings of Interspeech 2018. Yegnanarayana, B. (ed.). India: International Speech Communication Association, p. 1482-1486 5 p.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

File

7 Citations (Scopus)

32 Downloads (Pure)

O.E. Scharenborg

Research output

Search results