Speech technology for unwritten languages

Odette Scharenborg, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stueker, Pierre Godard, M Mueller, More Authors

Research output: Contribution to journalArticleScientificpeer-review

28 Downloads (Pure)

Abstract

Speech technology plays an important role in our everyday life. Among others, speech is used for human-computer interaction, for instance for information retrieval and on-line shopping. In the case of an unwritten language, however, speech technology is unfortunately difficult to create, because it cannot be created by the standard combination of pre-trained speech-to-text and text-to-speech subsystems. The research presented in this article takes the first steps towards speech technology for unwritten languages. Specifically, the aim of this work was 1) to learn speech-to-meaning representations without using text as an intermediate representation, and 2) to test the sufficiency of the learned representations to regenerate speech or translated text, or to retrieve images that depict the meaning of an utterance in an unwritten language. The results suggest that building systems that go directly from speech-to-meaning and from meaning-to-speech, bypassing the need for text, is possible.

Original languageEnglish
Article number8998182
Pages (from-to)964-975
Number of pages12
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume28
DOIs
Publication statusPublished - 2020

Keywords

  • Speech processing
  • automatic speech recognition
  • image retrieval
  • speech synthesis
  • unsupervised learning

Fingerprint Dive into the research topics of 'Speech technology for unwritten languages'. Together they form a unique fingerprint.

  • Cite this

    Scharenborg, O., Besacier, L., Black, A. W., Hasegawa-Johnson, M., Metze, F., Neubig, G., Stueker, S., Godard, P., Mueller, M., & More Authors (2020). Speech technology for unwritten languages. IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 964-975. [8998182]. https://doi.org/10.1109/TASLP.2020.2973896