Skip to main navigation Skip to search Skip to main content

Multimodal Conversational Events Estimation in Complex Social Scenes

Litian Li*

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Abstract

Conversational events, such as speaking turns, backchannels, topic changes, and laughter, are central to the structure of multiparty interaction and play a key role in shaping its dynamics. However, detecting such events in real world social settings remains challenging due to perceptual ambiguity, visual occlusion, signal noise, and limitations in acquiring high quality audio data. This work addresses these challenges by focusing on spontaneous interactions in socially complex and privacy sensitive environments, exploring multimodal, nonverbal cues that do not rely on audio. The goal is to develop a novel modeling approach for group context awareness to infer conversational events and support social scene understanding under real world constraints..
Original languageEnglish
Title of host publicationICMI 2025 - Proceedings of the 27th International Conference on Multimodal Interaction
EditorsRam Subramanian, Yukiko I. Nakano, Tom Gedeon, Mohan Kankanhalli, Tanaya Guha, Jainendra Shukla, Gelareh Mohammadi, Oya Celiktutan
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery (ACM)
Pages716-720
Number of pages5
ISBN (Electronic)9798400714993
DOIs
Publication statusPublished - 2025
Event27th International Conference on Multimodal Interaction, ICMI 2025 - Canberra, Australia
Duration: 13 Oct 202517 Oct 2025

Conference

Conference27th International Conference on Multimodal Interaction, ICMI 2025
Country/TerritoryAustralia
CityCanberra
Period13/10/2517/10/25

Bibliographical note

Green Open Access added to TU Delft Institutional Repository as part of the Taverne amendment. More information about this copyright law amendment can be found at https://www.openaccess.nl. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • Human-centered computing
  • Multimodal interaction
  • Multimodal Machine Learning
  • Social Signal Processing

Fingerprint

Dive into the research topics of 'Multimodal Conversational Events Estimation in Complex Social Scenes'. Together they form a unique fingerprint.

Cite this