Abstract
Conversational events, such as speaking turns, backchannels, topic changes, and laughter, are central to the structure of multiparty interaction and play a key role in shaping its dynamics. However, detecting such events in real world social settings remains challenging due to perceptual ambiguity, visual occlusion, signal noise, and limitations in acquiring high quality audio data. This work addresses these challenges by focusing on spontaneous interactions in socially complex and privacy sensitive environments, exploring multimodal, nonverbal cues that do not rely on audio. The goal is to develop a novel modeling approach for group context awareness to infer conversational events and support social scene understanding under real world constraints..
| Original language | English |
|---|---|
| Title of host publication | ICMI 2025 - Proceedings of the 27th International Conference on Multimodal Interaction |
| Editors | Ram Subramanian, Yukiko I. Nakano, Tom Gedeon, Mohan Kankanhalli, Tanaya Guha, Jainendra Shukla, Gelareh Mohammadi, Oya Celiktutan |
| Place of Publication | New York, NY |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 716-720 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798400714993 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 27th International Conference on Multimodal Interaction, ICMI 2025 - Canberra, Australia Duration: 13 Oct 2025 → 17 Oct 2025 |
Conference
| Conference | 27th International Conference on Multimodal Interaction, ICMI 2025 |
|---|---|
| Country/Territory | Australia |
| City | Canberra |
| Period | 13/10/25 → 17/10/25 |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository as part of the Taverne amendment. More information about this copyright law amendment can be found at https://www.openaccess.nl. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.Keywords
- Human-centered computing
- Multimodal interaction
- Multimodal Machine Learning
- Social Signal Processing
Fingerprint
Dive into the research topics of 'Multimodal Conversational Events Estimation in Complex Social Scenes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver