TY - GEN
T1 - Towards creating a conversational memory for long-term meeting support
T2 - 24th ACM International Conference on Multimodal Interaction, ICMI 2022
AU - Tsfasman, Maria
AU - Fenech, Kristian
AU - Tarvirdians, Morita
AU - Lorincz, Andras
AU - Jonker, Catholijn
AU - Oertel, Catharine
PY - 2022
Y1 - 2022
N2 - When working in a group, it is essential to understand each other's viewpoints to increase group cohesion and meeting productivity. This can be challenging in teams: participants might be left misunderstood and the discussion could be going around in circles. To tackle this problem, previous research on group interactions has addressed topics such as dominance detection, group engagement, and group creativity. Conversational memory, however, remains a widely unexplored area in the field of multimodal analysis of group interaction. The ability to track what each participant, or a group as a whole, finds memorable from each meeting would allow a system or agent to continuously optimise its strategy to help a team meet its goals. In the present paper, we therefore investigate what participants take away from each meeting and how it is reflected in group dynamics. As a first step toward such a system, we recorded a multimodal longitudinal meeting corpus (MEMO), which comprises first-party annotations of what participants remember from a discussion and why they remember it. We investigated whether participants in group interactions encode what they remember non-verbally, and whether such non-verbal multimodal features can be used to automatically predict what groups are likely to remember. We devised a coding scheme to cluster participants' memorisation reasons into higher-level constructs. We find that low-level multimodal cues, such as gaze and speaker activity, can predict conversational memorability, and that non-verbal signals can indicate when a memorable moment starts and ends. We could predict four levels of conversational memorability with an average accuracy of 44%. We also show that reasons related to participants' personal feelings and experiences are the most frequently mentioned grounds for remembering meeting segments.
AB - When working in a group, it is essential to understand each other's viewpoints to increase group cohesion and meeting productivity. This can be challenging in teams: participants might be left misunderstood and the discussion could be going around in circles. To tackle this problem, previous research on group interactions has addressed topics such as dominance detection, group engagement, and group creativity. Conversational memory, however, remains a widely unexplored area in the field of multimodal analysis of group interaction. The ability to track what each participant, or a group as a whole, finds memorable from each meeting would allow a system or agent to continuously optimise its strategy to help a team meet its goals. In the present paper, we therefore investigate what participants take away from each meeting and how it is reflected in group dynamics. As a first step toward such a system, we recorded a multimodal longitudinal meeting corpus (MEMO), which comprises first-party annotations of what participants remember from a discussion and why they remember it. We investigated whether participants in group interactions encode what they remember non-verbally, and whether such non-verbal multimodal features can be used to automatically predict what groups are likely to remember. We devised a coding scheme to cluster participants' memorisation reasons into higher-level constructs. We find that low-level multimodal cues, such as gaze and speaker activity, can predict conversational memorability, and that non-verbal signals can indicate when a memorable moment starts and ends. We could predict four levels of conversational memorability with an average accuracy of 44%. We also show that reasons related to participants' personal feelings and experiences are the most frequently mentioned grounds for remembering meeting segments.
KW - conversational memory
KW - multi-modal corpora
KW - multi-party interaction
KW - social signals
UR - http://www.scopus.com/inward/record.url?scp=85142854529&partnerID=8YFLogxK
U2 - 10.1145/3536221.3556613
DO - 10.1145/3536221.3556613
M3 - Conference contribution
AN - SCOPUS:85142854529
T3 - ACM International Conference Proceeding Series
SP - 94
EP - 104
BT - ICMI 2022 - Proceedings of the 2022 International Conference on Multimodal Interaction
PB - Association for Computing Machinery (ACM)
Y2 - 7 November 2022 through 11 November 2022
ER -