Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

Maria Eskevich; Martha Larson; Robin Aly; Serwah Sabetghadam; Gareth J.F. Jones; Roeland Ordelman; Benoit Huet

doi:10.1007/978-3-319-51814-5_24

Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

Maria Eskevich, Martha Larson, Robin Aly, Serwah Sabetghadam, Gareth J.F. Jones, Roeland Ordelman, Benoit Huet

Multimedia Computing

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

4 Citations (Scopus)

Abstract

Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called ‘video hyperlinking’), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multimodality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.

Original language	English
Title of host publication	MultiMedia Modeling
Subtitle of host publication	23rd International Conference, MMM 2017, proceedings
Editors	Laurent Amsaleg, Gylfi Þór Guðmundsson, Cathal Gurrin , Björn Þór Jónsson , Shin’ichi Satoh
Place of Publication	Cham
Publisher	Springer
Pages	280-292
Number of pages	13
Edition	Part II
ISBN (Electronic)	978-3-319-51814-5
ISBN (Print)	978-3-319-51813-8
DOIs	https://doi.org/10.1007/978-3-319-51814-5_24
Publication status	Published - 2017
Event	MMM 2017: 23rd International Conference on Multimedia Modeling - Reykjavik, Iceland Duration: 4 Jan 2017 → 6 Jan 2017

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer International Publishing
Volume	10133
ISSN (Electronic)	0302-9743

Conference

Conference	MMM 2017
Country/Territory	Iceland
City	Reykjavik
Period	4/01/17 → 6/01/17

Keywords

Crowdsourcing
Video-to-video linking
Link evaluation
Verbal-visual information

Access to Document

10.1007/978-3-319-51814-5_24

Cite this

Eskevich, M., Larson, M., Aly, R., Sabetghadam, S., Jones, G. J. F., Ordelman, R., & Huet, B. (2017). Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation. In L. Amsaleg, G. Þór Guðmundsson, C. Gurrin , B. Þór Jónsson , & S. Satoh (Eds.), MultiMedia Modeling: 23rd International Conference, MMM 2017, proceedings (Part II ed., pp. 280-292). (Lecture Notes in Computer Science; Vol. 10133). Springer. https://doi.org/10.1007/978-3-319-51814-5_24

Eskevich, Maria ; Larson, Martha ; Aly, Robin et al. / Multimodal Video-to-Video Linking : Turning to the Crowd for Insight and Evaluation. MultiMedia Modeling: 23rd International Conference, MMM 2017, proceedings. editor / Laurent Amsaleg ; Gylfi Þór Guðmundsson ; Cathal Gurrin ; Björn Þór Jónsson ; Shin’ichi Satoh . Part II. ed. Cham : Springer, 2017. pp. 280-292 (Lecture Notes in Computer Science).

@inproceedings{daf9c824120c4830a5ef7be2fb56993f,

title = "Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation",

abstract = "Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called {\textquoteleft}video hyperlinking{\textquoteright}), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multimodality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.",

keywords = "Crowdsourcing, Video-to-video linking, Link evaluation, Verbal-visual information",

author = "Maria Eskevich and Martha Larson and Robin Aly and Serwah Sabetghadam and Jones, {Gareth J.F.} and Roeland Ordelman and Benoit Huet",

year = "2017",

doi = "10.1007/978-3-319-51814-5_24",

language = "English",

isbn = "978-3-319-51813-8",

series = "Lecture Notes in Computer Science",

publisher = "Springer",

pages = "280--292",

editor = "Laurent Amsaleg and { {\TH}{\'o}r Gu{\dh}mundsson}, Gylfi and { Gurrin }, Cathal and { {\TH}{\'o}r J{\'o}nsson }, Bj{\"o}rn and {Satoh }, {Shin{\textquoteright}ichi }",

booktitle = "MultiMedia Modeling",

edition = "Part II",

note = "MMM 2017 : 23rd International Conference on Multimedia Modeling ; Conference date: 04-01-2017 Through 06-01-2017",

}

Eskevich, M, Larson, M, Aly, R, Sabetghadam, S, Jones, GJF, Ordelman, R & Huet, B 2017, Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation. in L Amsaleg, G Þór Guðmundsson, C Gurrin , B Þór Jónsson & S Satoh (eds), MultiMedia Modeling: 23rd International Conference, MMM 2017, proceedings. Part II edn, Lecture Notes in Computer Science, vol. 10133, Springer, Cham, pp. 280-292, MMM 2017, Reykjavik, Iceland, 4/01/17. https://doi.org/10.1007/978-3-319-51814-5_24

Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation. / Eskevich, Maria; Larson, Martha; Aly, Robin et al.
MultiMedia Modeling: 23rd International Conference, MMM 2017, proceedings. ed. / Laurent Amsaleg; Gylfi Þór Guðmundsson; Cathal Gurrin ; Björn Þór Jónsson ; Shin’ichi Satoh . Part II. ed. Cham: Springer, 2017. p. 280-292 (Lecture Notes in Computer Science; Vol. 10133).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Multimodal Video-to-Video Linking

T2 - MMM 2017

AU - Eskevich, Maria

AU - Larson, Martha

AU - Aly, Robin

AU - Sabetghadam, Serwah

AU - Jones, Gareth J.F.

AU - Ordelman, Roeland

AU - Huet, Benoit

PY - 2017

Y1 - 2017

N2 - Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called ‘video hyperlinking’), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multimodality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.

AB - Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called ‘video hyperlinking’), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multimodality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.

KW - Crowdsourcing

KW - Video-to-video linking

KW - Link evaluation

KW - Verbal-visual information

U2 - 10.1007/978-3-319-51814-5_24

DO - 10.1007/978-3-319-51814-5_24

M3 - Conference contribution

SN - 978-3-319-51813-8

T3 - Lecture Notes in Computer Science

SP - 280

EP - 292

BT - MultiMedia Modeling

A2 - Amsaleg, Laurent

A2 - Þór Guðmundsson, Gylfi

A2 - Gurrin , Cathal

A2 - Þór Jónsson , Björn

A2 - Satoh , Shin’ichi

PB - Springer

CY - Cham

Y2 - 4 January 2017 through 6 January 2017

ER -

Eskevich M, Larson M, Aly R, Sabetghadam S, Jones GJF, Ordelman R et al. Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation. In Amsaleg L, Þór Guðmundsson G, Gurrin C, Þór Jónsson B, Satoh S, editors, MultiMedia Modeling: 23rd International Conference, MMM 2017, proceedings. Part II ed. Cham: Springer. 2017. p. 280-292. (Lecture Notes in Computer Science). doi: 10.1007/978-3-319-51814-5_24

Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

Abstract

Publication series

Conference

Keywords

Access to Document

Fingerprint

Cite this