Extracting moods from pictures and sounds

A Hanjalic

Multimedia Computing

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

This paper considers how we feel about the content we see or hear. As opposed to the cognitive content information composed of the facts about the genre, temporal content structures and spatiotemporal content elements, we are interested in obtaining the information about the feelings, emotions, and moods evoked by a speech, audio, or video clip. We refer to the latter as the affective content, and to the terms such as happy or exciting as the affective labels of an audiovisual signal. In the first part of the paper, we explore the possibilities for representing and modeling the affective content of an audiovisual signal to effectively bridge the affective gap. Without loss of generality, we refer to this signal simply as video, which we see as an image sequence with an accompanying soundtrack. Then, we show the high potential of the affective video content analysis for enhancing the content recommendation functionalities of the future PVRs and VOD systems. We conclude this paper by outlining some interesting research challenges in the field.

Original language	Undefined/Unknown
Pages (from-to)	90-100
Number of pages	11
Journal	IEEE Signal Processing Magazine
Volume	23
Issue number	2
Publication status	Published - 2006

Keywords

academic journal papers
CWTS JFIS >= 2.00

Cite this

@article{db93174bf1f94e5e9e791a92cc0a982a,

title = "Extracting moods from pictures and sounds",

abstract = "This paper considers how we feel about the content we see or hear. As opposed to the cognitive content information composed of the facts about the genre, temporal content structures and spatiotemporal content elements, we are interested in obtaining the information about the feelings, emotions, and moods evoked by a speech, audio, or video clip. We refer to the latter as the affective content, and to the terms such as happy or exciting as the affective labels of an audiovisual signal. In the first part of the paper, we explore the possibilities for representing and modeling the affective content of an audiovisual signal to effectively bridge the affective gap. Without loss of generality, we refer to this signal simply as video, which we see as an image sequence with an accompanying soundtrack. Then, we show the high potential of the affective video content analysis for enhancing the content recommendation functionalities of the future PVRs and VOD systems. We conclude this paper by outlining some interesting research challenges in the field.",

keywords = "academic journal papers, CWTS JFIS >= 2.00",

author = "A Hanjalic",

year = "2006",

language = "Undefined/Unknown",

volume = "23",

pages = "90--100",

journal = "IEEE Signal Processing Magazine",

issn = "1053-5888",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "2",

}

TY - JOUR

T1 - Extracting moods from pictures and sounds

AU - Hanjalic, A

PY - 2006

Y1 - 2006

N2 - This paper considers how we feel about the content we see or hear. As opposed to the cognitive content information composed of the facts about the genre, temporal content structures and spatiotemporal content elements, we are interested in obtaining the information about the feelings, emotions, and moods evoked by a speech, audio, or video clip. We refer to the latter as the affective content, and to the terms such as happy or exciting as the affective labels of an audiovisual signal. In the first part of the paper, we explore the possibilities for representing and modeling the affective content of an audiovisual signal to effectively bridge the affective gap. Without loss of generality, we refer to this signal simply as video, which we see as an image sequence with an accompanying soundtrack. Then, we show the high potential of the affective video content analysis for enhancing the content recommendation functionalities of the future PVRs and VOD systems. We conclude this paper by outlining some interesting research challenges in the field.

AB - This paper considers how we feel about the content we see or hear. As opposed to the cognitive content information composed of the facts about the genre, temporal content structures and spatiotemporal content elements, we are interested in obtaining the information about the feelings, emotions, and moods evoked by a speech, audio, or video clip. We refer to the latter as the affective content, and to the terms such as happy or exciting as the affective labels of an audiovisual signal. In the first part of the paper, we explore the possibilities for representing and modeling the affective content of an audiovisual signal to effectively bridge the affective gap. Without loss of generality, we refer to this signal simply as video, which we see as an image sequence with an accompanying soundtrack. Then, we show the high potential of the affective video content analysis for enhancing the content recommendation functionalities of the future PVRs and VOD systems. We conclude this paper by outlining some interesting research challenges in the field.

KW - academic journal papers

KW - CWTS JFIS >= 2.00

UR - http://ieeexplore.ieee.org/iel5/79/33613/01621452.pdf?isnumber=33613&arnumber=1621452

M3 - Article

SN - 1053-5888

VL - 23

SP - 90

EP - 100

JO - IEEE Signal Processing Magazine

JF - IEEE Signal Processing Magazine

IS - 2

ER -
