Low level analysis of video using spatiotemporal pixel blocks

SU Naci, A Hanjalic

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

2 Citations (Scopus)

Abstract

Low-level video analysis is an important step for further semantic interpretation of the video. This provides information about the camera work, video editing process, shape, texture, color and topology of the objects and the scenes captured by the camera. Here we introduce a framework capable of extracting the information about the shot boundaries and the camera and object motion, based on the analysis of spatiotemporal pixel blocks in a series of video frames. Extracting the motion information and detecting shot boundaries using the same underlying principle is the main contribution of this paper. Besides, this original principle is likely to improve robustness of the abovementioned low-level video analysis as it avoids typical problems of standard frame-based approaches and the camera motion information provides critical help to improve the shot boundary detection performance. The system is evaluated using TRECVID data [1] with promising results.
Original languageEnglish
Title of host publicationMultimedia Content Representation, Classification and Security
EditorsB Gunsel, AK Jain, AM Tekalp, B Sankur
Place of PublicationBerlin-Heidelberg
PublisherSpringer
Pages777-784
Number of pages8
ISBN (Print)9783540393924
Publication statusPublished - 2006
EventInternational Workshop, MRCS 2006, Istanbul, Turkey - Berlin-Heidelberg
Duration: 11 Sept 200613 Sept 2006

Publication series

Name
PublisherSpringer
NameLecture Notes in Computer Science
Volume4105
ISSN (Print)0302-9743

Conference

ConferenceInternational Workshop, MRCS 2006, Istanbul, Turkey
Period11/09/0613/09/06

Keywords

  • Wiskunde en Informatica
  • Techniek
  • technische Wiskunde en Informatica
  • conference contrib. refereed
  • CWTS 0.75 <= JFIS < 2.00

Fingerprint

Dive into the research topics of 'Low level analysis of video using spatiotemporal pixel blocks'. Together they form a unique fingerprint.

Cite this