Adaptive Gaze Control for Object Detection

Guido de Croon; Eric Postma; Jaap van den Herik

doi:10.1007/s12559-010-9093-9

Control & Simulation

Research output: Contribution to journal › Article › Scientific › peer-review

10 Citations (Scopus)

Abstract

We propose a novel gaze-control model for detecting objects in images. The model, named act-detect, uses the information from local image samples in order to shift its gaze towards object locations. The model makes two main contributions. The first contribution is that the model’s setup makes it computationally highly efficient in comparison with existing window-sliding methods for object detection, while retaining an acceptable detection performance. act-detect is evaluated on a face-detection task using a publicly available image set. In terms of detection performance, act-detect slightly outperforms the window-sliding methods that have been applied to the face-detection task. In terms of computational efficiency, act-detect clearly outperforms the window-sliding methods: it requires on the order of hundreds fewer samples for detection. The second contribution of the model lies in its more extensive use of local samples than previous models: instead of merely using them for verifying object presence at the gaze location, the model uses them to determine a direction and distance to the object of interest. The simultaneous adaptation of both the model’s visual features and its gaze-control strategy leads to the discovery of features and strategies for exploiting the local context of objects. For example, the model uses the spatial relations between the bodies of the people in the images and their faces. The resulting gaze control is a temporal process, in which the object’s context is exploited at different scales and at different image locations relative to the object.
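
The contrast the abstract draws between window-sliding and gaze shifting can be illustrated with a toy sketch. This is not the authors' act-detect implementation: the scene is a synthetic distance field that stands in for the evolved visual features, and the names `make_scene`, `local_sample`, `sliding_window_detect`, and `gaze_shift_detect` are hypothetical. It only shows how turning each local sample into a direction and a distance can reduce the number of samples compared with visiting every location.

```python
import math


def make_scene(size=64, target=(40, 22)):
    """Toy 'image': each cell stores its distance to the target.

    Assumption: real images carry no such field; it stands in for the
    context cues that a trained model would extract from local samples.
    """
    ty, tx = target
    return [[math.hypot(y - ty, x - tx) for x in range(size)]
            for y in range(size)]


def sliding_window_detect(scene):
    """Exhaustive baseline: sample every location and keep the best one."""
    best, best_pos, samples = math.inf, None, 0
    for y, row in enumerate(scene):
        for x, value in enumerate(row):
            samples += 1
            if value < best:
                best, best_pos = value, (y, x)
    return best_pos, samples


def local_sample(scene, y, x):
    """Return (value, dy, dx) from a 3x3 neighbourhood, clamped at borders."""
    h, w = len(scene), len(scene[0])
    y0, y1 = max(y - 1, 0), min(y + 1, h - 1)
    x0, x1 = max(x - 1, 0), min(x + 1, w - 1)
    dy = (scene[y1][x] - scene[y0][x]) / (y1 - y0)
    dx = (scene[y][x1] - scene[y][x0]) / (x1 - x0)
    return scene[y][x], dy, dx


def gaze_shift_detect(scene, start=(0, 0), max_shifts=20):
    """Hypothetical gaze controller: every local sample is mapped to a
    shift (direction and distance) instead of a presence check only."""
    h, w = len(scene), len(scene[0])
    y, x = start
    samples = 0
    for _ in range(max_shifts):
        dist, dy, dx = local_sample(scene, y, x)
        samples += 1
        if dist < 0.5:  # gaze is on the target cell
            break
        norm = math.hypot(dy, dx)
        if norm == 0.0:  # no directional cue in this sample
            break
        # Move against the gradient by the estimated distance.
        y = min(max(int(round(y - dist * dy / norm)), 0), h - 1)
        x = min(max(int(round(x - dist * dx / norm)), 0), w - 1)
    return (y, x), samples


if __name__ == "__main__":
    scene = make_scene()
    print("sliding window:", sliding_window_detect(scene))
    print("gaze shifting: ", gaze_shift_detect(scene))
```

In this toy setting the gaze controller needs only a handful of samples because the distance field is an idealised cue; with real images the mapping from local samples to gaze shifts would have to be learned, which is what the abstract attributes to the simultaneous adaptation of features and gaze-control strategy.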

Original language	English
Pages (from-to)	264–278
Number of pages	15
Journal	Cognitive Computation
Volume	3
Issue number	1
DOIs	https://doi.org/10.1007/s12559-010-9093-9
Publication status	Published - 15 Jan 2011

Keywords

Gaze control
Computationally efficient object detection
Active vision
Evolutionary algorithms

Access to Document

10.1007/s12559-010-9093-9

Cite this

@article{1f559db89279497d84f306843ff9400d,
title = "Adaptive Gaze Control for Object Detection",

abstract = "We propose a novel gaze-control model for detecting objects in images. The model, named act-detect, uses the information from local image samples in order to shift its gaze towards object locations. The model constitutes two main contributions. The first contribution is that the model{\textquoteright}s setup makes it computationally highly efficient in comparison with existing window-sliding methods for object detection, while retaining an acceptable detection performance. act-detect is evaluated on a face-detection task using a publicly available image set. In terms of detection performance, act-detect slightly outperforms the window-sliding methods that have been applied to the face-detection task. In terms of computational efficiency, act-detect clearly outperforms the window-sliding methods: it requires in the order of hundreds fewer samples for detection. The second contribution of the model lies in its more extensive use of local samples than previous models: instead of merely using them for verifying object presence at the gaze location, the model uses them to determine a direction and distance to the object of interest. The simultaneous adaptation of both the model{\textquoteright}s visual features and its gaze-control strategy leads to the discovery of features and strategies for exploiting the local context of objects. For example, the model uses the spatial relations between the bodies of the persons in the images and their faces. The resulting gaze control is a temporal process, in which the object{\textquoteright}s context is exploited at different scales and at different image locations relative to the object.",

keywords = "Gaze control, Computationally efficient object detection, Active vision, Evolutionary algorithms",
author = "{de Croon}, Guido and Postma, Eric and {van den Herik}, Jaap",
year = "2011",
month = jan,
day = "15",
doi = "10.1007/s12559-010-9093-9",
language = "English",
volume = "3",
pages = "264--278",
journal = "Cognitive Computation",
issn = "1866-9956",
publisher = "Springer",
number = "1",
}

TY - JOUR
T1 - Adaptive Gaze Control for Object Detection
AU - de Croon, Guido
AU - Postma, Eric
AU - van den Herik, Jaap
PY - 2011/1/15
Y1 - 2011/1/15

AB - We propose a novel gaze-control model for detecting objects in images. The model, named act-detect, uses the information from local image samples in order to shift its gaze towards object locations. The model constitutes two main contributions. The first contribution is that the model’s setup makes it computationally highly efficient in comparison with existing window-sliding methods for object detection, while retaining an acceptable detection performance. act-detect is evaluated on a face-detection task using a publicly available image set. In terms of detection performance, act-detect slightly outperforms the window-sliding methods that have been applied to the face-detection task. In terms of computational efficiency, act-detect clearly outperforms the window-sliding methods: it requires in the order of hundreds fewer samples for detection. The second contribution of the model lies in its more extensive use of local samples than previous models: instead of merely using them for verifying object presence at the gaze location, the model uses them to determine a direction and distance to the object of interest. The simultaneous adaptation of both the model’s visual features and its gaze-control strategy leads to the discovery of features and strategies for exploiting the local context of objects. For example, the model uses the spatial relations between the bodies of the persons in the images and their faces. The resulting gaze control is a temporal process, in which the object’s context is exploited at different scales and at different image locations relative to the object.

KW - Gaze control
KW - Computationally efficient object detection
KW - Active vision
KW - Evolutionary algorithms
UR - http://resolver.tudelft.nl/uuid:062e44a4-54c0-4fee-a4a9-2c0252f2487d
U2 - 10.1007/s12559-010-9093-9
DO - 10.1007/s12559-010-9093-9
M3 - Article
SN - 1866-9956
VL - 3
SP - 264
EP - 278
JO - Cognitive Computation
JF - Cognitive Computation
IS - 1
ER -
