How Do Neural Networks See Depth in Single Images?

Tom van Dijk; Guido de Croon

doi:10.1109/ICCV.2019.00227

Control & Simulation

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

126 Citations (Scopus)

103 Downloads (Pure)

Abstract

Deep neural networks have led to a breakthrough in depth estimation from single images. Recent work shows that the quality of these estimations is rapidly increasing. It is clear that neural networks can see depth in single images. However, to the best of our knowledge, no work currently exists that analyzes what these networks have learned. In this work we take four previously published networks and investigate what depth cues they exploit. We find that all networks ignore the apparent size of known obstacles in favor of their vertical position in the image. The use of the vertical position requires the camera pose to be known; however, we find that these networks only partially recognize changes in camera pitch and roll angles. Small changes in camera pitch are shown to disturb the estimated distance towards obstacles. The use of the vertical image position allows the networks to estimate depth towards arbitrary obstacles - even those not appearing in the training set - but may depend on features that are not universally present.

Original language	English
Title of host publication	Proceedings - 2019 International Conference on Computer Vision, ICCV 2019
Pages	2183-2191
Number of pages	9
ISBN (Electronic)	9781728148038
DOIs	https://doi.org/10.1109/ICCV.2019.00227
Publication status	Published - Oct 2019
Event	The IEEE International Conference on Computer Vision 2019 - Seoul, Korea, Republic of. Duration: 27 Oct 2019 → 2 Nov 2019. http://iccv2019.thecvf.com/

Publication series

Name	Proceedings of the IEEE International Conference on Computer Vision
Volume	2019-October
ISSN (Print)	1550-5499

Conference

Conference	The IEEE International Conference on Computer Vision 2019
Abbreviated title	ICCV
Country/Territory	Korea, Republic of
City	Seoul
Period	27/10/19 → 2/11/19
Internet address	http://iccv2019.thecvf.com/

Keywords

neural networks
monocular depth estimation
Depth perception

Access to Document

10.1109/ICCV.2019.00227

van_Dijk_How_Do_Neural_Networks_See_Depth_in_Single_Images_ICCV_2019_paper (Final published version, 1.04 MB)

http://openaccess.thecvf.com/content_ICCV_2019/html/van_Dijk_How_Do_Neural_Networks_See_Depth_in_Single_Images_ICCV_2019_paper.html

Cite this

@inproceedings{fbe6b0d4cd10450197da2e6f77f7f4cd,
  title = "How Do Neural Networks See Depth in Single Images?",
  abstract = "Deep neural networks have led to a breakthrough in depth estimation from single images. Recent work shows that the quality of these estimations is rapidly increasing. It is clear that neural networks can see depth in single images. However, to the best of our knowledge, no work currently exists that analyzes what these networks have learned. In this work we take four previously published networks and investigate what depth cues they exploit. We find that all networks ignore the apparent size of known obstacles in favor of their vertical position in the image. The use of the vertical position requires the camera pose to be known; however, we find that these networks only partially recognize changes in camera pitch and roll angles. Small changes in camera pitch are shown to disturb the estimated distance towards obstacles. The use of the vertical image position allows the networks to estimate depth towards arbitrary obstacles - even those not appearing in the training set - but may depend on features that are not universally present.",
  keywords = "neural networks, monocular depth estimation, Depth perception",
  author = "{van Dijk}, Tom and {de Croon}, Guido",
  year = "2019",
  month = oct,
  doi = "10.1109/ICCV.2019.00227",
  language = "English",
  series = "Proceedings of the IEEE International Conference on Computer Vision",
  pages = "2183--2191",
  booktitle = "Proceedings - 2019 International Conference on Computer Vision, ICCV 2019",
  note = "The IEEE International Conference on Computer Vision 2019, ICCV ; Conference date: 27-10-2019 Through 02-11-2019",
  url = "http://iccv2019.thecvf.com/",
}

van Dijk, T & de Croon, G 2019, 'How Do Neural Networks See Depth in Single Images?', in Proceedings - 2019 International Conference on Computer Vision, ICCV 2019, 9009532, Proceedings of the IEEE International Conference on Computer Vision, vol. 2019-October, pp. 2183-2191, The IEEE International Conference on Computer Vision 2019, Seoul, Korea, Republic of, 27/10/19. https://doi.org/10.1109/ICCV.2019.00227

How Do Neural Networks See Depth in Single Images? / van Dijk, Tom ; de Croon, Guido.
Proceedings - 2019 International Conference on Computer Vision, ICCV 2019. 2019. p. 2183-2191 9009532 (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2019-October).
