Learning Depth from Single Monocular Images

Open Access, Proceedings Article

- Vol. 18, pp 1161-1168

TLDR

This work begins by collecting a training set of monocular images (of unstructured outdoor environments which include forests, trees, buildings, etc.) and their corresponding ground-truth depthmaps, and applies supervised learning to predict the depthmap as a function of the image.

Abstract:

We consider the task of depth estimation from a single monocular image. We take a supervised learning approach to this problem, in which we begin by collecting a training set of monocular images (of unstructured outdoor environments which include forests, trees, buildings, etc.) and their corresponding ground-truth depthmaps. Then, we apply supervised learning to predict the depthmap as a function of the image. Depth estimation is a challenging problem, since local features alone are insufficient to estimate depth at a point, and one needs to consider the global context of the image. Our model uses a discriminatively-trained Markov Random Field (MRF) that incorporates multiscale local- and global-image features, and models both depths at individual points as well as the relation between depths at different points. We show that, even on unstructured scenes, our algorithm is frequently able to recover fairly accurate depthmaps.
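The abstract's idea of modeling both depths at individual points and the relations between depths at neighbouring points can be illustrated with a small sketch. The abstract does not give the form of the MRF energy, so everything below is an assumption made for illustration: a quadratic (Gaussian) energy, a linear per-point depth predictor `features @ theta`, a 4-connected grid, and the hypothetical names `grid_laplacian`, `map_depth`, `sigma_unary` and `sigma_pair`. It is single-scale and is not the authors' implementation.

```python
import numpy as np


def grid_laplacian(h, w):
    """Graph Laplacian of an h-by-w 4-connected grid (dense, for small demos)."""
    n = h * w
    lap = np.zeros((n, n))
    for r in range(h):
        for c in range(w):
            i = r * w + c
            for dr, dc in ((0, 1), (1, 0)):  # right and down neighbours
                rr, cc = r + dr, c + dc
                if rr < h and cc < w:
                    j = rr * w + cc
                    lap[i, i] += 1.0
                    lap[j, j] += 1.0
                    lap[i, j] -= 1.0
                    lap[j, i] -= 1.0
    return lap


def map_depth(features, theta, h, w, sigma_unary=1.0, sigma_pair=0.5):
    """MAP depths under an assumed Gaussian MRF.

    Energy (assumed form, not taken from the paper):
        sum_i (d_i - x_i . theta)^2 / (2 sigma_unary^2)
      + sum_{neighbours i,j} (d_i - d_j)^2 / (2 sigma_pair^2)
    Setting the gradient to zero gives a linear system in d.
    """
    unary = features @ theta  # per-point depth guess from local features
    a = np.eye(h * w) / sigma_unary**2 + grid_laplacian(h, w) / sigma_pair**2
    b = unary / sigma_unary**2
    return np.linalg.solve(a, b).reshape(h, w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    h, w = 4, 5
    feats = rng.normal(size=(h * w, 3))   # stand-in for image features
    theta = np.array([1.0, -0.5, 2.0])    # stand-in for learned parameters
    print(map_depth(feats, theta, h, w))
```

The pairwise term pulls neighbouring depths toward each other, so the result is a smoothed version of the per-point predictions. A smaller `sigma_pair` enforces stronger agreement between neighbours.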

Citations

Computer vision : a modern approach = 计算机视觉 : 一种现代的方法

David Forsyth, +1 more

TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.

Proceedings Article

Depth Map Prediction from a Single Image using a Multi-Scale Deep Network

David Eigen, +2 more

TL;DR: Two deep network stacks are employed: one makes a coarse global prediction based on the entire image, and the other refines this prediction locally. The method achieves state-of-the-art results on both NYU Depth and KITTI.

Proceedings Article

Deeper Depth Prediction with Fully Convolutional Residual Networks

Iro Laina, +4 more

TL;DR: Proposes a fully convolutional architecture, encompassing residual learning, to model the ambiguous mapping between monocular images and depth maps, and presents a novel way to efficiently learn feature map up-sampling within the network.

Journal Article

Make3D: Learning 3D Scene Structure from a Single Still Image

Ashutosh Saxena, +2 more

- 01 May 2009 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work considers the problem of estimating detailed 3D structure from a single still image of an unstructured environment and uses a Markov random field (MRF) to infer a set of "plane parameters" that capture both the 3D location and 3D orientation of the patch.

Proceedings Article

Deep Ordinal Regression Network for Monocular Depth Estimation

Huan Fu, +4 more

TL;DR: The Deep Ordinal Regression Network (DORN) discretizes depth and recasts depth network learning as an ordinal regression problem; training the network with an ordinary regression loss achieves much higher accuracy and faster convergence in synch.

References

Journal Article

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

Daniel Scharstein, +2 more

- 09 Dec 2001 -

International Journal of Computer Vision

TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.

Computer vision : a modern approach = 计算机视觉 : 一种现代的方法

David Forsyth, +1 more

TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.

Book

Computer Vision: A Modern Approach

David Forsyth, +1 more

TL;DR: The accessible presentation of this book gives a general view of the entire computer vision enterprise and offers sufficient detail to build useful applications. It includes essential topics that either reflect practical significance or are of theoretical importance.

Proceedings Article

Multiscale conditional random fields for image labeling

Xuming He, +2 more

TL;DR: Proposes an approach to include contextual features for labeling images, in which each pixel is assigned to one of a finite set of labels. The features are incorporated into a probabilistic framework that combines the outputs of several components.

Proceedings Article

High speed obstacle avoidance using monocular vision and reinforcement learning

Jeffrey Lawrence Michels, +2 more

TL;DR: Presents an approach in which supervised learning is first used to estimate depths from single monocular images. The learned model picks up monocular vision cues that accurately estimate the relative depths of obstacles in a scene.

Related Papers (5)

Make3D: Learning 3D Scene Structure from a Single Still Image

Ashutosh Saxena, +2 more

- 01 May 2009 -

IEEE Transactions on Pattern Analysis an...

Depth Map Prediction from a Single Image using a Multi-Scale Deep Network

Indoor segmentation and support inference from RGBD images

Deeper Depth Prediction with Fully Convolutional Residual Networks

Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture