Open Access · Proceedings Article · DOI

A Two-Streamed Network for Estimating Fine-Scaled Depth Maps from Single RGB Images

TLDR
In this article, a fast-to-train two-streamed CNN is proposed to predict depth and depth gradients, which are then fused together into an accurate and detailed depth map.
Abstract
Estimating depth from a single RGB image is an ill-posed and inherently ambiguous problem. State-of-the-art deep learning methods can now estimate accurate 2D depth maps, but when the maps are projected into 3D, they lack local detail and are often highly distorted. We propose a fast-to-train two-streamed CNN that predicts depth and depth gradients, which are then fused together into an accurate and detailed depth map. We also define a novel set loss over multiple images; by regularizing the estimation between a common set of images, the network is less prone to overfitting and achieves better accuracy than competing methods. Experiments on the NYU Depth v2 dataset show that our depth predictions are competitive with the state-of-the-art and lead to faithful 3D projections.
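The abstract does not spell out the fusion step; one common way to combine a depth estimate with predicted depth gradients is to minimize a least-squares energy that balances fidelity to both. The NumPy sketch below is an illustrative stand-in, not the authors' implementation; the energy form, the weight `lam`, and the plain gradient-descent solver are all assumptions:

```python
import numpy as np

def fwd_grad(d):
    """Forward-difference gradients; last column/row of gx/gy are zero."""
    gx = np.zeros_like(d)
    gy = np.zeros_like(d)
    gx[:, :-1] = d[:, 1:] - d[:, :-1]
    gy[:-1, :] = d[1:, :] - d[:-1, :]
    return gx, gy

def fwd_grad_adj(gx, gy):
    """Adjoint (transpose) of fwd_grad, needed for the energy gradient."""
    r = np.zeros_like(gx)
    r[:, 1:] += gx[:, :-1]
    r[:, :-1] -= gx[:, :-1]
    r[1:, :] += gy[:-1, :]
    r[:-1, :] -= gy[:-1, :]
    return r

def fuse(d0, gx0, gy0, lam=1.0, iters=200):
    """Minimize ||d - d0||^2 + lam * ||grad(d) - (gx0, gy0)||^2 by gradient descent."""
    d = d0.copy()
    step = 0.5 / (1.0 + 8.0 * lam)  # below 2/L for this quadratic, so the energy decreases
    for _ in range(iters):
        gx, gy = fwd_grad(d)
        grad_e = 2.0 * (d - d0) + 2.0 * lam * fwd_grad_adj(gx - gx0, gy - gy0)
        d -= step * grad_e
    return d
```

With exact gradients taken from a detailed target and a noisy estimate as `d0`, the minimizer stays anchored to `d0` while its gradients follow the detailed target.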


Citations
Proceedings Article · DOI

Deep Ordinal Regression Network for Monocular Depth Estimation

TL;DR: The Deep Ordinal Regression Network (DORN) discretizes depth and recasts depth network learning as an ordinal regression problem; training the network with an ordinal regression loss achieves much higher accuracy and faster convergence.
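The discretization behind DORN is a spacing-increasing discretization (SID): bin edges are placed uniformly in log-depth, so nearby depths get fine bins and distant depths coarse ones. A minimal sketch (the depth range and bin count below are illustrative, not the paper's settings):

```python
import numpy as np

def sid_thresholds(alpha, beta, k):
    """k+1 bin edges, uniform in log space over [alpha, beta]."""
    i = np.arange(k + 1)
    return np.exp(np.log(alpha) + i * (np.log(beta) - np.log(alpha)) / k)

def ordinal_labels(depth, edges):
    """Ordinal target per pixel: how many interior edges the depth exceeds."""
    return (depth[..., None] >= edges[1:-1]).sum(axis=-1)
```

The count formulation matches ordinal regression: each interior edge contributes one binary "is the depth beyond this threshold?" decision.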
Proceedings Article · DOI

Densely Connected Pyramid Dehazing Network

TL;DR: Zhang et al. propose a Densely Connected Pyramid Dehazing Network (DCPDN), which jointly learns the transmission map, the atmospheric light, and the dehazed image.
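The three quantities DCPDN learns jointly come from the standard atmospheric scattering model, I(x) = J(x)·t(x) + A·(1 − t(x)); given a transmission map t and atmospheric light A, the clear image J follows in closed form. A minimal sketch (the clipping floor `t_min` is an assumption for numerical safety, not from the paper):

```python
import numpy as np

def dehaze(hazy, transmission, airlight, t_min=0.1):
    """Invert the scattering model I = J*t + A*(1-t) for the scene radiance J."""
    t = np.clip(transmission, t_min, 1.0)[..., None]  # broadcast over color channels
    return (hazy - airlight * (1.0 - t)) / t
```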
Proceedings Article · DOI

Enforcing Geometric Constraints of Virtual Normal for Depth Prediction

TL;DR: The authors design a loss term that enforces one simple type of geometric constraint: virtual normal directions determined by three randomly sampled points in the reconstructed 3D space.
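A virtual normal is simply the unit normal of the plane through three non-collinear reconstructed 3D points; the loss compares these directions between prediction and ground truth. A minimal sketch of the normal computation (point sampling, collinearity checks, and the full loss are omitted):

```python
import numpy as np

def virtual_normal(p0, p1, p2, eps=1e-8):
    """Unit normal of the plane spanned by three 3D points."""
    n = np.cross(p1 - p0, p2 - p0)
    return n / (np.linalg.norm(n) + eps)
```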
Posted Content

From big to small: Multi-scale local planar guidance for monocular depth estimation

TL;DR: This paper proposes a network architecture with novel local planar guidance layers at multiple stages of the decoding phase, outperforming state-of-the-art methods by a significant margin on challenging benchmarks.
Proceedings Article · DOI

Deep Depth Completion of a Single RGB-D Image

TL;DR: In this article, a deep network is trained to predict surface normals and occlusion boundaries, which are then combined with the raw depth observations from the RGB-D camera to solve for the depth of all pixels, including those missing in the original observation.
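The paper solves a global optimization guided by predicted normals and boundaries; as a much simpler stand-in for the "solve for all pixels" idea, missing depths can be filled by harmonic interpolation, i.e. repeatedly averaging 4-neighbors while holding observed pixels fixed. This sketch is an illustrative simplification, not the authors' method:

```python
import numpy as np

def complete_depth(depth, mask, iters=300):
    """Fill pixels where mask is False by Jacobi iterations of the Laplace
    equation; observed pixels (mask True) stay fixed throughout."""
    d = np.where(mask, depth, depth[mask].mean())  # initialize holes with the observed mean
    for _ in range(iters):
        padded = np.pad(d, 1, mode='edge')
        avg = (padded[:-2, 1:-1] + padded[2:, 1:-1] +
               padded[1:-1, :-2] + padded[1:-1, 2:]) / 4.0
        d = np.where(mask, depth, avg)  # update holes only
    return d
```

Because linear ramps are discretely harmonic, a hole cut into a planar depth map is recovered exactly in the limit.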
References
Proceedings Article · DOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won first place in the ILSVRC 2015 classification task.
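The residual idea is that a block outputs F(x) + x rather than F(x), so the identity mapping is trivially representable and very deep stacks stay trainable. A toy sketch, with a single dense layer standing in for the paper's convolutional layers:

```python
import numpy as np

def residual_block(x, weight):
    """y = relu(x @ W) + x : the skip connection adds the input back."""
    return np.maximum(x @ weight, 0.0) + x
```

With all-zero weights the block reduces exactly to the identity, which is the property that makes deep residual stacks easy to optimize.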
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Book Chapter · DOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: Ronneberger et al. propose a network and training strategy that relies on strong use of data augmentation to use the available annotated samples more efficiently; it can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Proceedings Article · DOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Proceedings Article · DOI

Image-to-Image Translation with Conditional Adversarial Networks

TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.