Learning Depth via Interaction.

Open Access · Posted Content

TLDR

This work trains a specialized global-local network architecture using only what would be available to a robot interacting with its environment: extremely sparse depth measurements, down to even a single pixel per image. Learning from such sparse supervision can be valuable to robotic systems acting under severe bandwidth or sensing constraints.

Abstract:

Motivated by the astonishing capabilities of natural intelligent agents and inspired by theories from psychology, this paper explores the idea that perception gets coupled to 3D properties of the world via interaction with the environment. Existing works for depth estimation require either massive amounts of annotated training data or some form of hard-coded geometrical constraint. This paper explores a new approach to learning depth perception that requires neither. Specifically, we train a specialized global-local network architecture with what would be available to a robot interacting with the environment: from extremely sparse depth measurements down to even a single pixel per image. From a pair of consecutive images, our proposed network outputs a latent representation of the observer’s motion between the images and a dense depth map. Experiments on several datasets show that, when ground truth is available even for just one of the image pixels, the proposed network can learn monocular dense depth estimation up to 22.5% more accurately than state-of-the-art approaches. We believe that this work, in addition to its scientific interest, lays the foundations to learn depth from extremely sparse supervision, which can be valuable to all robotic systems acting under severe bandwidth or sensing constraints.
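To make the idea of "extremely sparse supervision" concrete, the following is a minimal sketch of a training loss that compares a dense depth prediction against ground truth only at the pixels where a measurement exists, here a single pixel. The abstract does not specify the loss actually used; the L1 form, the function name `sparse_l1_loss`, and the mask representation are all assumptions made for illustration.

```python
from typing import List

Grid = List[List[float]]


def sparse_l1_loss(pred: Grid, target: Grid, mask: List[List[bool]]) -> float:
    """Mean absolute depth error over supervised pixels only.

    pred   -- dense depth map predicted by a (hypothetical) network
    target -- ground-truth depth, meaningful only where mask is True
    mask   -- True at the few pixels that carry a depth measurement
    """
    total, count = 0.0, 0
    for p_row, t_row, m_row in zip(pred, target, mask):
        for p, t, m in zip(p_row, t_row, m_row):
            if m:  # unsupervised pixels contribute nothing to the loss
                total += abs(p - t)
                count += 1
    # With no measurement at all there is nothing to penalize.
    return total / count if count else 0.0


# Toy 4x4 image with ground truth at exactly one pixel, (row 1, col 2).
pred = [[2.0] * 4 for _ in range(4)]
target = [[0.0] * 4 for _ in range(4)]
mask = [[False] * 4 for _ in range(4)]
target[1][2] = 2.5
mask[1][2] = True

loss = sparse_l1_loss(pred, target, mask)
empty_loss = sparse_l1_loss(pred, target, [[False] * 4 for _ in range(4)])
```

In this sketch every pixel of the prediction is dense, but only the masked pixel drives the error signal; the zeros elsewhere in `target` are placeholders and never enter the sum.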
