Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras

doi:10.1109/ICCV.2017.421

Open AccessProceedings ArticleDOI

Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras

- pp 3923-3931

TLDR

Stereo Direct Sparse Odometry (Stereo DSO) as discussed by the authors integrates constraints from static stereo into the bundle adjustment pipeline of temporal multi-view stereo to improve tracking accuracy and robustness.

Abstract:

We propose Stereo Direct Sparse Odometry (Stereo DSO) as a novel method for highly accurate real-time visual odometry estimation of large-scale environments from stereo cameras. It jointly optimizes for all the model parameters within the active window, including the intrinsic/extrinsic camera parameters of all keyframes and the depth values of all selected pixels. In particular, we propose a novel approach to integrate constraints from static stereo into the bundle adjustment pipeline of temporal multi-view stereo. Real-time optimization is realized by sampling pixels uniformly from image regions with sufficient intensity gradient. Fixed-baseline stereo resolves scale drift. It also reduces the sensitivities to large optical flow and to rolling shutter effect which are known shortcomings of direct image alignment methods. Quantitative evaluation demonstrates that the proposed Stereo DSO outperforms existing state-of-the-art visual odometry methods both in terms of tracking accuracy and robustness. Moreover, our method delivers a more precise metric 3D reconstruction than previous dense/semi-dense direct approaches while providing a higher reconstruction density than feature-based methods.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Self-Supervised Monocular Depth Hints

Jamie Watson, +3 more

TL;DR: This work studies the problem of ambiguous reprojections in depth-prediction from stereo-based self-supervision, and introduces Depth Hints to alleviate their effects, and produces state-of-the-art depth predictions on the KITTI benchmark.

...read moreread less

Posted Content

Visual-Inertial Navigation: A Concise Review

Guoquan Huang

- 06 Jun 2019 -

arXiv: Robotics

TL;DR: This paper surveys thoroughly the research efforts taken in visual-inertial navigation research and strives to provide a concise but complete review of the related work in the hope to accelerate the VINS research and beyond in the authors' society as a whole.

...read moreread less

Book ChapterDOI

VSO: Visual Semantic Odometry

Konstantinos-Nektarios Lianos, +3 more

TL;DR: A novel visual semantic odometry (VSO) framework to enable medium-term continuous tracking of points using semantics and demonstrates a significant improvement over state-of-the-art baselines in the context of autonomous driving simply by integrating the authors' semantic constraints.

...read moreread less

Proceedings ArticleDOI

Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation

Andrea Pilzer, +3 more

TL;DR: In this paper, a student network is trained to predict a disparity map such as to recover from a frame in a camera view the associated image in the opposite view, and a backward cycle network is applied to the generated image to re-synthesize back the input image, estimating the opposite disparity.

...read moreread less

Book ChapterDOI

DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization

Juan Du, +2 more

TL;DR: A Siamese network that jointly learns 3D local feature detection and description directly from raw 3D points and integrates FlexConv and Squeeze-and-Excitation to assure that the learned local descriptor captures multi-level geometric information and channel-wise relations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

...read moreread less

Journal ArticleDOI

Vision meets robotics: The KITTI dataset

Andreas Geiger, +3 more

- 01 Sep 2013 -

The International Journal of Robotics Re...

TL;DR: A novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research, using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras and a high-precision GPS/IMU inertial navigation system.

...read moreread less

Journal ArticleDOI

ORB-SLAM: A Versatile and Accurate Monocular SLAM System

Raul Mur-Artal, +2 more

- 24 Aug 2015 -

IEEE Transactions on Robotics

TL;DR: ORB-SLAM as discussed by the authors is a feature-based monocular SLAM system that operates in real time, in small and large indoor and outdoor environments, with a survival of the fittest strategy that selects the points and keyframes of the reconstruction.

...read moreread less

Proceedings ArticleDOI

Parallel Tracking and Mapping for Small AR Workspaces

Georg Klein, +1 more

TL;DR: A system specifically designed to track a hand-held camera in a small AR workspace, processed in parallel threads on a dual-core computer, that produces detailed maps with thousands of landmarks which can be tracked at frame-rate with accuracy and robustness rivalling that of state-of-the-art model-based systems.

...read moreread less

Journal ArticleDOI

MonoSLAM: Real-Time Single Camera SLAM

Andrew J. Davison, +3 more

- 01 Jun 2007 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The first successful application of the SLAM methodology from mobile robotics to the "pure vision" domain of a single uncontrolled camera, achieving real time but drift-free performance inaccessible to structure from motion approaches is presented.

...read moreread less

Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras

Citations

Self-Supervised Monocular Depth Hints

Visual-Inertial Navigation: A Concise Review

VSO: Visual Semantic Odometry

Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation

DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization

References

The Cityscapes Dataset for Semantic Urban Scene Understanding

Vision meets robotics: The KITTI dataset

ORB-SLAM: A Versatile and Accurate Monocular SLAM System

Parallel Tracking and Mapping for Small AR Workspaces

MonoSLAM: Real-Time Single Camera SLAM

Related Papers (5)

ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

LSD-SLAM: Large-Scale Direct Monocular SLAM

ORB-SLAM: A Versatile and Accurate Monocular SLAM System

Are we ready for autonomous driving? The KITTI vision benchmark suite

Parallel Tracking and Mapping for Small AR Workspaces