scispace - formally typeset
Open AccessProceedings ArticleDOI

Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras

TLDR
Stereo Direct Sparse Odometry (Stereo DSO) as discussed by the authors integrates constraints from static stereo into the bundle adjustment pipeline of temporal multi-view stereo to improve tracking accuracy and robustness.
Abstract
We propose Stereo Direct Sparse Odometry (Stereo DSO) as a novel method for highly accurate real-time visual odometry estimation of large-scale environments from stereo cameras. It jointly optimizes for all the model parameters within the active window, including the intrinsic/extrinsic camera parameters of all keyframes and the depth values of all selected pixels. In particular, we propose a novel approach to integrate constraints from static stereo into the bundle adjustment pipeline of temporal multi-view stereo. Real-time optimization is realized by sampling pixels uniformly from image regions with sufficient intensity gradient. Fixed-baseline stereo resolves scale drift. It also reduces the sensitivities to large optical flow and to rolling shutter effect which are known shortcomings of direct image alignment methods. Quantitative evaluation demonstrates that the proposed Stereo DSO outperforms existing state-of-the-art visual odometry methods both in terms of tracking accuracy and robustness. Moreover, our method delivers a more precise metric 3D reconstruction than previous dense/semi-dense direct approaches while providing a higher reconstruction density than feature-based methods.

read more

Citations
More filters
Proceedings ArticleDOI

Self-Supervised Monocular Depth Hints

TL;DR: This work studies the problem of ambiguous reprojections in depth-prediction from stereo-based self-supervision, and introduces Depth Hints to alleviate their effects, and produces state-of-the-art depth predictions on the KITTI benchmark.
Posted Content

Visual-Inertial Navigation: A Concise Review

TL;DR: This paper surveys thoroughly the research efforts taken in visual-inertial navigation research and strives to provide a concise but complete review of the related work in the hope to accelerate the VINS research and beyond in the authors' society as a whole.
Book ChapterDOI

VSO: Visual Semantic Odometry

TL;DR: A novel visual semantic odometry (VSO) framework to enable medium-term continuous tracking of points using semantics and demonstrates a significant improvement over state-of-the-art baselines in the context of autonomous driving simply by integrating the authors' semantic constraints.
Proceedings ArticleDOI

Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation

TL;DR: In this paper, a student network is trained to predict a disparity map such as to recover from a frame in a camera view the associated image in the opposite view, and a backward cycle network is applied to the generated image to re-synthesize back the input image, estimating the opposite disparity.
Book ChapterDOI

DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization

TL;DR: A Siamese network that jointly learns 3D local feature detection and description directly from raw 3D points and integrates FlexConv and Squeeze-and-Excitation to assure that the learned local descriptor captures multi-level geometric information and channel-wise relations.
References
More filters
Proceedings ArticleDOI

The Cityscapes Dataset for Semantic Urban Scene Understanding

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.
Journal ArticleDOI

Vision meets robotics: The KITTI dataset

TL;DR: A novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research, using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras and a high-precision GPS/IMU inertial navigation system.
Journal ArticleDOI

ORB-SLAM: A Versatile and Accurate Monocular SLAM System

TL;DR: ORB-SLAM as discussed by the authors is a feature-based monocular SLAM system that operates in real time, in small and large indoor and outdoor environments, with a survival of the fittest strategy that selects the points and keyframes of the reconstruction.
Proceedings ArticleDOI

Parallel Tracking and Mapping for Small AR Workspaces

TL;DR: A system specifically designed to track a hand-held camera in a small AR workspace, processed in parallel threads on a dual-core computer, that produces detailed maps with thousands of landmarks which can be tracked at frame-rate with accuracy and robustness rivalling that of state-of-the-art model-based systems.
Journal ArticleDOI

MonoSLAM: Real-Time Single Camera SLAM

TL;DR: The first successful application of the SLAM methodology from mobile robotics to the "pure vision" domain of a single uncontrolled camera, achieving real time but drift-free performance inaccessible to structure from motion approaches is presented.
Related Papers (5)