Joint Object Pose Estimation and Shape Reconstruction in Urban Street Scenes Using 3D Shape Priors

doi:10.1007/978-3-319-45886-1_18

Book ChapterDOI

Joint Object Pose Estimation and Shape Reconstruction in Urban Street Scenes Using 3D Shape Priors

- pp 219-230

TLDR

This work proposes a novel approach for using compact shape manifolds of the shape within an object class for object segmentation, pose and shape estimation and demonstrates that the shape manifold alignment method yields improved results over the initial stereo reconstruction and object detection method in depth and pose accuracy.

Abstract:

Estimating the pose and 3D shape of a large variety of instances within an object class from stereo images is a challenging problem, especially in realistic conditions such as urban street scenes. We propose a novel approach for using compact shape manifolds of the shape within an object class for object segmentation, pose and shape estimation. Our method first detects objects and estimates their pose coarsely in the stereo images using a state-of-the-art 3D object detection method. An energy minimization method then aligns shape and pose concurrently with the stereo reconstruction of the object. In experiments, we evaluate our approach for detection, pose and shape estimation of cars in real stereo images of urban street scenes. We demonstrate that our shape manifold alignment method yields improved results over the initial stereo reconstruction and object detection method in depth and pose accuracy.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

Francis Engelmann, +3 more

TL;DR: This paper builds upon PointNet and proposes two extensions that enlarge the receptive field over the 3D scene and evaluates the proposed strategies on challenging indoor and outdoor datasets and shows improved results in both scenarios.

...read moreread less

Proceedings ArticleDOI

Learning 3D Shape Completion from Laser Scan Data with Weak Supervision

David Stutz, +1 more

TL;DR: This work proposes a weakly-supervised learning-based approach to 3D shape completion which neither requires slow optimization nor direct supervision and is able to compete with a fully supervised baseline and a state-of-the-art data-driven approach while being significantly faster.

...read moreread less

Proceedings ArticleDOI

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

Xibin Song, +8 more

TL;DR: The ApolloCar3D dataset as discussed by the authors contains 5,277 driving images and over 60k car instances, where each car is fitted with an industry-grade 3D CAD model with absolute model size and semantically labeled keypoints.

...read moreread less

Proceedings ArticleDOI

Leveraging Shape Completion for 3D Siamese Tracking

Silvio Giancola, +2 more

TL;DR: In this article, a Siamese tracker was designed to encode model and candidate shapes into a compact latent representation, and shape completion regularization was used to improve the tracking performance.

...read moreread less

Journal ArticleDOI

Dense 3D Object Reconstruction from a Single Depth View

Bo Yang, +4 more

- 01 Dec 2019 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The key idea is to combine the generative capabilities of 3D encoder-decoder and the conditional adversarial networks framework, to infer accurate and fine-grained 3D structures of objects in high-dimensional voxel space.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Marching cubes: A high resolution 3D surface construction algorithm

William E. Lorensen, +1 more

TL;DR: In this paper, a divide-and-conquer approach is used to generate inter-slice connectivity, and then a case table is created to define triangle topology using linear interpolation.

...read moreread less

Proceedings ArticleDOI

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.

...read moreread less

Journal ArticleDOI

Stereo Processing by Semiglobal Matching and Mutual Information

Heiko Hirschmüller

- 01 Feb 2008 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper describes the Semi-Global Matching (SGM) stereo method, which uses a pixelwise, Mutual Information based matching cost for compensating radiometric differences of input images and demonstrates a tolerance against a wide range of radiometric transformations.

...read moreread less

Proceedings ArticleDOI

Object scene flow for autonomous vehicles

Moritz Menze, +1 more

TL;DR: A novel model and dataset for 3D scene flow estimation with an application to autonomous driving by representing each element in the scene by its rigid motion parameters and each superpixel by a 3D plane as well as an index to the corresponding object.

...read moreread less

Proceedings Article

Learning Depth from Single Monocular Images

Ashutosh Saxena, +2 more

TL;DR: This work begins by collecting a training set of monocular images (of unstructured outdoor environments which include forests, trees, buildings, etc.) and their corresponding ground-truth depthmaps, and applies supervised learning to predict the depthmap as a function of the image.

...read moreread less

Related Papers (5)

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

ShapeNet: An Information-Rich 3D Model Repository

Angel X. Chang, +12 more

- 09 Dec 2015 -

arXiv: Graphics

Joint Object Pose Estimation and Shape Reconstruction in Urban Street Scenes Using 3D Shape Priors

Citations

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

Learning 3D Shape Completion from Laser Scan Data with Weak Supervision

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

Leveraging Shape Completion for 3D Siamese Tracking

Dense 3D Object Reconstruction from a Single Depth View

References

Marching cubes: A high resolution 3D surface construction algorithm

Are we ready for autonomous driving? The KITTI vision benchmark suite

Stereo Processing by Semiglobal Matching and Mutual Information

Object scene flow for autonomous vehicles

Learning Depth from Single Monocular Images

Related Papers (5)

Are we ready for autonomous driving? The KITTI vision benchmark suite

ShapeNet: An Information-Rich 3D Model Repository

Mask R-CNN

Adam: A Method for Stochastic Optimization

Monocular 3D Object Detection for Autonomous Driving