Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks

doi:10.1109/ICRA.2017.7989161

Open AccessProceedings ArticleDOI

Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks

- pp 1355-1361

TLDR

VoteSDeep as mentioned in this paper leverages a feature-centric voting scheme to implement novel convolutional layers which explicitly exploit the sparsity encountered in the input, and additionally proposes to use an L 1 penalty on the filter activations to further encourage sparsity in the intermediate representations.

Abstract:

This paper proposes a computationally efficient approach to detecting objects natively in 3D point clouds using convolutional neural networks (CNNs). In particular, this is achieved by leveraging a feature-centric voting scheme to implement novel convolutional layers which explicitly exploit the sparsity encountered in the input. To this end, we examine the trade-off between accuracy and speed for different architectures and additionally propose to use an L 1 penalty on the filter activations to further encourage sparsity in the intermediate representations. To the best of our knowledge, this is the first work to propose sparse convolutional layers and L 1 regularisation for efficient large-scale processing of 3D data. We demonstrate the efficacy of our approach on the KITTI object detection benchmark and show that VoteSDeep models with as few as three layers outperform the previous state of the art in both laser and laser-vision based approaches by margins of up to 40% while remaining highly competitive in terms of processing time.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Multi-view 3D Object Detection Network for Autonomous Driving

Xiaozhi Chen, +4 more

TL;DR: This paper proposes Multi-View 3D networks (MV3D), a sensory-fusion framework that takes both LIDAR point cloud and RGB images as input and predicts oriented 3D bounding boxes and designs a deep fusion scheme to combine region-wise features from multiple views and enable interactions between intermediate layers of different paths.

...read moreread less

Proceedings ArticleDOI

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Yin Zhou, +1 more

TL;DR: Zhou et al. as mentioned in this paper propose VoxelNet, a generic 3D detection network that unifies feature extraction and bounding box prediction into a single stage, end-to-end trainable deep network.

...read moreread less

Proceedings ArticleDOI

Frustum PointNets for 3D Object Detection from RGB-D Data

Charles R. Qi, +4 more

TL;DR: This work directly operates on raw point clouds by popping up RGBD scans and leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects.

...read moreread less

Posted Content

PointPillars: Fast Encoders for Object Detection from Point Clouds

Alex H. Lang, +5 more

- 14 Dec 2018 -

arXiv: Learning

TL;DR: PointPillars as mentioned in this paper utilizes PointNets to learn a representation of point clouds organized in vertical columns (pillars), which can be used with any standard 2D convolutional detection architecture.

...read moreread less

Proceedings ArticleDOI

PointPillars: Fast Encoders for Object Detection From Point Clouds

Alex H. Lang, +5 more

TL;DR: benchmarks suggest that PointPillars is an appropriate encoding for object detection in point clouds, and proposes a lean downstream network.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Proceedings ArticleDOI

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.

...read moreread less

Proceedings Article

Deep Sparse Rectifier Neural Networks

Xavier Glorot, +2 more

TL;DR: This paper shows that rectifying neurons are an even better model of biological neurons and yield equal or better performance than hyperbolic tangent networks in spite of the hard non-linearity and non-dierentiabil ity.

...read moreread less

Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks

Citations

Multi-view 3D Object Detection Network for Autonomous Driving

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Frustum PointNets for 3D Object Detection from RGB-D Data

PointPillars: Fast Encoders for Object Detection from Point Clouds

PointPillars: Fast Encoders for Object Detection From Point Clouds

References

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions

Are we ready for autonomous driving? The KITTI vision benchmark suite

Deep Sparse Rectifier Neural Networks

Related Papers (5)

Multi-view 3D Object Detection Network for Autonomous Driving

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Are we ready for autonomous driving? The KITTI vision benchmark suite