OctNet: Learning Deep 3D Representations at High Resolutions

doi:10.1109/CVPR.2017.701

Open AccessProceedings ArticleDOI

OctNet: Learning Deep 3D Representations at High Resolutions

- pp 6620-6629

TLDR

The utility of the OctNet representation is demonstrated by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.

Abstract:

We present OctNet, a representation for deep learning with sparse 3D data. In contrast to existing models, our representation enables 3D convolutional networks which are both deep and high resolution. Towards this goal, we exploit the sparsity in the input data to hierarchically partition the space using a set of unbalanced octrees where each leaf node stores a pooled feature representation. This allows to focus memory allocation and computation to the relevant dense regions and enables deeper networks without compromising resolution. We demonstrate the utility of our OctNet representation by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.

Citations

PDF

Open Access

More filters

Posted Content

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Charles R. Qi, +3 more

- 07 Jun 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set and proposes novel set learning layers to adaptively combine features from multiple scales to learn deep point set features efficiently and robustly.

...read moreread less

Proceedings ArticleDOI

DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation

Jeong Joon Park, +4 more

TL;DR: DeepSDF as mentioned in this paper represents a shape's surface by a continuous volumetric field: the magnitude of a point in the field represents the distance to the surface boundary and the sign indicates whether the region is inside (-) or outside (+) of the shape.

...read moreread less

Proceedings ArticleDOI

Frustum PointNets for 3D Object Detection from RGB-D Data

Charles R. Qi, +4 more

TL;DR: This work directly operates on raw point clouds by popping up RGBD scans and leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects.

...read moreread less

Proceedings ArticleDOI

KPConv: Flexible and Deformable Convolution for Point Clouds

Hugues Thomas, +5 more

TL;DR: KPConv is a new design of point convolution, i.e. that operates on point clouds without any intermediate representation, that outperform state-of-the-art classification and segmentation approaches on several datasets.

...read moreread less

Proceedings Article

PointCNN: convolution on Χ -transformed points

Yangyan Li, +5 more

TL;DR: This work proposes to learn an Χ-transformation from the input points to simultaneously promote two causes: the first is the weighting of the input features associated with the points, and the second is the permutation of the points into a latent and potentially canonical order.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 04 Jun 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Faster R-CNN as discussed by the authors proposes a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-NN for detection.

...read moreread less

Proceedings Article

Faster R-CNN: towards real-time object detection with region proposal networks

Shaoqing Ren, +3 more

TL;DR: Ren et al. as discussed by the authors proposed a region proposal network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals.

...read moreread less

Collapse

OctNet: Learning Deep 3D Representations at High Resolutions

Citations

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation

Frustum PointNets for 3D Object Detection from RGB-D Data

KPConv: Flexible and Deformable Convolution for Point Clouds

PointCNN: convolution on Χ -transformed points

References

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

ImageNet Classification with Deep Convolutional Neural Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Faster R-CNN: towards real-time object detection with region proposal networks

Related Papers (5)

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

3D ShapeNets: A deep representation for volumetric shapes

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Multi-view Convolutional Neural Networks for 3D Shape Recognition