scispace - formally typeset
Journal ArticleDOI

Second-Order Spectral Transform Block for 3D Shape Classification and Retrieval

TLDR
A novel network block that generalizes the second-order pooling to 3D surface by designing a learnable non-linear transform on the spectrum of the pooled descriptor is proposed, showing its superiority compared with traditional second- order pooling methods.
Abstract
In this paper, we propose a novel network block, dubbed as second-order spectral transform block, for 3D shape retrieval and classification. This network block generalizes the second-order pooling to 3D surface by designing a learnable non-linear transform on the spectrum of the pooled descriptor. The proposed block consists of following two components. First, the second-order average (SO-Avr) and max-pooling (SO-Max) operations are designed on 3D surface to aggregate local descriptors, which are shown to be more discriminative than the popular average-pooling or max-pooling. Second, a learnable spectral transform parameterized by mixture of power function is proposed to perform non-linear feature mapping in the space of pooled descriptors, i.e., manifold of symmetric positive definite matrix for SO-Avr, and space of symmetric matrix for SO-Max. The proposed block can be plugged into existing network architectures to aggregate local shape descriptors for boosting their performance. We apply it to a shallow network for non-rigid 3D shape analysis and to existing networks for rigid shape analysis, where it improves the first-tier retrieval accuracy by 7.2% on SHREC’14 Real dataset and achieves state-of-the-art classification accuracy on ModelNet40. As an extension, we apply our block to 2D image classification, showing its superiority compared with traditional second-order pooling methods. We also provide theoretical and experimental analysis on stability of the proposed second-order spectral transform block.

read more

Citations
More filters
Journal ArticleDOI

Embedding Regularizer Learning for Multi-View Semi-Supervised Classification

TL;DR: Zhang et al. as mentioned in this paper proposed an embedding regularizer learning scheme for multi-view semi-supervised classification (ERL-MVSC), which integrates diversity, sparsity and consensus to dexterously manipulate multiview data with limited labels.
Journal ArticleDOI

Learning isometry-invariant representations for point cloud analysis

TL;DR: Wang et al. as mentioned in this paper proposed a method to learn isometry-invariant representations for point cloud which is strictly rotation invariant and capable of tolerating pose variations, and a new publicly available dataset, PKUnon-rigid, is provided for non-rigidity 3D shape analysis.
Journal ArticleDOI

EFSCNN: Encoded Feature Sphere Convolution Neural Network for fast non-rigid 3D models classification and retrieval

TL;DR: In this article , an encoded feature sphere (EFS) is constructed to express the non-rigid 3D model through feature encoding, spherical projection, and crucial points extraction.
Journal ArticleDOI

Scale-invariant Mexican Hat wavelet descriptor for non-rigid shape similarity measurement

TL;DR: In this article , the scale-invariant Mexican Hat wavelet (SIMHW) descriptor was proposed to improve the robustness of the model with scale transformation. But the feature description performance is affected to some extent.
References
More filters
Proceedings ArticleDOI

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

TL;DR: This paper designs a novel type of neural network that directly consumes point clouds, which well respects the permutation invariance of points in the input and provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing.
Posted Content

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

TL;DR: A hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set and proposes novel set learning layers to adaptively combine features from multiple scales to learn deep point set features efficiently and robustly.
Proceedings ArticleDOI

3D ShapeNets: A deep representation for volumetric shapes

TL;DR: This work proposes to represent a geometric 3D shape as a probability distribution of binary variables on a 3D voxel grid, using a Convolutional Deep Belief Network, and shows that this 3D deep representation enables significant performance improvement over the-state-of-the-arts in a variety of tasks.
Proceedings Article

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

TL;DR: PointNet++ as discussed by the authors applies PointNet recursively on a nested partitioning of the input point set to learn local features with increasing contextual scales, and proposes novel set learning layers to adaptively combine features from multiple scales.
Proceedings ArticleDOI

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

TL;DR: VoxNet is proposed, an architecture to tackle the problem of robust object recognition by integrating a volumetric Occupancy Grid representation with a supervised 3D Convolutional Neural Network (3D CNN).
Related Papers (5)