Second-Order Spectral Transform Block for 3D Shape Classification and Retrieval

doi:10.1109/TIP.2020.2967579

Journal ArticleDOI

Second-Order Spectral Transform Block for 3D Shape Classification and Retrieval

Ruixuan Yu, +2 more

- 23 Jan 2020 -

IEEE Transactions on Image Processing

- Vol. 29, pp 4530-4543

TLDR

A novel network block that generalizes the second-order pooling to 3D surface by designing a learnable non-linear transform on the spectrum of the pooled descriptor is proposed, showing its superiority compared with traditional second- order pooling methods.

Abstract:

In this paper, we propose a novel network block, dubbed as second-order spectral transform block, for 3D shape retrieval and classification. This network block generalizes the second-order pooling to 3D surface by designing a learnable non-linear transform on the spectrum of the pooled descriptor. The proposed block consists of following two components. First, the second-order average (SO-Avr) and max-pooling (SO-Max) operations are designed on 3D surface to aggregate local descriptors, which are shown to be more discriminative than the popular average-pooling or max-pooling. Second, a learnable spectral transform parameterized by mixture of power function is proposed to perform non-linear feature mapping in the space of pooled descriptors, i.e., manifold of symmetric positive definite matrix for SO-Avr, and space of symmetric matrix for SO-Max. The proposed block can be plugged into existing network architectures to aggregate local shape descriptors for boosting their performance. We apply it to a shallow network for non-rigid 3D shape analysis and to existing networks for rigid shape analysis, where it improves the first-tier retrieval accuracy by 7.2% on SHREC’14 Real dataset and achieves state-of-the-art classification accuracy on ModelNet40. As an extension, we apply our block to 2D image classification, showing its superiority compared with traditional second-order pooling methods. We also provide theoretical and experimental analysis on stability of the proposed second-order spectral transform block.

Second-Order Spectral Transform Block for 3D Shape Classification and Retrieval

Citations

Embedding Regularizer Learning for Multi-View Semi-Supervised Classification

Learning isometry-invariant representations for point cloud analysis

EFSCNN: Encoded Feature Sphere Convolution Neural Network for fast non-rigid 3D models classification and retrieval

Average increment scale-invariant heat kernel signature for 3D non-rigid shape analysis

Scale-invariant Mexican Hat wavelet descriptor for non-rigid shape similarity measurement

References

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

3D ShapeNets: A deep representation for volumetric shapes

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Related Papers (5)

Shape Classification using Spectral Graph Wavelets

Spectral shape classification

Spatially aggregating spectral descriptors for nonrigid 3D shape retrieval: a comparative survey

A spectral graph wavelet approach for nonrigid 3D shape retrieval

A multiresolution descriptor for deformable 3D shape retrieval