Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

doi:10.1007/S11063-020-10268-X

Journal ArticleDOI

Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

Qian Yu, +3 more

- 01 Aug 2020 -

Neural Processing Letters

- Vol. 52, Iss: 1, pp 581-602

Chats0

TLDR

The experimental results show that the LMVCNN achieves competitive performance in 3D shape recognition on ModelNet10 and ModelNet40 for both the pre-defined and the random viewpoints and exhibits promising performance when the number of view-images is quite small.

Abstract:

The Multi-view Convolution Neural Network (MVCNN) has achieved considerable success in 3D shape recognition. However, 3D shape recognition using view-images from random viewpoints has not been yet exploited in depth. In addition, 3D shape recognition using a small number of view-images remains difficult. To tackle these challenges, we developed a novel Multi-view Convolution Neural Network, “Latent-MVCNN” (LMVCNN), that recognizes 3D shapes using multiple view-images from pre-defined or random viewpoints. The LMVCNN consists of three types of sub Convolution Neural Networks. For each view-image, the first type of CNN outputs multiple category probability distributions and the second type of CNN outputs a latent vector to help the first type of CNN choose the decent distribution. The third type of CNN outputs the transition probabilities from the category probability distributions of one view to the category probability distributions of another view, which further helps the LMVCNN to find the decent category probability distributions for each pair of view-images. The three CNNs cooperate with each other to the obtain satisfactory classification scores. Our experimental results show that the LMVCNN achieves competitive performance in 3D shape recognition on ModelNet10 and ModelNet40 for both the pre-defined and the random viewpoints and exhibits promising performance when the number of view-images is quite small.

Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

Citations

MHFP: Multi-view based hierarchical fusion pooling method for 3D shape recognition

PVLNet: Parameterized-View-Learning neural network for 3D shape recognition

Fusion of a Static and Dynamic Convolutional Neural Network for Multiview 3D Point Cloud Classification

Automatic Representative View Selection of a 3D Cultural Relic Using Depth Variation Entropy and Depth Distribution Entropy

LiSurveying: A high-resolution TLS-LiDAR benchmark

References

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Related Papers (5)

GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition

3D-Assisted Image Feature Synthesis for Novel Views of an Object

View-Adaptive Metric Learning for Multi-view Person Re-identification

Multi-view metric learning for multi-instance image classification.

Learning 3D Object Recognition Models from 2D Images