Discriminant Feature Extraction by Generalized Difference Subspace

doi:10.1109/TPAMI.2022.3168557

Journal ArticleDOI

Discriminant Feature Extraction by Generalized Difference Subspace

Kazuhiro Fukui, +4 more

- 19 Apr 2022 -

IEEE Transactions on Pattern Analysis an...

- Vol. 45, pp 1618-1635

Chats0

TLDR

The discriminant ability of the orthogonal projection of data onto a generalized difference subspace (GDS) both theoretically and experimentally is revealed and two useful extensions of these methods are discussed, nonlinear extension by kernel trick, and the combination of convolutional neural network (CNN) features.

Abstract:

In this paper, we reveal the discriminant capacity of orthogonal data projection onto the generalized difference subspace (GDS), both theoretically and experimentally. In our previous work, we demonstrated that the GDS projection works as a quasi-orthogonalization of class subspaces, which is an effective feature extraction for subspace based classifiers. Here, we further show that GDS projection also works as a discriminant feature extraction through a similar mechanism to the Fisher discriminant analysis (FDA). A direct proof of the connection between GDS projection and FDA is difficult due to the significant difference in their formulations. To circumvent the complication, we first introduce geometrical Fisher discriminant analysis (gFDA) based on a simplified Fisher criterion. It is derived from a heuristic yet practically plausible assumption: the direction of the sample mean vector of a class is largely aligned to the first principal component vector of the class, given that the principal component analysis (PCA) is applied without data centering. gFDA works stably even under few samples, bypassing the small sample size (SSS) problem of FDA. We then prove that gFDA is equivalent to GDS projection with a small correction term. This equivalence ensures GDS projection to inherit the discriminant ability from FDA via gFDA. Furthermore, we discuss two useful extensions of these methods, 1) a nonlinear extension by kernel trick, 2) a combination with CNN features. The equivalence and the effectiveness of the extensions have been verified through extensive experiments on the extended Yale B+, CMU face database, ALOI, ETH80, MNIST, and CIFAR10, mainly focusing on image recognition under small samples.

Discriminant Feature Extraction by Generalized Difference Subspace

Citations

Environmental Sound Classification Based on CNN Latent Subspaces

Time-series Anomaly Detection based on Difference Subspace between Signal Subspaces

Temporal-stochastic tensor features for action recognition

References

The use of multiple measurements in taxonomic problems

Gradient-based learning applied to document recognition

ImageNet Large Scale Visual Recognition Challenge

From few to many: illumination cone models for face recognition under variable lighting and pose

Introduction to statistical pattern recognition (2nd ed.)