scispace - formally typeset
Proceedings ArticleDOI

Covariance discriminative learning: A natural and efficient approach to image set classification

TLDR
A novel discriminative learning approach to image set classification by modeling the image set with its natural second-order statistic, i.e. covariance matrix, which shows the superiority of this method over state-of-the-art ones in both accuracy and efficiency, but also its stability to two real challenges: noisy set data and varying set size.
Abstract
We propose a novel discriminative learning approach to image set classification by modeling the image set with its natural second-order statistic, i.e. covariance matrix. Since nonsingular covariance matrices, a.k.a. symmetric positive definite (SPD) matrices, lie on a Riemannian manifold, classical learning algorithms cannot be directly utilized to classify points on the manifold. By exploring an efficient metric for the SPD matrices, i.e., Log-Euclidean Distance (LED), we derive a kernel function that explicitly maps the covariance matrix from the Riemannian manifold to a Euclidean space. With this explicit mapping, any learning method devoted to vector space can be exploited in either its linear or kernel formulation. Linear Discriminant Analysis (LDA) and Partial Least Squares (PLS) are considered in this paper for their feasibility for our specific problem. We further investigate the conventional linear subspace based set modeling technique and cast it in a unified framework with our covariance matrix based modeling. The proposed method is evaluated on two tasks: face recognition and object categorization. Extensive experimental results show not only the superiority of our method over state-of-the-art ones in both accuracy and efficiency, but also its stability to two real challenges: noisy set data and varying set size.

read more

Citations
More filters
Proceedings ArticleDOI

Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition

TL;DR: This paper attempts to solve temporal alignment and semantics-aware dynamic representation problems via manifold modeling of videos based on a novel mid-level representation, i.e. expressionlet, and reports results better than the known state-of-the-art.
Posted Content

Quality Aware Network for Set to Set Recognition

TL;DR: Analysis on gradient spread of this mechanism indicates that the quality learned by the network is beneficial to set-to-set recognition and simplifies the distribution that the network needs to fit.
Proceedings ArticleDOI

Quality Aware Network for Set to Set Recognition

TL;DR: In this article, the quality of each sample can be automatically learned in the training stage, although such information is not explicitly provided during the training process, and the network has two branches, where the first branch extracts appearance feature embedding and the other branch predicts quality score for each sample.
Proceedings ArticleDOI

Projection Metric Learning on Grassmann Manifold with Application to Video based Face Recognition

TL;DR: This work proposes a novel method to learn the Projection Metric directly on Grassmann manifold rather than in Hilbert space, which can be regarded as performing a geometry-aware dimensionality reduction from the original Grassmann manifolds to a lower-dimensional, more discriminative Grassman manifold where more favorable classification can be achieved.
Proceedings Article

Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification

TL;DR: This paper proposes a novel metric learning approach to work directly on logarithms of SPD matrices by learning a tangent map that can directly transform the matrix Log-Euclidean Metric from the original tangent space to a new tangentspace of more discriminability.
References
More filters
Journal ArticleDOI

Robust Real-Time Face Detection

TL;DR: In this paper, a face detection framework that is capable of processing images extremely rapidly while achieving high detection rates is described. But the detection performance is limited to 15 frames per second.
Proceedings ArticleDOI

Robust real-time face detection

TL;DR: A new image representation called the “Integral Image” is introduced which allows the features used by the detector to be computed very quickly and a method for combining classifiers in a “cascade” which allows background regions of the image to be quickly discarded while spending more computation on promising face-like regions.
Book ChapterDOI

Relations Between Two Sets of Variates

TL;DR: The concept of correlation and regression may be applied not only to ordinary one-dimensional variates but also to variates of two or more dimensions as discussed by the authors, where the correlation of the horizontal components is ordinarily discussed, whereas the complex consisting of horizontal and vertical deviations may be even more interesting.
Book

Partial Least Squares for Discrimination

TL;DR: Partial least squares (PLS) was not originally designed as a tool for statistical discrimination as discussed by the authors, but applied scientists routinely use PLS for classification and there is substantial empirical evidence to suggest that it performs well in that role.