Statistical Computations on Grassmann and Stiefel Manifolds for Image and Video-Based Recognition
read more
Citations
Handbook of mathematical methods in imaging
3-D Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold
Neural Aggregation Network for Video Face Recognition
Neural Aggregation Network for Video Face Recognition
Projection Metric Learning on Grassmann Manifold with Application to Video based Face Recognition
References
From few to many: illumination cone models for face recognition under variable lighting and pose
Statistical shape analysis: clustering, learning, and testing
The Geometry of Algorithms with Orthogonality Constraints
Related Papers (5)
Frequently Asked Questions (18)
Q2. What are the future works mentioned in the paper "Statistical computations on grassmann and stiefel manifolds for image and video-based recognition" ?
In addition to definitions of distances and statistics on manifolds, many interesting problems such as interpolation, smoothing, and time-series modeling on these manifolds of interest are potential directions of future work.
Q3. What is the common distance metric used for comparison of models?
For comparison of models, the most commonly used distance metric is based on subspace angles between column-spaces of the observability matrices [31].
Q4. What are the potential directions of future work?
In addition to definitions of distances and statistics on manifolds, many interesting problems such as interpolation, smoothing, and time-series modeling on these manifolds of interest are potential directions of future work.
Q5. How do the authors evaluate the ith class conditional density at a test-point?
To evaluate the ith class conditional density at a test-point, one merely evaluates the truncated Gaussian by mapping the test-point to the tangent-space at pi.
Q6. How many actors are in the INRIA dataset?
The dataset consists of 10 actors performing 11 actions, each action executed 3 times at varying rates while freely changing orientation.
Q7. What is the common approach for learning the temporal dynamics in the lower-dimensional space?
For high-dimensional time-series data (dynamic textures etc), the most common approach is to first learn a lower-dimensional embedding of the observations via PCA, and learn the temporal dynamics in the lower-dimensional space.
Q8. What is the quotient space of SO(n)?
In order to obtain a quotient space structure for Gn,d , let SO(d)×SO(n−d) be a subgroup of SO(n) using the embedding φb : (SO(d)×SO(n−d))→ SO(n):φb(V1,V2) =V1 00 V2 ∈ SO(n).
Q9. What is the complexity of the OT exp(tA)J?
The computation of the geodesic OT exp(tA)J in the direct form implies a complexity of O(n3), where n = mp for the observability matrix, and n = p for the case of PCA basis vectors.
Q10. What is the way to estimate the parameters of a family of pdfs?
The authors can estimate the parameters of a family of pdfs such as Gaussian, or mixtures of Gaussian and then use the exponential map to wrap these parameters back onto the manifold.
Q11. How do the authors evaluate the class conditional probability using truncated wrapped Gaussian?
As mentioned in section V-A, to evaluate the class conditional probability using truncated wrapped Gaussians, the authors also need to adjust the normalizing constant of each Gaussian.
Q12. What is the common way to model the face of a person under different illumination conditions?
Motivated by this, the set of face images of the same person under varying illumination conditions is frequently modeled as a linear subspace of 9-dimensions [38].
Q13. What is the quotient of the Stiefel manifold?
In their case, it turns out that the Stiefel manifold itself can be interpreted as a quotient of a more basic manifold - the special orthogonal group SO(n).
Q14. What is the introduction to the geometry of the Grassmann manifold?
A good introduction to the geometry of the Stiefel and Grassmann manifolds can be found in [10] who introduced gradient methods on these manifolds in the context of eigenvalue problems.
Q15. What was the first version of this paper?
A preliminary version of this paper was presented in [1], which used extrinsic methods for statistical modeling on the Grassmann manifold.
Q16. What is the equivalence class of Gn,d?
An equivalence class is given by:[O]b = {Oφb(V1,V2)|V1 ∈ SO(d), V2 ∈ SO(n−d)} ,and the set of all such equivalence classes is Gn,d .
Q17. How can the authors reduce the complexity of the OT operations?
By exploiting the special structure of the matrix A, it is possible to reduce the complexity of these operations to no more than O(nd2) and O(d3) which represents a significant reduction.
Q18. What is the tangent space for a point O?
For an arbitrary point O ∈ SO(n), the tangent space at that point is obtained by a simple rotation of TI(SO(n)): TO(SO(n)) = {OX |X ∈ TI(SO(n))}.