Sparse PCA: Optimal rates and adaptive estimation

doi:10.1214/13-AOS1178



T. Tony Cai, +2 more

- 01 Dec 2013 -

Annals of Statistics

- Vol. 41, Iss: 6, pp 3074-3110


TLDR

In this paper, the authors considered both minimax and adaptive estimation of the principal subspace in the high dimensional setting and established the optimal rates of convergence for estimating the subspace which are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in terms of the convergence rate.

Abstract:

Principal component analysis (PCA) is one of the most commonly used statistical procedures with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high-dimensional setting. Under mild technical conditions, we first establish the optimal rates of convergence for estimating the principal subspace which are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in terms of the convergence rate. The lower bound is obtained by calculating the local metric entropy and an application of Fano’s lemma. The rate-optimal estimator is constructed using aggregation, which, however, might not be computationally feasible. We then introduce an adaptive procedure for estimating the principal subspace which is fully data-driven and can be computed efficiently. It is shown that the estimator attains the optimal rates of convergence simultaneously over a large collection of the parameter spaces. A key idea in our construction is a reduction scheme which reduces the sparse PCA problem to a high-dimensional multivariate regression problem. This method is potentially also useful for other related problems.
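To make the estimation problem in the abstract concrete, here is a minimal Python sketch of estimating a sparse principal subspace and measuring the error. It is not the paper's aggregation estimator or its adaptive regression-based procedure. It uses diagonal thresholding, a simple baseline, and it assumes the sparsity level k is known. The squared Frobenius distance between projection matrices is used as the loss, which is an assumption here since the abstract does not state the loss. All function names, the single-spike simulation, and the parameter values are hypothetical choices for illustration.

```python
import numpy as np


def projection_loss(V_hat, V):
    """Squared Frobenius distance between the projections onto span(V_hat) and span(V)."""
    return np.linalg.norm(V_hat @ V_hat.T - V @ V.T, "fro") ** 2


def diagonal_thresholding_pca(X, r, k):
    """Baseline (not the paper's method): keep the k coordinates with the
    largest sample variance, then run ordinary PCA on that submatrix."""
    n, p = X.shape
    S = X.T @ X / n                          # sample covariance, data assumed mean zero
    support = np.argsort(np.diag(S))[-k:]    # indices of the k largest variances
    _, eigvecs = np.linalg.eigh(S[np.ix_(support, support)])
    V_hat = np.zeros((p, r))
    V_hat[support, :] = eigvecs[:, -r:]      # eigh returns eigenvalues in ascending order
    return V_hat


def dense_pca(X, r):
    """Ordinary PCA on all p coordinates, for comparison."""
    S = X.T @ X / X.shape[0]
    _, eigvecs = np.linalg.eigh(S)
    return eigvecs[:, -r:]


def simulate_spiked_data(n, p, k, spike, rng):
    """Draw n samples with covariance I + spike * v v^T, where v is k-sparse."""
    v = np.zeros((p, 1))
    v[:k, 0] = 1.0 / np.sqrt(k)
    u = rng.standard_normal((n, 1))          # latent factor scores
    Z = rng.standard_normal((n, p))          # isotropic noise
    return np.sqrt(spike) * u @ v.T + Z, v


rng = np.random.default_rng(0)
X, V = simulate_spiked_data(n=500, p=200, k=5, spike=20.0, rng=rng)

V_sparse = diagonal_thresholding_pca(X, r=1, k=5)
V_dense = dense_pca(X, r=1)

loss_sparse = projection_loss(V_sparse, V)
loss_dense = projection_loss(V_dense, V)
print(f"diagonal thresholding loss: {loss_sparse:.4f}")
print(f"ordinary PCA loss:          {loss_dense:.4f}")
```

In this strongly spiked setting the thresholded estimate would be expected to have the smaller loss, because it only estimates k coordinates instead of p. The paper's adaptive procedure addresses the harder case where the sparsity level and rank are not supplied in advance.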



