Open Access · Journal Article · DOI

Sparse PCA: Optimal rates and adaptive estimation

T. Tony Cai, Zongming Ma, Yihong Wu
The Annals of Statistics, 01 Dec 2013, Vol. 41, Iss. 6, pp. 3074–3110
TLDR
The authors consider both minimax and adaptive estimation of the principal subspace in the high-dimensional setting and establish optimal rates of convergence that are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in terms of the convergence rate.
Abstract
Principal component analysis (PCA) is one of the most commonly used statistical procedures, with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high-dimensional setting. Under mild technical conditions, we first establish the optimal rates of convergence for estimating the principal subspace, which are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in terms of the convergence rate. The lower bound is obtained by calculating the local metric entropy and applying Fano's lemma. The rate-optimal estimator is constructed using aggregation, which, however, might not be computationally feasible. We then introduce an adaptive procedure for estimating the principal subspace which is fully data driven and can be computed efficiently. It is shown that the estimator attains the optimal rates of convergence simultaneously over a large collection of parameter spaces. A key idea in our construction is a reduction scheme which reduces the sparse PCA problem to a high-dimensional multivariate regression problem. This method is potentially useful for other related problems as well.
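To make the estimation target concrete, the sketch below (an illustration of the problem setup, not the authors' aggregation or regression-based procedures) draws data from a simple single-spike covariance model with a sparse leading eigenvector and evaluates plain PCA under the sin-theta (projection) loss that is standard for subspace estimation; all parameter values are arbitrary illustrative choices.

```python
# Illustrative sketch only: a spiked covariance model with a sparse leading
# eigenvector, and the sin-theta style subspace loss used to measure error.
# All parameter values (n, p, k, lam) are arbitrary choices for illustration.
import numpy as np

rng = np.random.default_rng(0)
n, p, k, lam = 200, 500, 10, 5.0     # samples, dimension, sparsity, spike strength

v = np.zeros(p)
v[:k] = rng.normal(size=k)
v /= np.linalg.norm(v)               # sparse unit leading eigenvector

# Sigma = I + lam * v v^T; sample X with rows distributed N(0, Sigma)
X = rng.normal(size=(n, p)) + np.sqrt(lam) * rng.normal(size=(n, 1)) * v

S = X.T @ X / n                      # sample covariance
v_hat = np.linalg.eigh(S)[1][:, -1]  # leading sample eigenvector (plain PCA)

# Squared sin-theta distance between the spans of v_hat and v; for
# one-dimensional subspaces, ||P_hat - P||_F^2 / 2 = 1 - <v_hat, v>^2.
loss = 1.0 - (v_hat @ v) ** 2
print(f"sin^2(theta) loss of plain PCA: {loss:.3f}")
```

When p is much larger than n, plain PCA is known to be inconsistent in this model, which is what motivates the sparsity-exploiting estimators the paper analyzes.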



Citations
Journal Article · DOI

A useful variant of the Davis--Kahan theorem for statisticians

TL;DR: The authors present a variant of the Davis–Kahan theorem that relies only on a population eigenvalue separation condition, making it more natural and convenient for direct application in statistical contexts; in many cases it also improves on the usual bound.
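In its single-eigenvector form the variant reads roughly as follows (a sketch from memory of the usual statement, with only the population eigenvalues λ1 ≥ … ≥ λp of Σ appearing in the denominator; not restated from the paper itself):

```latex
% Hedged restatement (single-eigenvector case) of the Davis--Kahan variant:
% only the population eigenvalues of \Sigma enter the eigengap, not those
% of the perturbed matrix \hat\Sigma.
\[
  \sin\angle(\hat v_j, v_j)
  \;\le\;
  \frac{2\,\|\hat\Sigma - \Sigma\|_{\mathrm{op}}}
       {\min(\lambda_{j-1} - \lambda_j,\; \lambda_j - \lambda_{j+1})},
\]
with the conventions $\lambda_0 = \infty$ and $\lambda_{p+1} = -\infty$.
```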
Posted Content

Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees

TL;DR: This work provides a simple set of conditions under which projected gradient descent, when given a suitable initialization, converges geometrically to a statistically useful solution to the factorized optimization problem with rank constraints.
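As a minimal illustration of the factorized approach (a plain factored gradient descent sketch with spectral initialization; the projection step the paper uses to enforce additional constraints is omitted, and all problem sizes, step sizes, and iteration counts are arbitrary):

```python
# Minimal sketch of factorized gradient descent for low-rank denoising:
# estimate a rank-r PSD matrix M from a noisy observation Y = M + noise by
# descending f(U) = ||Y - U U^T||_F^2 / 4 from a spectral initialization.
import numpy as np

rng = np.random.default_rng(1)
d, r = 60, 3
A = rng.normal(size=(d, r))
M = A @ A.T                                  # ground-truth rank-r PSD matrix
Y = M + 0.1 * rng.normal(size=(d, d))
Y = (Y + Y.T) / 2                            # symmetrize the observation

# Spectral initialization: top-r eigenpairs of Y
w, V = np.linalg.eigh(Y)
U = V[:, -r:] * np.sqrt(np.maximum(w[-r:], 0.0))

eta = 0.5 / np.abs(w).max()                  # conservative step size
for _ in range(300):
    grad = (U @ U.T - Y) @ U                 # gradient of ||Y - UU^T||_F^2 / 4
    U -= eta * grad

print("relative error:", np.linalg.norm(U @ U.T - M) / np.linalg.norm(M))
```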
Journal Article

Truncated power method for sparse eigenvalue problems

TL;DR: The authors propose a truncated power method that approximately solves the nonconvex optimization problem underlying the sparse eigenvalue problem, namely extracting dominant (largest) sparse eigenvectors with at most k nonzero components.
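The core iteration is easy to state: a power step, hard truncation to the k largest-magnitude entries, and renormalization. Below is a minimal sketch under those simplifications; it is not the authors' exact implementation, whose initialization and stopping rules matter in practice.

```python
# Minimal sketch of a truncated power iteration for the sparse eigenvalue
# problem: alternate a power step with hard truncation to the k entries of
# largest magnitude, then renormalize. Initialization and stopping rules
# are simplified relative to a careful implementation.
import numpy as np

def truncated_power_method(A, k, iters=100):
    """Approximate the dominant k-sparse eigenvector of a symmetric matrix A."""
    p = A.shape[0]
    x = np.zeros(p)
    x[np.argmax(np.abs(np.diag(A)))] = 1.0   # crude one-sparse initialization
    for _ in range(iters):
        y = A @ x                             # power step
        support = np.argsort(np.abs(y))[-k:]  # keep k largest-magnitude entries
        x = np.zeros(p)
        x[support] = y[support]
        x /= np.linalg.norm(x)                # renormalize
    return x
```

Applied to a sample covariance matrix (such as S from the earlier sketch), the returned vector is an estimate of the leading sparse eigenvector with at most k nonzero components.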
Journal Article · DOI

An overview of the estimation of large covariance and precision matrices

TL;DR: In this article, the authors provide a selective review of several recent developments on the estimation of large covariance and precision matrices, focusing on two general approaches: a rank-based method and a factor-model based method.
Proceedings Article

Complexity Theoretic Lower Bounds for Sparse Principal Component Detection

TL;DR: The performance of a test is measured by the smallest signal strength it can detect. A computationally efficient method based on semidefinite programming is proposed, and it is proved that the statistical performance of this test cannot be strictly improved by any computationally efficient method.
References
Journal Article · DOI

Approximation dans les espaces métriques et théorie de l'estimation [Approximation in metric spaces and estimation theory]

TL;DR: The authors investigate the relation between the speed of estimation and the metric structure of the parameter space Θ, especially when its metric dimension is infinite, and give a construction of a sort of universal estimate whose risk is bounded by C 2 r q(n) in all cases where the preceding theory applies.
Journal Article · DOI

Sparse principal component analysis and iterative thresholding

TL;DR: Under a spiked covariance model, a new iterative thresholding approach is proposed for estimating principal subspaces in the setting where the leading eigenvectors are sparse; the new approach is found to recover the principal subspace and leading eigenvectors consistently, and even optimally, in a range of high-dimensional sparse settings.
Journal Article · DOI

High-dimensional analysis of semidefinite relaxations for sparse principal components

TL;DR: The authors consider a spiked covariance model in which a base matrix is perturbed by adding a k-sparse maximal eigenvector, and analyze two computationally tractable methods for recovering the support set of this maximal eigenvector: (a) a simple diagonal thresholding method, which transitions from success to failure as a function of the rescaled sample size θ_dia(n, p, k) = n/[k² log(p − k)]; and (b) a more sophisticated semidefinite programming (SDP) relaxation, whose transition is governed by θ_sdp(n, p, k) = n/[k log(p − k)].
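Diagonal thresholding itself is a one-line screening rule: coordinates in the support of the sparse spike have inflated variance, so one keeps the coordinates with the largest sample variances. A minimal sketch, with a fixed cardinality k standing in for the calibrated threshold that an actual analysis would use:

```python
# Minimal sketch of diagonal thresholding for support recovery in a spiked
# model: rank coordinates by their sample variance and keep the top k.
# A calibrated variance threshold would replace the fixed cardinality k.
import numpy as np

def diagonal_thresholding_support(X, k):
    """Return the k coordinates of the data matrix X (n x p) with the
    largest sample variances, as a support-set estimate."""
    variances = X.var(axis=0)
    return np.sort(np.argsort(variances)[-k:])
```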
Journal Article · DOI

Finite sample approximation results for principal component analysis: a matrix perturbation approach

TL;DR: This paper presents a matrix perturbation view of the "phase transition phenomenon" and a simple linear-algebra-based derivation of the asymptotic eigenvalue and eigenvector overlap in finite-sample PCA.
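For reference, the phase transition in question takes the following form in the standard single-spike model (stated from memory of the spiked-covariance literature as a reference point, not quoted from this paper): with spike strength ℓ > 1 and aspect ratio p/n → γ,

```latex
% Standard spiked-model limits (single spike of strength \ell > 1, p/n -> \gamma);
% quoted from memory of the spiked-covariance literature, not from this paper.
\[
  \hat\lambda_1 \;\to\;
  \begin{cases}
    \ell + \dfrac{\gamma \ell}{\ell - 1}, & \ell > 1 + \sqrt{\gamma},\\[1ex]
    (1 + \sqrt{\gamma})^2, & \ell \le 1 + \sqrt{\gamma},
  \end{cases}
  \qquad
  |\langle \hat v_1, v_1 \rangle|^2 \;\to\;
  \begin{cases}
    \dfrac{1 - \gamma/(\ell - 1)^2}{1 + \gamma/(\ell - 1)}, & \ell > 1 + \sqrt{\gamma},\\[1ex]
    0, & \ell \le 1 + \sqrt{\gamma}.
  \end{cases}
\]
```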