Minimax Rates of Estimation for High-Dimensional Linear Regression Over $\ell_q$-Balls
TLDR
In this paper, the authors study the minimax rates of convergence for estimating $\beta^*$ in either $\ell_2$-loss or $\ell_2$-prediction loss, assuming that $\beta^*$ belongs to an $\ell_q$-ball $\mathbb{B}_q(R_q)$ for some $q \in [0, 1]$.
Abstract
Consider the high-dimensional linear regression model $y = X\beta^* + w$, where $y \in \mathbb{R}^n$ is an observation vector, $X \in \mathbb{R}^{n \times d}$ is a design matrix with $d > n$, $\beta^* \in \mathbb{R}^d$ is an unknown regression vector, and $w \sim N(0, \sigma^2 I)$ is additive Gaussian noise. This paper studies the minimax rates of convergence for estimating $\beta^*$ in either $\ell_2$-loss or $\ell_2$-prediction loss, assuming that $\beta^*$ belongs to an $\ell_q$-ball $\mathbb{B}_q(R_q)$ for some $q \in [0, 1]$. It is shown that under suitable regularity conditions on the design matrix $X$, the minimax optimal rate in both $\ell_2$-loss and $\ell_2$-prediction loss scales as $\Theta\bigl(R_q (\log d / n)^{1 - q/2}\bigr)$. The analysis reveals that conditions on the design matrix $X$ enter into the rates for $\ell_2$-error and $\ell_2$-prediction error in complementary ways in the upper and lower bounds. The proofs of the lower bounds are information-theoretic in nature, based on Fano's inequality and results on the metric entropy of the balls $\mathbb{B}_q(R_q)$, whereas the proofs of the upper bounds are constructive, involving direct analysis of least squares over $\ell_q$-balls. For the special case $q = 0$, corresponding to models with an exact sparsity constraint, the results show that although computationally efficient $\ell_1$-based methods can achieve the minimax rates up to constant factors, they require slightly stronger assumptions on the design matrix $X$ than optimal algorithms involving least squares over the $\ell_0$-ball.
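The rate claimed in the abstract can be written in display form; the $\ell_q$-ball below carries its standard definition (for $q = 0$, the sum counts the nonzero entries of $\beta$):

```latex
\inf_{\hat{\beta}} \; \sup_{\beta^* \in \mathbb{B}_q(R_q)}
  \mathbb{E}\,\|\hat{\beta} - \beta^*\|_2^2
  \;\asymp\; R_q \left( \frac{\log d}{n} \right)^{1 - q/2},
\qquad
\mathbb{B}_q(R_q) \;=\; \Bigl\{ \beta \in \mathbb{R}^d : \textstyle\sum_{j=1}^{d} |\beta_j|^q \le R_q \Bigr\}.
```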
Citations
Proceedings Article
A unified framework for high-dimensional analysis of M-estimators with decomposable regularizers
TL;DR: A unified framework for establishing consistency and convergence rates for regularized M-estimators under high-dimensional scaling is provided; one main theorem is stated, and it is shown how it can be used to re-derive several existing results and to obtain several new ones.
Journal ArticleDOI
A Unified Framework for High-Dimensional Analysis of $M$-Estimators with Decomposable Regularizers
TL;DR: In this paper, a unified framework for establishing consistency and convergence rates for regularized M$-estimators under high-dimensional scaling was provided, which can be used to re-derive some existing results.
Book
High-Dimensional Statistics: A Non-Asymptotic Viewpoint
TL;DR: This book provides a self-contained introduction to the area of high-dimensional statistics, aimed at the first-year graduate level, and includes chapters focused on core methodology and theory, including tail bounds, concentration inequalities, uniform laws and empirical processes, and random matrices.
Journal ArticleDOI
Restricted Eigenvalue Properties for Correlated Gaussian Designs
TL;DR: This paper proves directly that the restricted nullspace and eigenvalue conditions hold with high probability for quite general classes of Gaussian matrices whose predictors may be highly dependent, and for which restricted isometry conditions can therefore be violated with high probability.
Journal ArticleDOI
Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions
TL;DR: In this paper, a class of estimators based on convex relaxation for solving high-dimensional matrix decomposition problems is analyzed, and optimal rates in high dimensions are established.
References
Book
Elements of information theory
Thomas M. Cover, Joy A. Thomas +1 more
TL;DR: The authors examine the role of entropy, inequality, and randomness in the design and construction of codes.
Journal ArticleDOI
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute values of the coefficients being less than a constant, is proposed.
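As a concrete illustration (not code from either paper), the lasso objective described above can be minimized with a short proximal-gradient (ISTA) loop. The synthetic instance, the variable names, and the $\sqrt{\log d / n}$ choice of regularization level, echoing the scaling in the abstract, are all illustrative assumptions:

```python
import numpy as np

def soft_threshold(x, t):
    """Proximal operator of t * ||.||_1 (coordinatewise soft thresholding)."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def lasso_ista(X, y, lam, n_iter=500):
    """Minimize (1/2n)||y - Xb||^2 + lam * ||b||_1 by proximal gradient (ISTA)."""
    n, d = X.shape
    step = n / np.linalg.norm(X, 2) ** 2   # 1 / Lipschitz constant of the gradient
    b = np.zeros(d)
    for _ in range(n_iter):
        grad = X.T @ (X @ b - y) / n       # gradient of the quadratic part
        b = soft_threshold(b - step * grad, step * lam)
    return b

# Synthetic hard-sparse instance (q = 0): d > n, five nonzero coefficients.
rng = np.random.default_rng(0)
n, d, sigma = 100, 200, 0.1
beta_star = np.zeros(d)
beta_star[:5] = 1.0
X = rng.standard_normal((n, d))
y = X @ beta_star + sigma * rng.standard_normal(n)
lam = 2 * sigma * np.sqrt(np.log(d) / n)   # sqrt(log d / n) scaling from the theory
beta_hat = lasso_ista(X, y, lam)
```

On such well-conditioned Gaussian designs the restricted eigenvalue conditions discussed below hold with high probability, and the estimate recovers `beta_star` up to a small error.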
Journal ArticleDOI
Atomic Decomposition by Basis Pursuit
TL;DR: Basis Pursuit (BP) is a principle for decomposing a signal into an "optimal" superposition of dictionary elements, where optimal means having the smallest l1 norm of coefficients among all such decompositions.
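The noiseless basis pursuit principle summarized above has a standard formulation (notation mine: $s$ is the signal, $\Phi$ the dictionary, $\alpha$ the coefficient vector):

```latex
\min_{\alpha} \; \|\alpha\|_1
\quad \text{subject to} \quad \Phi \alpha = s.
```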
Journal ArticleDOI
High-dimensional graphs and variable selection with the Lasso
TL;DR: It is shown that neighborhood selection with the Lasso is a computationally attractive alternative to standard covariance selection for sparse high-dimensional graphs and is hence equivalent to variable selection for Gaussian linear models.
Journal ArticleDOI
The Dantzig selector: Statistical estimation when p is much larger than n
Emmanuel J. Candès, Terence Tao +1 more
TL;DR: In many important statistical applications, the number of variables or parameters p is much larger than the total number of observations n; the authors show that it is nevertheless possible to estimate β reliably based on the noisy data y.
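For context, the Dantzig selector of the cited paper is the linear program below (stated here from the original reference, in the notation of the abstract above); $\lambda$ is a tuning parameter, typically chosen proportional to $\sqrt{\log d}$:

```latex
\hat{\beta} \;=\; \arg\min_{\beta \in \mathbb{R}^d} \; \|\beta\|_1
\quad \text{subject to} \quad
\bigl\| X^\top (y - X\beta) \bigr\|_\infty \;\le\; \lambda \,\sigma.
```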
Related Papers (5)
The Dantzig selector: Statistical estimation when p is much larger than n
Emmanuel J. Candès, Terence Tao +1 more
Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Jianqing Fan, Runze Li +1 more