False Discoveries Occur Early on the Lasso Path
TLDR
It is demonstrated that true features and null features are always interspersed on the Lasso path, and that this phenomenon occurs no matter how strong the effect sizes are.

Abstract
In regression settings where explanatory variables have very low correlations and there are relatively few effects, each of large magnitude, we expect the Lasso to find the important variables with few errors, if any. This paper shows that in a regime of linear sparsity—meaning that the fraction of variables with a nonvanishing effect tends to a constant, however small—this cannot really be the case, even when the design variables are stochastically independent. We demonstrate that true features and null features are always interspersed on the Lasso path, and that this phenomenon occurs no matter how strong the effect sizes are. We derive a sharp asymptotic trade-off between false and true positive rates or, equivalently, between measures of type I and type II errors along the Lasso path. This trade-off states that if we ever want to achieve a type II error (false negative rate) under a critical value, then anywhere on the Lasso path the type I error (false positive rate) will need to exceed a given threshold so that we can never have both errors at a low level at the same time. Our analysis uses tools from approximate message passing (AMP) theory as well as novel elements to deal with a possibly adaptive selection of the Lasso regularizing parameter.
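The trade-off is easy to see in a toy simulation (my own sketch, not the authors' code): an i.i.d. Gaussian design with linear sparsity and strong, equal effect sizes, with the Lasso fit by plain cyclic coordinate descent over a decreasing penalty grid, tracking the true positive and false discovery proportions along the path.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, k = 120, 200, 20            # k/p = 0.1: linear sparsity
X = rng.standard_normal((n, p)) / np.sqrt(n)
beta = np.zeros(p)
beta[:k] = 8.0                    # strong, equal effect sizes
y = X @ beta + rng.standard_normal(n)

def lasso_cd(X, y, lam, b, n_iter=60):
    """Cyclic coordinate descent for (1/2)||y - Xb||^2 + lam * ||b||_1."""
    col_sq = (X ** 2).sum(axis=0)
    r = y - X @ b
    for _ in range(n_iter):
        for j in range(X.shape[1]):
            r += X[:, j] * b[j]                  # remove j's contribution
            rho = X[:, j] @ r
            b[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            r -= X[:, j] * b[j]                  # add it back
    return b

b = np.zeros(p)
for lam in np.geomspace(2.0, 0.05, 15):          # walk down the Lasso path
    b = lasso_cd(X, y, lam, b)                   # warm start from previous lam
    sel = np.flatnonzero(np.abs(b) > 1e-8)
    tpp = np.mean(np.abs(b[:k]) > 1e-8)          # true positive proportion
    fdp = np.mean(sel >= k) if sel.size else 0.0 # false discovery proportion
    print(f"lambda={lam:6.3f}  selected={sel.size:3d}  TPP={tpp:.2f}  FDP={fdp:.2f}")
```

In runs like this one, null variables typically enter the model before the path ends even though every true effect is large, illustrating the interspersing phenomenon the paper proves.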
Citations
Posted Content
Subset Selection with Shrinkage: Sparse Linear Modeling when the SNR is low
TL;DR: This work proposes a close cousin of best subsets, namely its $\ell_{q}$-regularized version, which mitigates, to a large extent, the poor predictive performance of the best-subset selection procedure in low SNR regimes, and which performs favorably and generally delivers a substantially sparser model when compared to the best predictive models available via ridge regression and the Lasso.
Journal Article
Discovery of Physics From Data: Universal Laws and Discrepancies.
TL;DR: It is shown that measurement noise and complex secondary physical mechanisms, like unsteady fluid drag forces, can obscure the underlying law of gravitation, leading to an erroneous model.
Posted Content
Approximate Message Passing algorithms for rotationally invariant matrices.
Zhou Fan,Nishant Tripathi +1 more
TL;DR: It is shown that a Bayes-AMP algorithm for principal components analysis, when there is prior structure for the principal components (PCs) and possibly non-white noise, provably achieves higher estimation accuracy than the sample PCs.
Journal Article
Familywise error rate control via knockoffs
Lucas Janson,Weijie J. Su +1 more
TL;DR: In this article, the authors present a novel method for controlling the $k$-familywise error rate in the linear regression setting using the knockoffs framework first introduced by Barber and Candès. The method can be applied with any design matrix with at least as many observations as variables and does not require knowing the noise variance.
Journal Article
Model selection for hybrid dynamical systems via sparse regression
TL;DR: A new methodology is developed, Hybrid-Sparse Identification of Nonlinear Dynamics, which identifies separate nonlinear dynamical regimes, employs information theory to manage uncertainty and characterizes switching behaviour.
References
Journal Article
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
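In symbols, the constrained form described in that summary is

```latex
\hat{\beta} \;=\; \operatorname*{arg\,min}_{\beta}\; \sum_{i=1}^{n}\Bigl(y_i - \sum_{j=1}^{p} x_{ij}\beta_j\Bigr)^{2}
\quad \text{subject to} \quad \sum_{j=1}^{p} |\beta_j| \le t,
```

where $t \ge 0$ is the tuning constant; the equivalent penalized form adds $\lambda \sum_j |\beta_j|$ to the residual sum of squares.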
Journal Article
Regularization and variable selection via the elastic net
Hui Zou,Trevor Hastie +1 more
TL;DR: It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lasso.
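For reference, the elastic net criterion combines the two penalties (this is the standard naive form, not quoted from the paper):

```latex
\hat{\beta} \;=\; \operatorname*{arg\,min}_{\beta}\; \|y - X\beta\|_2^{2} \;+\; \lambda_1 \|\beta\|_1 \;+\; \lambda_2 \|\beta\|_2^{2},
```

where the $\ell_1$ term induces lasso-like sparsity and the $\ell_2$ term stabilizes selection among correlated predictors; Zou and Hastie rescale this naive solution to correct for the double shrinkage.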
Journal Article
Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Jianqing Fan,Runze Li +1 more
TL;DR: In this article, penalized likelihood approaches are proposed to handle variable selection problems, and it is shown that the newly proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well as if the correct submodel were known.
Journal Article
Least angle regression
Bradley Efron,Trevor Hastie,Iain M. Johnstone,Robert Tibshirani,Hemant Ishwaran,Keith Knight,Jean-Michel Loubes,Pascal Massart,David Madigan,Greg Ridgeway,Saharon Rosset,Ji Zhu,Robert A. Stine,Berwin A. Turlach,Sanford Weisberg +19 more
TL;DR: A publicly available algorithm that requires only the same order of magnitude of computational effort as ordinary least squares applied to the full set of covariates is described.
Journal Article
Model selection and estimation in regression with grouped variables
Ming Yuan,Yi Lin +1 more
TL;DR: In this paper, instead of selecting factors by stepwise backward elimination, the authors focus on the accuracy of estimation and consider extensions of the lasso, the LARS algorithm and the non-negative garrotte for factor selection.