False Discoveries Occur Early on the Lasso Path

doi:10.1214/16-AOS1521

Open AccessJournal ArticleDOI

False Discoveries Occur Early on the Lasso Path

Weijie J. Su, +2 more

- 01 Oct 2017 -

Annals of Statistics

- Vol. 45, Iss: 5, pp 2133-2150

Chats0

TLDR

It is demonstrated that true features and null features are always interspersed on the Lasso path, and that this phenomenon occurs no matter how strong the effect sizes are.

Abstract:

In regression settings where explanatory variables have very low correlations and there are relatively few effects, each of large magnitude, we expect the Lasso to find the important variables with few errors, if any. This paper shows that in a regime of linear sparsity—meaning that the fraction of variables with a nonvanishing effect tends to a constant, however small—this cannot really be the case, even when the design variables are stochastically independent. We demonstrate that true features and null features are always interspersed on the Lasso path, and that this phenomenon occurs no matter how strong the effect sizes are. We derive a sharp asymptotic trade-off between false and true positive rates or, equivalently, between measures of type I and type II errors along the Lasso path. This trade-off states that if we ever want to achieve a type II error (false negative rate) under a critical value, then anywhere on the Lasso path the type I error (false positive rate) will need to exceed a given threshold so that we can never have both errors at a low level at the same time. Our analysis uses tools from approximate message passing (AMP) theory as well as novel elements to deal with a possibly adaptive selection of the Lasso regularizing parameter.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Sparse regression for plasma physics

Alan A. Kaptanoglu, +3 more

- 01 Mar 2023 -

Physics of Plasmas

TL;DR: In this article , the authors illustrate some of the important ways in which sparse regression appears in plasma physics and point out recent contributions and remaining challenges to solving these problems in this field, and a brief review is provided for the optimization problem and state-of-the-art solvers, especially for constrained and high-dimensional sparse regression.

...read moreread less

Posted Content

DebiNet: Debiasing Linear Models with Nonlinear Overparameterized Neural Networks

Shiyun Xu, +1 more

- 01 Nov 2020 -

arXiv: Machine Learning

TL;DR: This paper incorporates over-parameterized neural networks into semi-parametric models to bridge the gap between inference and prediction, especially in the high dimensional linear problem.

...read moreread less

Journal ArticleDOI

Data-based autonomously discovering method for nonlinear aerodynamic force of quasi-flat plate

Teng Ma, +5 more

- 12 Jan 2023 -

Physics of fluids

TL;DR: In this paper , a group sparse regression method is used to reveal the nonlinear mapping aerodynamics relationship between motion and force from data, and the aeroelastic force function discovered by this method balances modeling accuracy and simplicity.

...read moreread less

Posted Content

The False Positive Control Lasso.

Erik Drysdale, +4 more

- 29 Mar 2019 -

arXiv: Machine Learning

TL;DR: An existing model (the SQRT-Lasso) can be recast as a method of controlling the number of expected false positives, how a similar estimator can be used for all other generalized linear model classes, and this approach can be fit with existing fast Lasso optimization solvers.

...read moreread less

DOI

A unified view of high-dimensional bridge regression

Haolei Weng

TL;DR: A unified view of high-dimensional bridge regression is presented that combines the results obtained in [Bouchut-Boyaval, M3AS (23) 2013] and [M2AS (24) 2013], which show clear trends in both the horizontal and vertical dimensions of the model.

...read moreread less

Citations

Sparse regression for plasma physics

DebiNet: Debiasing Linear Models with Nonlinear Overparameterized Neural Networks

Data-based autonomously discovering method for nonlinear aerodynamic force of quasi-flat plate

The False Positive Control Lasso.

A unified view of high-dimensional bridge regression

References

Regression Shrinkage and Selection via the Lasso

Regularization and variable selection via the elastic net

Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties

Least angle regression

Model selection and estimation in regression with grouped variables

Related Papers (5)

Regression Shrinkage and Selection via the Lasso

Regularization and variable selection via the elastic net

Regularization Paths for Generalized Linear Models via Coordinate Descent

The adaptive lasso and its oracle properties

On Model Selection Consistency of Lasso