Journal ArticleDOI
Variable Selection With the Strong Heredity Constraint and Its Oracle Property
Nam Hee Choi,William Li,Ji Zhu +2 more
TLDR
Numerical results indicate that the proposed method tends to remove irrelevant variables more effectively and provide better prediction performance than previous work, while automatically enforcing the heredity constraint.
Abstract
In this paper, we extend the LASSO method (Tibshirani 1996) for simultaneously fitting a regression model and identifying important interaction terms. Unlike most existing variable selection methods, our method automatically enforces the heredity constraint; that is, an interaction term can be included in the model only if the corresponding main terms are also included. Furthermore, we extend our method to generalized linear models and show that it performs as well as if the true model were given in advance, that is, the oracle property as in Fan and Li (2001) and Fan and Peng (2004). The proof of the oracle property is given in online supplemental materials. Numerical results on both simulation and real data indicate that our method tends to remove irrelevant variables more effectively and provide better prediction performance than previous work (Yuan, Joseph, and Lin 2007; Zhao, Rocha, and Yu 2009), as well as the classical LASSO method.
Citations
Journal ArticleDOI
A lasso for hierarchical interactions
TL;DR: A precise characterization of the effect of the hierarchy constraint is given; a bound on the estimate reveals the amount of fitting "saved" by the constraint, and hierarchy is proved to hold with probability one.
Journal ArticleDOI
Variable Selection Using Adaptive Nonlinear Interaction Structures in High Dimensions
Peter Radchenko,Gareth M. James +1 more
TL;DR: This work introduces a new approach, "Variable selection using Adaptive Nonlinear Interaction Structures in High dimensions" (VANISH), which is based on a penalized least squares criterion and is designed for high-dimensional nonlinear problems; the results suggest that VANISH should outperform certain natural competitors when the true interaction structure is sufficiently sparse.
Journal ArticleDOI
Structured variable selection and estimation
TL;DR: This paper proposes non-negative garrote methods that can naturally incorporate relationships among predictors defined through effect heredity or marginality principles, and shows that the methods are easy to compute and enjoy nice theoretical properties.
Journal ArticleDOI
Interaction Screening for Ultrahigh-Dimensional Data
Ning Hao,Hao Helen Zhang +1 more
TL;DR: Forward-selection-based procedures called iFOR are proposed to identify interaction effects in a greedy forward fashion while maintaining the natural hierarchical model structure; theoretically, the iFOR algorithms are proved to possess the sure screening property in ultrahigh-dimensional settings.
References
Journal ArticleDOI
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
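The constrained formulation in this TL;DR is equivalent to the more familiar penalized (Lagrangian) form, which is what most software solves. A minimal sketch of the lasso's characteristic sparsity, using scikit-learn and toy data as illustrative assumptions:

```python
import numpy as np
from sklearn.linear_model import Lasso, LinearRegression

# Toy data: ten predictors, only the first two truly matter.
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 10))
beta = np.array([3.0, -2.0] + [0.0] * 8)
y = X @ beta + rng.normal(scale=0.5, size=100)

ols = LinearRegression().fit(X, y)
lasso = Lasso(alpha=0.2).fit(X, y)  # penalized form of the lasso

print(np.count_nonzero(np.abs(ols.coef_) > 1e-8))    # OLS keeps all ten
print(np.count_nonzero(np.abs(lasso.coef_) > 1e-8))  # lasso zeroes most noise terms
```

Unlike ridge regression, the ℓ1 penalty sets coefficients exactly to zero, which is what makes the lasso a variable selection method rather than only a shrinkage method.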
Book
Generalized Linear Models
Peter McCullagh,John A. Nelder +1 more
TL;DR: In this book, a generalization of the analysis of variance is given for these models using log-likelihoods, illustrated by examples relating to four distributions: the Normal, Binomial (probit analysis, etc.), Poisson (contingency tables), and gamma (variance components).
Journal ArticleDOI
Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Jianqing Fan,Runze Li +1 more
TL;DR: In this article, penalized likelihood approaches are proposed to handle variable selection problems, and it is shown that the newly proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well as if the correct submodel were known in advance.
Journal ArticleDOI
Least angle regression
Bradley Efron,Trevor Hastie,Iain M. Johnstone,Robert Tibshirani,Hemant Ishwaran,Keith Knight,Jean-Michel Loubes,Pascal Massart,David Madigan,Greg Ridgeway,Saharon Rosset,Ji Zhu,Robert A. Stine,Berwin A. Turlach,Sanford Weisberg +19 more
TL;DR: A publicly available algorithm that requires only the same order of magnitude of computational effort as ordinary least squares applied to the full set of covariates is described.
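The computational claim above can be seen in practice: a single LARS pass yields the entire piecewise-linear lasso coefficient path. The sketch below uses scikit-learn's `lars_path` and toy data as illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import lars_path

# Toy data: six predictors, only x0 and x3 carry signal.
rng = np.random.default_rng(3)
X = rng.normal(size=(80, 6))
y = 2 * X[:, 0] - 1.5 * X[:, 3] + rng.normal(scale=0.3, size=80)

# One LARS pass returns the full piecewise-linear lasso coefficient path:
# the penalty values at each breakpoint, the order in which variables
# become active, and the coefficients along the path.
alphas, active, coefs = lars_path(X, y, method="lasso")
print(active[:2])  # the order in which the first variables enter the model
```

Because the path is piecewise linear with one breakpoint per step, computing all of it costs roughly the same as a single ordinary least squares fit on the full design.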
Journal ArticleDOI
The adaptive lasso and its oracle properties
TL;DR: A new version of the lasso is proposed, called the adaptive lasso, in which adaptive weights are used for penalizing different coefficients in the ℓ1 penalty; the nonnegative garrote is also shown to be consistent for variable selection.
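The adaptive lasso's weighted ℓ1 penalty can be solved with ordinary lasso software via a column-rescaling substitution. In the sketch below, the use of scikit-learn, the toy data, and OLS pilot weights w_j = 1/|β̂_j| (the γ = 1 case) are all illustrative assumptions, not a definitive implementation.

```python
import numpy as np
from sklearn.linear_model import Lasso, LinearRegression

# Toy data: eight predictors, only the first two truly matter.
rng = np.random.default_rng(2)
X = rng.normal(size=(150, 8))
beta = np.array([4.0, -3.0] + [0.0] * 6)
y = X @ beta + rng.normal(scale=0.5, size=150)

# Step 1: a pilot OLS fit gives data-driven weights w_j = 1 / |beta_ols_j|,
# so coefficients that look small get penalized more heavily.
pilot = LinearRegression().fit(X, y)
w = 1.0 / np.abs(pilot.coef_)

# Step 2: substituting theta_j = w_j * beta_j turns the weighted l1 penalty
# into a plain lasso on rescaled columns X_j / w_j; map coefficients back.
lasso = Lasso(alpha=0.1).fit(X / w, y)
beta_hat = lasso.coef_ / w

selected_idx = np.nonzero(np.abs(beta_hat) > 1e-8)[0]
print(selected_idx)  # indices of the selected variables
```

The data-driven weights are what restore the oracle property: large pilot coefficients receive light penalties (little bias), while small ones receive heavy penalties and are reliably zeroed out.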