Tuning parameter selection in high dimensional penalized likelihood
Yingying Fan, Cheng Yong Tang +1 more
TLDR
In this article, the authors proposed to select the tuning parameter by optimizing the generalized information criterion with an appropriate model complexity penalty, which diverges at the rate of some power of log(p) depending on the tail probability behaviour of the response variables.
Abstract
Summary
Determining how to select the tuning parameter appropriately is essential in penalized likelihood methods for high dimensional data analysis. We examine this problem in the setting of penalized likelihood methods for generalized linear models, where the dimensionality of covariates p is allowed to increase exponentially with the sample size n. We propose to select the tuning parameter by optimizing the generalized information criterion with an appropriate model complexity penalty. To ensure that we consistently identify the true model, a range for the model complexity penalty is identified in the generalized information criterion. We find that this model complexity penalty should diverge at the rate of some power of log(p) depending on the tail probability behaviour of the response variables. This reveals that using the Akaike information criterion or Bayes information criterion to select the tuning parameter may not be adequate for consistently identifying the true model. On the basis of our theoretical study, we propose a uniform choice of the model complexity penalty and show that the approach proposed consistently identifies the true model among candidate models with asymptotic probability 1. We justify the performance of the procedure proposed by numerical simulations and a gene expression data analysis.
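The selection rule described in the abstract can be sketched in code. This is a hedged illustration, not the paper's exact procedure: it assumes a sparse logistic regression fitted with scikit-learn's L1-penalized `LogisticRegression`, and uses `a_n = log(log n) * log(p)` as one model complexity penalty consistent with the log(p)-rate requirement stated above. All variable names and the simulated data are for illustration only.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Simulated sparse logistic model: only the first 3 covariates matter.
rng = np.random.default_rng(0)
n, p = 200, 50
X = rng.standard_normal((n, p))
beta = np.zeros(p)
beta[:3] = [2.0, -1.5, 1.0]
y = rng.binomial(1, 1 / (1 + np.exp(-(X @ beta))))

# Model complexity penalty diverging like a power of log(p) (illustrative choice).
a_n = np.log(np.log(n)) * np.log(p)

best = None
for C in np.logspace(-2, 1, 20):  # C = 1/lambda in sklearn's parametrisation
    fit = LogisticRegression(penalty="l1", solver="liblinear", C=C).fit(X, y)
    coef = fit.coef_.ravel()
    df = np.count_nonzero(coef)  # model size of the selected submodel
    eta = X @ coef + fit.intercept_[0]
    loglik = np.sum(y * eta - np.logaddexp(0.0, eta))  # logistic log-likelihood
    gic = -2.0 * loglik + a_n * df  # generalized information criterion
    if best is None or gic < best[0]:
        best = (gic, C, df)
```

The tuning parameter (here `C`) minimizing the criterion is retained; a heavier penalty `a_n` than AIC's constant 2 or BIC's log(n) is what drives out spurious covariates when p grows exponentially with n.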
Citations
Journal ArticleDOI
Cross-Fitted Residual Regression for High-Dimensional Heteroscedasticity Pursuit
Journal ArticleDOI
A comparative machine learning approach for entropy-based damage detection using output-only correlation signal
Journal ArticleDOI
Logical and test consistency in pairwise multiple comparisons
TL;DR: The authors reformulated pairwise comparisons as penalized likelihood estimation problems based on L1 penalties on the pairwise differences and showed that, under appropriate conditions, no true nulls are rejected and all alternatives are rejected with probability tending to one.
Journal ArticleDOI
Sequential Scaled Sparse Factor Regression
TL;DR: In this article, the authors consider large-scale association analysis between multivariate responses and predictors, which is of great practical importance, as exemplified by modern business applications including social media marketing and online advertising.
Dissertation
Latent Class Models: Design and Diagnosis
References
Journal ArticleDOI
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute values of the coefficients being less than a constant, is proposed.
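The constrained formulation above is equivalent to adding an L1 penalty to the least-squares objective. A minimal sketch using scikit-learn's `Lasso` (the `alpha` value, data, and variable names here are illustrative assumptions, not from the cited paper):

```python
import numpy as np
from sklearn.linear_model import Lasso

# Simulated sparse linear model: only the first 2 covariates matter.
rng = np.random.default_rng(1)
n, p = 100, 20
X = rng.standard_normal((n, p))
beta = np.zeros(p)
beta[:2] = [3.0, -2.0]
y = X @ beta + 0.5 * rng.standard_normal(n)

# alpha is the L1 penalty weight; larger alpha shrinks more coefficients to exactly zero.
fit = Lasso(alpha=0.1).fit(X, y)
support = np.flatnonzero(fit.coef_)  # indices of the selected covariates
```

Because the L1 penalty zeroes out coefficients exactly, the lasso performs estimation and variable selection simultaneously; the penalty weight is precisely the tuning parameter whose selection the main paper studies.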
Journal ArticleDOI
Estimating the Dimension of a Model
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
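The criterion derived from that Bayes solution is the familiar BIC, -2·loglik + log(n)·df. As a small illustrative sketch (the polynomial-degree example and Gaussian profile likelihood are assumptions for demonstration, not from the cited paper):

```python
import numpy as np

# Simulated data from a linear (degree-1) model.
rng = np.random.default_rng(2)
n = 300
x = rng.uniform(-1, 1, n)
y = 1.0 + 2.0 * x + 0.3 * rng.standard_normal(n)

def gaussian_bic(y, yhat, df, n):
    # Gaussian profile log-likelihood up to constants: -2*loglik = n*log(RSS/n).
    rss = np.sum((y - yhat) ** 2)
    return n * np.log(rss / n) + np.log(n) * df

# Score polynomial fits of increasing degree; df counts the fitted coefficients.
scores = {}
for degree in range(5):
    coefs = np.polyfit(x, y, degree)
    scores[degree] = gaussian_bic(y, np.polyval(coefs, x), degree + 1, n)
best_degree = min(scores, key=scores.get)
```

The log(n)·df penalty typically recovers the true degree here; the main paper's point is that in high dimensions even this penalty can be too light, and a penalty growing with log(p) is needed.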
Book
Generalized Linear Models
Peter McCullagh, John A. Nelder +1 more
TL;DR: In this paper, a generalization of the analysis of variance is given for these models using log-likelihoods, illustrated by examples relating to four distributions; the normal, binomial (probit analysis, etc.), Poisson (contingency tables), and gamma (variance components).
Book
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
Book ChapterDOI
Information Theory and an Extension of the Maximum Likelihood Principle
TL;DR: In this paper, it is shown that the classical maximum likelihood principle can be considered to be a method of asymptotic realization of an optimum estimate with respect to a very general information theoretic criterion.