scispace - formally typeset
Open AccessJournal ArticleDOI

Penalized Composite Quasi-Likelihood for Ultrahigh-Dimensional Variable Selection

Reads0
Chats0
TLDR
A data‐driven weighted linear combination of convex loss functions, together with weighted L1‐penalty is proposed and established a strong oracle property of the method proposed that has both the model selection consistency and estimation efficiency for the true non‐zero coefficients.
Abstract
In high-dimensional model selection problems, penalized least-square approaches have been extensively used. This paper addresses the question of both robustness and efficiency of penalized model selection methods, and proposes a data-driven weighted linear combination of convex loss functions, together with weighted L1-penalty. It is completely data-adaptive and does not require prior knowledge of the error distribution. The weighted L1-penalty is used both to ensure the convexity of the penalty term and to ameliorate the bias caused by the L1-penalty. In the setting with dimensionality much larger than the sample size, we establish a strong oracle property of the proposed method that possesses both the model selection consistency and estimation efficiency for the true non-zero coefficients. As specific examples, we introduce a robust method of composite L1-L2, and optimal composite quantile method and evaluate their performance in both simulated and real data examples.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal Article

A Selective Overview of Variable Selection in High Dimensional Feature Space.

TL;DR: In this paper, a brief account of the recent developments of theory, methods, and implementations for high-dimensional variable selection is presented, with emphasis on independence screening and two-scale methods.
Journal ArticleDOI

New efficient estimation and variable selection methods for semiparametric varying-coefficient partially linear models

TL;DR: This work proposes adaptive penalization methods for variable selection in the semiparametric varying-coefficient partially linear model and proves that the methods possess the oracle property.
Journal ArticleDOI

Sparse High-Dimensional Models in Economics

TL;DR: This paper reviews the literature on sparse high dimensional models and discusses some applications in economics and finance, including variable selection methods that are proved to be effective in high dimensional sparse modeling.
Journal ArticleDOI

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

TL;DR: In this article, a principal factor approximation (PFA) based method was proposed to solve the problem of false discovery control in large-scale multiple hypothesis testing, where a common threshold is used and a consistent estimate of realized FDP is provided.
Posted Content

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

TL;DR: An approximate expression for false discovery proportion (FDP) in large-scale multiple testing when a common threshold is used and a consistent estimate of realized FDP is provided, which has important applications in controlling false discovery rate and FDP.
References
More filters
Journal ArticleDOI

Smoothly Clipped Absolute Deviation on High Dimensions

TL;DR: An efficient optimization algorithm is developed that is fast and always converges to a local minimum and it is proved that the SCAD estimator still has the oracle property on high-dimensional problems.
Journal ArticleDOI

SCAD-penalized regression in high-dimensional partially linear models

TL;DR: In this paper, the authors consider the problem of simultaneous variable selection and estimation in partially linear models with a divergent number of covariates in the linear part, under the assumption that the vector of regression coefficients is sparse.
Journal ArticleDOI

Gene expression variation and expression quantitative trait mapping of human chromosome 21 genes

TL;DR: The results of this study provide a representative view of expression variation of chromosome 21 genes, identify loci involved in their regulation and suggest that genes, for which expression differences are significantly larger than 1.5-fold in control samples, are unlikely to be involved in DS-phenotypes present in all affected individuals.

M-estimation of multivariate linear regression parameters under a convex discrepancy function

TL;DR: In this article, a general theory of M-estimation is developed using a convex discrepancy function under what appear to be a necessary set of assumptions to develop a satisfactory asymptotic theory.
Related Papers (5)