Confidence intervals for low dimensional parameters in high dimensional linear models

doi:10.1111/RSSB.12026

Open AccessJournal ArticleDOI

Confidence intervals for low dimensional parameters in high dimensional linear models

Cun-Hui Zhang, +1 more

- 01 Jan 2014 -

Journal of The Royal Statistical Society...

- Vol. 76, Iss: 1, pp 217-242

TLDR

In this article, the authors proposed a method to construct confidence intervals for individual coefficients and linear combinations of several of them in a linear regression model by turning the regression data into an approximate Gaussian sequence of point estimators of individual regression coefficients.

Abstract:

Summary The purpose of this paper is to propose methodologies for statistical inference of low dimensional parameters with high dimensional data. We focus on constructing confidence intervals for individual coefficients and linear combinations of several of them in a linear regression model, although our ideas are applicable in a much broader context. The theoretical results that are presented provide sufficient conditions for the asymptotic normality of the proposed estimators along with a consistent estimator for their finite dimensional covariance matrices. These sufficient conditions allow the number of variables to exceed the sample size and the presence of many small non-zero coefficients. Our methods and theory apply to interval estimation of a preconceived regression coefficient or contrast as well as simultaneous interval estimation of many regression coefficients. Moreover, the method proposed turns the regression data into an approximate Gaussian sequence of point estimators of individual regression coefficients, which can be used to select variables after proper thresholding. The simulation results that are presented demonstrate the accuracy of the coverage probability of the confidence intervals proposed as well as other desirable properties, strongly supporting the theoretical results.

Citations

PDF

Open Access

More filters

ReportDOI

Double/debiased machine learning for treatment and structural parameters

Victor Chernozhukov, +6 more

- 02 Jun 2017 -

Econometrics Journal

TL;DR: In this article, the authors show that the impact of regularization bias and overfitting on estimation of the parameter of interest θ0 can be removed by using two simple, yet critical, ingredients: (1) using Neyman-orthogonal moments/scores that have reduced sensitivity with respect to nuisance parameters, and (2) making use of cross-fitting, which provides an efficient form of data-splitting.

...read moreread less

Journal ArticleDOI

On asymptotically optimal confidence regions and tests for high-dimensional models

Sara van de Geer, +3 more

- 03 Mar 2013 -

arXiv: Statistics Theory

TL;DR: A general method for constructing confidence intervals and statistical tests for single or low-dimensional components of a large parameter vector in a high-dimensional model and develops the corresponding theory which includes a careful analysis for Gaussian, sub-Gaussian and bounded correlated designs.

...read moreread less

Journal Article

Confidence intervals and hypothesis testing for high-dimensional regression

Adel Javanmard, +1 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: In this paper, a de-biased version of regularized M-estimators is proposed to construct confidence intervals and p-values for high-dimensional linear regression models, and the resulting confidence intervals have nearly optimal size.

...read moreread less

BookDOI

Simultaneous Statistical Inference

Thorsten Dickhaus

TL;DR: A variety of classical and modern type I and type II error rates in multiple hypotheses testing are defined, some relationships between them are analyzed, and different ways to cope with structured systems of hypotheses are considered.

...read moreread less

Journal ArticleDOI

On asymptotically optimal confidence regions and tests for high-dimensional models

Sara van de Geer, +3 more

- 01 Jun 2014 -

Annals of Statistics

TL;DR: In this paper, a general method for constructing confidence intervals and statistical tests for single or low-dimensional components of a large parameter vector in a high-dimensional model is proposed, which can be easily adjusted for multiplicity taking dependence among tests into account.

...read moreread less