scispace - formally typeset
Search or ask a question

Showing papers by "Iain M. Johnstone published in 2018"


Journal ArticleDOI
18 Jul 2018
TL;DR: The behavior of the bulk of the sample eigenvalues under weak distributional assumptions on the observations has been described and alternative classes of estimation procedures have been developed by exploiting sparsity of the eigenvectors or the covariance matrix.
Abstract: When the data are high dimensional, widely used multivariate statistical methods such as principal component analysis can behave in unexpected ways. In settings where the dimension of the observations is comparable to the sample size, upward bias in sample eigenvalues and inconsistency of sample eigenvectors are among the most notable phenomena that appear. These phenomena, and the limiting behavior of the rescaled extreme sample eigenvalues, have recently been investigated in detail under the spiked covariance model. The behavior of the bulk of the sample eigenvalues under weak distributional assumptions on the observations has been described. These results have been exploited to develop new estimation and hypothesis testing methods for the population covariance matrix. Furthermore, partly in response to these phenomena, alternative classes of estimation procedures have been developed by exploiting sparsity of the eigenvectors or the covariance matrix. This paper gives an orientation to these areas.

96 citations


Posted Content
TL;DR: In this article, the behavior of outlier sample eigenvalues and eigenvectors of MANOVA variance components estimators in multivariate random and mixed effects linear models under a high-dimensional asymptotic regime was studied.
Abstract: We study principal components analyses in multivariate random and mixed effects linear models, assuming a spherical-plus-spikes structure for the covariance matrix of each random effect. We characterize the behavior of outlier sample eigenvalues and eigenvectors of MANOVA variance components estimators in such models under a high-dimensional asymptotic regime. Our results show that an aliasing phenomenon may occur in high dimensions, in which eigenvalues and eigenvectors of the MANOVA estimate for one variance component may be influenced by the other components. We propose an alternative procedure for estimating the true principal eigenvalues and eigenvectors that asymptotically corrects for this aliasing problem.

11 citations


Journal ArticleDOI
TL;DR: Conditions of quantitative bounds on tail sums of eigenvalues outside the bulk which are established here for applications in high dimensional covariance matrix estimation are established.
Abstract: Consider the classical Gaussian unitary ensemble of size N and the real white Wishart ensemble with N variables and n degrees of freedom. In the limits of large N and n, with positive ratio γ in the Wishart case, the expected number of eigenvalues that exit the upper bulk edge is less than one, approaching 0.031 and 0.170 respectively, the latter number being independent of γ. These statements are consequences of quantitative bounds on tail sums of eigenvalues outside the bulk which are established here for applications in high dimensional covariance matrix estimation.

5 citations


Posted Content
TL;DR: In the spiked covariance model, results on asymptotic normality of sample leading eigenvalues and certain projections of the corresponding sample eigenvectors are developed.
Abstract: These expository notes serve as a reference for an accompanying post Morales-Jimenez et al. [2018]. In the spiked covariance model, we develop results on asymptotic normality of sample leading eigenvalues and certain projections of the corresponding sample eigenvectors. The results parallel those of Paul [2007], but are given using the non-Gaussian model of Bai and Yao [2008]. The results are not new, and citations are given, but proofs are collected and organized as a point of departure for Morales-Jimenez et al. [2018].

4 citations


Posted Content
TL;DR: Numerical studies reveal the availability of fast, highly accurate and scalable methodology for binary mixed model analysis and show that fast and accurate quadrature-free inference can be realized for the probit link case with multivariate random effects and higher levels of nesting.
Abstract: Expectation propagation is a general prescription for approximation of integrals in statistical inference problems. Its literature is mainly concerned with Bayesian inference scenarios. However, expectation propagation can also be used to approximate integrals arising in frequentist statistical inference. We focus on likelihood-based inference for binary response mixed models and show that fast and accurate quadrature-free inference can be realized for the probit link case with multivariate random effects and higher levels of nesting. The approach is supported by asymptotic theory in which expectation propagation is seen to provide consistent estimation of the exact likelihood surface. Numerical studies reveal the availability of fast, highly accurate and scalable methodology for binary mixed model analysis.

2 citations