Open Access · Journal Article · DOI

Transposable regularized covariance models with an application to missing data imputation

TL;DR: Simulations and results on microarray data and the Netflix data show that these imputation techniques often outperform existing methods and offer a greater degree of flexibility.
Abstract
Missing data estimation is an important challenge with high-dimensional data arranged in the form of a matrix. Typically this data matrix is transposable, meaning that either the rows, columns or both can be treated as features. To model transposable data, we present a modification of the matrix-variate normal, the mean-restricted matrix-variate normal, in which the rows and columns each have a separate mean vector and covariance matrix. By placing additive penalties on the inverse covariance matrices of the rows and columns, these so-called transposable regularized covariance models allow for maximum likelihood estimation of the mean and nonsingular covariance matrices. Using these models, we formulate EM-type algorithms for missing data imputation in both the multivariate and transposable frameworks. We present theoretical results exploiting the structure of our transposable models that allow these models and imputation methods to be applied to high-dimensional data. Simulations and results on microarray data and the Netflix data show that these imputation techniques often outperform existing methods and offer a greater degree of flexibility.
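The EM-type imputation idea in the abstract can be illustrated with a minimal sketch in the multivariate (rows-as-observations) setting: fill missing entries with their conditional mean given the observed entries in the same row, then re-estimate the mean and a regularized covariance, and iterate. This is an assumption-laden simplification, not the paper's transposable method; it uses a simple additive ridge on the covariance itself rather than a penalty on the inverse covariance, and the function and parameter names are hypothetical.

```python
import numpy as np

def em_impute(X, ridge=0.1, n_iter=50):
    """Iterative (EM-style) imputation under a multivariate normal model.

    NaN entries are replaced by their conditional mean given the observed
    entries in the same row; the mean vector and a ridge-regularized
    covariance are re-estimated from the completed data each iteration.
    Sketch only: the ridge term stands in for the paper's inverse-covariance
    penalty and guarantees a nonsingular covariance in high dimensions.
    """
    X = np.asarray(X, dtype=float)
    mask = np.isnan(X)
    Xf = X.copy()
    # Initialize missing entries with column means.
    col_means = np.nanmean(X, axis=0)
    Xf[mask] = np.take(col_means, np.where(mask)[1])
    for _ in range(n_iter):
        mu = Xf.mean(axis=0)
        S = np.cov(Xf, rowvar=False) + ridge * np.eye(X.shape[1])
        for i in range(X.shape[0]):
            m = mask[i]
            if not m.any():
                continue
            o = ~m
            if not o.any():
                # Fully missing row: fall back to the current mean.
                Xf[i, m] = mu[m]
                continue
            # Conditional mean: mu_m + S_mo S_oo^{-1} (x_o - mu_o)
            S_oo = S[np.ix_(o, o)]
            S_mo = S[np.ix_(m, o)]
            Xf[i, m] = mu[m] + S_mo @ np.linalg.solve(S_oo, Xf[i, o] - mu[o])
    return Xf
```

On data where one column is strongly correlated with another, the imputed values track that correlation rather than the unconditional column mean, which is the basic advantage of covariance-based imputation over row- or column-mean filling.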


Citations
Journal ArticleDOI

Regression shrinkage and selection via the lasso: a retrospective

TL;DR: In this article, the authors give a brief review of the basic idea and some history and then discuss some developments since the original paper on regression shrinkage and selection via the lasso.
Journal ArticleDOI

Geodesic Convexity and Covariance Estimation

TL;DR: This work considers g-convex functions with positive definite matrix variables, proves that Kronecker products and logarithms of determinants are g-convex, and applies these results to two modern covariance estimation problems: robust estimation in scaled Gaussian distributions, and Kronecker structured models.
Journal ArticleDOI

A Generalized Least-Square Matrix Decomposition

TL;DR: By finding the best low-rank approximation of the data with respect to a transposable quadratic norm, the generalized least-square matrix decomposition (GMD) directly accounts for structural relationships; it is demonstrated for dimension reduction, signal recovery, and feature selection with high-dimensional structured data.
Journal ArticleDOI

Covariance Estimation in High Dimensions Via Kronecker Product Expansions

TL;DR: The results establish that PRLS converges significantly faster than the standard sample covariance matrix (SCM) estimator, show that a class of block Toeplitz covariance matrices can be approximated with low separation rank, and give bounds on the minimal separation rank r that ensures a given level of bias.
Journal ArticleDOI

Sparse Matrix Graphical Models

TL;DR: This article proposes a novel sparse matrix graphical model that characterizes the underlying conditional independence structure by penalizing, respectively, the two precision matrices corresponding to the rows and columns.
References
Book

Statistical Analysis with Missing Data

TL;DR: This book develops maximum likelihood methods for general patterns of missing data, covering theory for ignorable nonresponse and large-sample inference based on maximum likelihood estimates.
Journal ArticleDOI

Exact Matrix Completion via Convex Optimization

TL;DR: It is proved that one can perfectly recover most low-rank matrices from what appears to be an incomplete set of entries, and that objects other than signals and images can be perfectly reconstructed from very limited information.
Journal ArticleDOI

Missing value estimation methods for DNA microarrays.

TL;DR: It is shown that KNNimpute provides a more robust and sensitive method for missing value estimation than SVDimpute, and that both SVDimpute and KNNimpute surpass the commonly used row average method (as well as filling missing values with zeros).
Journal ArticleDOI

Multiple Imputation After 18+ Years

TL;DR: This paper describes the assumed context and objectives of multiple imputation and reviews the multiple-imputation framework and its standard results.