scispace - formally typeset
Journal ArticleDOI

Inference and missing data

Donald B. Rubin
- 01 Dec 1976 - 
- Vol. 63, Iss: 3, pp 581-592
TLDR
In this article, it was shown that ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are missing at random and the observed data are observed at random, and then such inferences are generally conditional on the observed pattern of missing data.
Abstract
Two results are presented concerning inference when data may be missing. First, ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are “missing at random” and the observed data are “observed at random,” and then such inferences are generally conditional on the observed pattern of missing data. Second, ignoring the process that causes missing data when making Bayesian inferences about θ is generally appropriate if and only if the missing data are missing at random and the parameter of the missing data is “independent” of θ. Examples and discussion indicating the implications of these results are included.

read more

Citations
More filters
Journal ArticleDOI

Statistical data preparation: management of missing values and outliers.

TL;DR: The types of missing values, ways of identifying outliers, and dealing with the two are discussed, which affect the process of estimating statistics, resulting in overestimated or underestimated values.
Journal ArticleDOI

Umgang mit fehlenden Werten in der psychologischen Forschung : Probleme und Lösungen

TL;DR: In this paper, a Ubersicht der in der Literatur diskutierten Ansatze zum Umgang mit fehlenden Werten vorgenommen, wobei drei Typen von Verfahren unterschieden werden.
Journal ArticleDOI

Identification and prospective validation of clinically relevant chronic obstructive pulmonary disease (COPD) subtypes

TL;DR: In patients with COPD recruited at their first hospitalisation, three different COPD subtypes were identified and prospectively validated: ‘severe respiratory COPD’, ‘moderate respiratory COPd’ and ‘systemic COPD'.
Journal ArticleDOI

Computational Strategies for Multivariate Linear Mixed-Effects Models With Missing Values

TL;DR: In this paper, the authors present new computational techniques for multivariate longitudinal or clustered data with missing values by applying a multivariate extension of a popular linear mixed-effects model, creating multiple imputations of missing values for subsequent analyses by a straightforward and effective Markov chain Monte Carlo procedure.
Journal ArticleDOI

Investment in Energy Efficiency: Do the Characteristics of Firms Matter?

TL;DR: In this article, a discrete choice regression is estimated over a large sample of participating and non-participating firms to examine whether firms' characteristics influence their decision to join the Environmental Protection Agency's voluntary Green Lights program.
References
More filters
Book

Bayesian inference in statistical analysis

TL;DR: In this article, the effect of non-normality on inference about a population mean with generalizations was investigated. But the authors focused on the effect on the mean with information from more than one source.
Journal ArticleDOI

Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are Missing

TL;DR: In this paper, the authors give an approach to derive maximum likelihood estimates of parameters of multivariate normal distributions in cases where some observations are missing (Edgett [2] and Lord [3], [4]).
Journal ArticleDOI

Missing Observations in Multivariate Statistics I. Review of the Literature

TL;DR: In this paper, a review of the literature on the problem of handling multivariate data with observations missing on some or all of the variables under study is presented, where the authors examine the ways that statisticians have devised to estimate means, variances, correlations and linear regression functions.