scispace - formally typeset
Journal ArticleDOI

Inference and missing data

Donald B. Rubin
- 01 Dec 1976 - 
- Vol. 63, Iss: 3, pp 581-592
TLDR
In this article, it was shown that ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are missing at random and the observed data are observed at random, and then such inferences are generally conditional on the observed pattern of missing data.
Abstract
Two results are presented concerning inference when data may be missing. First, ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are “missing at random” and the observed data are “observed at random,” and then such inferences are generally conditional on the observed pattern of missing data. Second, ignoring the process that causes missing data when making Bayesian inferences about θ is generally appropriate if and only if the missing data are missing at random and the parameter of the missing data is “independent” of θ. Examples and discussion indicating the implications of these results are included.

read more

Citations
More filters

Piecing together the past: Statistical insights into paleoclimatic reconstructions

TL;DR: In this paper, the authors consider the problem of reconstructing a climate process through space and time from overlapping instrumental and climate sensitive proxy time series that are assumed to be well dated, an assumption likely only reasonable for certain proxies over at most the last few millennia.
Journal ArticleDOI

The Use of Multiple Correspondence Analysis to Explore Associations between Categories of Qualitative Variables in Healthy Ageing.

TL;DR: MCA provided a powerful tool to explore complex ageing data, covering multiple and diverse variables, showing if a relationship exists and how variables are related, and offering statistical results that can be seen both analytically and visually.
Journal ArticleDOI

Multiple Imputation for Incomplete Data in Epidemiologic Studies

TL;DR: The theoretical underpinnings of multiple imputation are described, and application of this method is illustrated as part of a collaborative challenge to assess the performance of various techniques for dealing with missing data.
Posted Content

Is Greener Whiter Yet? The Sustainable Slopes Program After Five Years

TL;DR: In this paper, the ski industry's Sustainable Slopes Program (SSP) was evaluated in the western United States between 2001 and 2005 and no evidence in a five-year analysis to conclude that ski areas adopting the SSP displayed superior performance levels than non-participants for the following areas of environmental protection: overall environmental performance, expansion management, pollution management, and wildlife and habitat management.
Journal ArticleDOI

External Validity in Policy Evaluations that Choose Sites Purposively.

TL;DR: A conceptual model of purposive site selection is proposed and a formal, yet intuitive, mathematical expression for the bias in the pooled impact estimate when sites are selected purposively is derived.
References
More filters
Book

Bayesian inference in statistical analysis

TL;DR: In this article, the effect of non-normality on inference about a population mean with generalizations was investigated. But the authors focused on the effect on the mean with information from more than one source.
Journal ArticleDOI

Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are Missing

TL;DR: In this paper, the authors give an approach to derive maximum likelihood estimates of parameters of multivariate normal distributions in cases where some observations are missing (Edgett [2] and Lord [3], [4]).
Journal ArticleDOI

Missing Observations in Multivariate Statistics I. Review of the Literature

TL;DR: In this paper, a review of the literature on the problem of handling multivariate data with observations missing on some or all of the variables under study is presented, where the authors examine the ways that statisticians have devised to estimate means, variances, correlations and linear regression functions.