scispace - formally typeset
Search or ask a question

Showing papers by "Donald B. Rubin published in 1989"



Journal ArticleDOI
TL;DR: This work focuses on the development of Imputation Models for Social Security Benefit Reconciliation in the context of a Finite Population and examines the role of Bayesian and Randomization--Based Inferences in these models.
Abstract: Tables and Figures. Glossary. 1. Introduction. 1.1 Overview. 1.2 Examples of Surveys with Nonresponse. 1.3 Properly Handling Nonresponse. 1.4 Single Imputation. 1.5 Multiple Imputation. 1.6 Numerical Example Using Multiple Imputation. 1.7 Guidance for the Reader. 2. Statistical Background. 2.1 Introduction. 2.2 Variables in the Finite Population. 2.3 Probability Distributions and Related Calculations. 2.4 Probability Specifications for Indicator Variables. 2.5 Probability Specifications for (X,Y). 2.6 Bayesian Inference for a Population Quality. 2.7 Interval Estimation. 2.8 Bayesian Procedures for Constructing Interval Estimates, Including Significance Levels and Point Estimates. 2.9 Evaluating the Performance of Procedures. 2.10 Similarity of Bayesian and Randomization--Based Inferences in Many Practical Cases. 3. Underlying Bayesian Theory. 3.1 Introduction and Summary of Repeated--Imputation Inferences. 3.2 Key Results for Analysis When the Multiple Imputations are Repeated Draws from the Posterior Distribution of the Missing Values. 3.3 Inference for Scalar Estimands from a Modest Number of Repeated Completed--Data Means and Variances. 3.4 Significance Levels for Multicomponent Estimands from a Modest Number of Repeated Completed--Data Means and Variance--Covariance Matrices. 3.5 Significance Levels from Repeated Completed--Data Significance Levels. 3.6 Relating the Completed--Data and Completed--Data Posterior Distributions When the Sampling Mechanism is Ignorable. 4. Randomization--Based Evaluations. 4.1 Introduction. 4.2 General Conditions for the Randomization--Validity of Infinite--m Repeated--Imputation Inferences. 4.3Examples of Proper and Improper Imputation Methods in a Simple Case with Ignorable Nonresponse. 4.4 Further Discussion of Proper Imputation Methods. 4.5 The Asymptotic Distibution of (Qm,Um,Bm) for Proper Imputation Methods. 4.6 Evaluations of Finite--m Inferences with Scalar Estimands. 4.7 Evaluation of Significance Levels from the Moment--Based Statistics Dm and Dm with Multicomponent Estimands. 4.8 Evaluation of Significance Levels Based on Repeated Significance Levels. 5. Procedures with Ignorable Nonresponse. 5.1 Introduction. 5.2 Creating Imputed Values under an Explicit Model. 5.3 Some Explicit Imputation Models with Univariate YI and Covariates. 5.4 Monotone Patterns of Missingness in Multivariate YI. 5.5 Missing Social Security Benefits in the Current Population Survey. 5.6 Beyond Monotone Missingness. 6. Procedures with Nonignorable Nonresponse. 6.1 Introduction. 6.2 Nonignorable Nonresponse with Univariate YI and No XI. 6.3 Formal Tasks with Nonignorable Nonresponse. 6.4 Illustrating Mixture Modeling Using Educational Testing Service Data. 6.5 Illustrating Selection Modeling Using CPS Data. 6.6 Extensions to Surveys with Follow--Ups. 6.7 Follow--Up Response in a Survey of Drinking Behavior Among Men of Retirement Age. References. Author Index. Subject Index. Appendix I. Report Written for the Social Security Administration in 1977. Appendix II. Report Written for the Census Bureau in 1983.

5,436 citations


Journal ArticleDOI
TL;DR: In this paper, the authors demonstrate how nonresponse in sample surveys and censuses can be handled by replacing each missing value with two or more multiple imputations, and demonstrate the benefit of this statistical technique for researchers who must analyze them.
Abstract: Demonstrates how nonresponse in sample surveys and censuses can be handled by replacing each missing value with two or more multiple imputations. Clearly illustrates the advantages of modern computing to such handle surveys, and demonstrates the benefit of this statistical technique for researchers who must analyze them. Also presents the background for Bayesian and frequentist theory. After establishing that only standard complete-data methods are needed to analyze a multiply-imputed set, the text evaluates procedures in general circumstances, outlining specific procedures for creating imputations in both the ignorable and nonignorable cases. Examples and exercises reinforce ideas, and the interplay of Bayesian and frequentist ideas presents a unified picture of modern statistics.

2,078 citations


Journal ArticleDOI
TL;DR: The EM algorithm, a convenient method for computing maximum likelihood estimates in missing-data problems, is described and applied to two common models, the multivariate normal model for continuous data and the multinomial model for discrete data.
Abstract: Methods for handling missing data in social science data sets are reviewed. Limitations of common practical approaches, including complete-case analysis, available-case analysis and imputation, are illustrated on a simple missing-data problem with one complete and one incomplete variable. Two more principled approaches, namely maximum likelihood under a model for the data and missing-data mechanism and multiple imputation, are applied to the bivariate problem. General properties of these methods are outlined, and applications to more complex missing-data problems are discussed. The EM algorithm, a convenient method for computing maximum likelihood estimates in missing-data problems, is described and applied to two common models, the multivariate normal model for continuous data and the multinomial model for discrete data. Multiple imputation under explicit or implicit models is recommended as a method that retains the advantages of imputation and overcomes its limitations.

994 citations


Journal ArticleDOI
TL;DR: Les auteurs imaginent un indicateur, aisement interpretable, de la taille et de l'importance des effets d'une experimentation sur un echantillon unique.
Abstract: Les auteurs imaginent un indicateur, aisement interpretable, de la taille et de l'importance des effets d'une experimentation sur un echantillon unique. Ils le font suivant un modele mathematique

86 citations


Book ChapterDOI
01 Jan 1989
TL;DR: In this paper, the authors describe the applications of multilevel models of educational data and present several examples of multi-level analyses of data from the Educational Testing Service (ETS).
Abstract: Publisher Summary This chapter describes the applications of multilevel models of educational data. Multilevel models are powerful tools for the applied statistician involved in educational research. Several examples of multilevel analyses of data from Educational Testing Service is presented in the chapter, in which multilevel analysis using a Bayesian approach provides better answers than methods based on sampling theory ideas such as un-biasedness and minimum variance. Multilevel models lie between standard models—between the extremes associated with the significant/not-significant dichotomy arising from the testing of standard models. Multilevel models can be used when data are thin without invoking awkward rules regarding the minimum sample size required for some form of analysis. Multilevel models make the quantities to be estimated random variables. The best confirmation is honest validation: make predictions to new data sets and see how accurate the predictions are. The consideration of conceptual replications can then be useful by eliminating inappropriate models. Diagnostics, such as frequentist-based examination of residuals, or posterior predictive model monitoring are, thus, necessary adjuncts to a successful multilevel analysis.

4 citations