Showing papers by "Donald B. Rubin published in 1989"

PDF

Open Access

Journal Article•DOI•

Statistical Analysis with Missing Data

[...]

Larry E. Richards, Roderick J. A. Little, Donald B. Rubin

01 Aug 1989-Journal of Marketing Research

7,643 citations

Journal Article•DOI•

Multiple Imputation for Nonresponse in Surveys.

[...]

C. D. Kershaw, Donald B. Rubin

01 Mar 1989-Biometrics

TL;DR: This work focuses on the development of Imputation Models for Social Security Benefit Reconciliation in the context of a Finite Population and examines the role of Bayesian and Randomization--Based Inferences in these models.

...read moreread less

Abstract: Tables and Figures. Glossary. 1. Introduction. 1.1 Overview. 1.2 Examples of Surveys with Nonresponse. 1.3 Properly Handling Nonresponse. 1.4 Single Imputation. 1.5 Multiple Imputation. 1.6 Numerical Example Using Multiple Imputation. 1.7 Guidance for the Reader. 2. Statistical Background. 2.1 Introduction. 2.2 Variables in the Finite Population. 2.3 Probability Distributions and Related Calculations. 2.4 Probability Specifications for Indicator Variables. 2.5 Probability Specifications for (X,Y). 2.6 Bayesian Inference for a Population Quality. 2.7 Interval Estimation. 2.8 Bayesian Procedures for Constructing Interval Estimates, Including Significance Levels and Point Estimates. 2.9 Evaluating the Performance of Procedures. 2.10 Similarity of Bayesian and Randomization--Based Inferences in Many Practical Cases. 3. Underlying Bayesian Theory. 3.1 Introduction and Summary of Repeated--Imputation Inferences. 3.2 Key Results for Analysis When the Multiple Imputations are Repeated Draws from the Posterior Distribution of the Missing Values. 3.3 Inference for Scalar Estimands from a Modest Number of Repeated Completed--Data Means and Variances. 3.4 Significance Levels for Multicomponent Estimands from a Modest Number of Repeated Completed--Data Means and Variance--Covariance Matrices. 3.5 Significance Levels from Repeated Completed--Data Significance Levels. 3.6 Relating the Completed--Data and Completed--Data Posterior Distributions When the Sampling Mechanism is Ignorable. 4. Randomization--Based Evaluations. 4.1 Introduction. 4.2 General Conditions for the Randomization--Validity of Infinite--m Repeated--Imputation Inferences. 4.3Examples of Proper and Improper Imputation Methods in a Simple Case with Ignorable Nonresponse. 4.4 Further Discussion of Proper Imputation Methods. 4.5 The Asymptotic Distibution of (Qm,Um,Bm) for Proper Imputation Methods. 4.6 Evaluations of Finite--m Inferences with Scalar Estimands. 4.7 Evaluation of Significance Levels from the Moment--Based Statistics Dm and Dm with Multicomponent Estimands. 4.8 Evaluation of Significance Levels Based on Repeated Significance Levels. 5. Procedures with Ignorable Nonresponse. 5.1 Introduction. 5.2 Creating Imputed Values under an Explicit Model. 5.3 Some Explicit Imputation Models with Univariate YI and Covariates. 5.4 Monotone Patterns of Missingness in Multivariate YI. 5.5 Missing Social Security Benefits in the Current Population Survey. 5.6 Beyond Monotone Missingness. 6. Procedures with Nonignorable Nonresponse. 6.1 Introduction. 6.2 Nonignorable Nonresponse with Univariate YI and No XI. 6.3 Formal Tasks with Nonignorable Nonresponse. 6.4 Illustrating Mixture Modeling Using Educational Testing Service Data. 6.5 Illustrating Selection Modeling Using CPS Data. 6.6 Extensions to Surveys with Follow--Ups. 6.7 Follow--Up Response in a Survey of Drinking Behavior Among Men of Retirement Age. References. Author Index. Subject Index. Appendix I. Report Written for the Social Security Administration in 1977. Appendix II. Report Written for the Census Bureau in 1983.

...read moreread less

5,436 citations

Journal Article•DOI•

Multiple Imputation for Nonresponse in Surveys

[...]

William M. Campion, Donald B. Rubin

01 Nov 1989-Journal of Marketing Research

TL;DR: In this paper, the authors demonstrate how nonresponse in sample surveys and censuses can be handled by replacing each missing value with two or more multiple imputations, and demonstrate the benefit of this statistical technique for researchers who must analyze them.

...read moreread less

Abstract: Demonstrates how nonresponse in sample surveys and censuses can be handled by replacing each missing value with two or more multiple imputations. Clearly illustrates the advantages of modern computing to such handle surveys, and demonstrates the benefit of this statistical technique for researchers who must analyze them. Also presents the background for Bayesian and frequentist theory. After establishing that only standard complete-data methods are needed to analyze a multiply-imputed set, the text evaluates procedures in general circumstances, outlining specific procedures for creating imputations in both the ignorable and nonignorable cases. Examples and exercises reinforce ideas, and the interplay of Bayesian and frequentist ideas presents a unified picture of modern statistics.

...read moreread less

2,078 citations

Journal Article•DOI•

The Analysis of Social Science Data with Missing Values

[...]

Roderick J. A. Little¹, Donald B. Rubin²•Institutions (2)

University of California, Los Angeles¹, Harvard University²

01 Nov 1989-Sociological Methods & Research

TL;DR: The EM algorithm, a convenient method for computing maximum likelihood estimates in missing-data problems, is described and applied to two common models, the multivariate normal model for continuous data and the multinomial model for discrete data.

...read moreread less

Abstract: Methods for handling missing data in social science data sets are reviewed. Limitations of common practical approaches, including complete-case analysis, available-case analysis and imputation, are illustrated on a simple missing-data problem with one complete and one incomplete variable. Two more principled approaches, namely maximum likelihood under a model for the data and missing-data mechanism and multiple imputation, are applied to the bivariate problem. General properties of these methods are outlined, and applications to more complex missing-data problems are discussed. The EM algorithm, a convenient method for computing maximum likelihood estimates in missing-data problems, is described and applied to two common models, the multivariate normal model for continuous data and the multinomial model for discrete data. Multiple imputation under explicit or implicit models is recommended as a method that retains the advantages of imputation and overcomes its limitations.

...read moreread less

994 citations

Journal Article•DOI•

Effect Size Estimation for One-Sample Multiple-Choice-Type Data: Design, Analysis, and Meta-Analysis

[...]

Robert Rosenthal¹, Donald B. Rubin•Institutions (1)

Harvard University¹

01 Sep 1989-Psychological Bulletin

TL;DR: Les auteurs imaginent un indicateur, aisement interpretable, de la taille et de l'importance des effets d'une experimentation sur un echantillon unique.

...read moreread less

Abstract: Les auteurs imaginent un indicateur, aisement interpretable, de la taille et de l'importance des effets d'une experimentation sur un echantillon unique. Ils le font suivant un modele mathematique

...read moreread less

86 citations

Book Chapter•DOI•

Some applications of multilevel models to educational data

[...]

Donald B. Rubin¹•Institutions (1)

Harvard University¹

01 Jan 1989

TL;DR: In this paper, the authors describe the applications of multilevel models of educational data and present several examples of multi-level analyses of data from the Educational Testing Service (ETS).

...read moreread less

Abstract: Publisher Summary This chapter describes the applications of multilevel models of educational data. Multilevel models are powerful tools for the applied statistician involved in educational research. Several examples of multilevel analyses of data from Educational Testing Service is presented in the chapter, in which multilevel analysis using a Bayesian approach provides better answers than methods based on sampling theory ideas such as un-biasedness and minimum variance. Multilevel models lie between standard models—between the extremes associated with the significant/not-significant dichotomy arising from the testing of standard models. Multilevel models can be used when data are thin without invoking awkward rules regarding the minimum sample size required for some form of analysis. Multilevel models make the quantities to be estimated random variables. The best confirmation is honest validation: make predictions to new data sets and see how accurate the predictions are. The consideration of conceptual replications can then be useful by eliminating inappropriate models. Diagnostics, such as frequentist-based examination of residuals, or posterior predictive model monitoring are, thus, necessary adjuncts to a successful multilevel analysis.

...read moreread less

4 citations

Journal Article•

Multiple imputation for nonresponse in surveys (book review)

[...]

Donald B. Rubin, C. D. Reviewer Kershaw

08 May 1989-Biometrics

3 citations

Journal Article•DOI•

Bugs, Lacunae, and the Minnesota/DC Effect—A Discussion of H. Wainer’s “Eelworms, Bullet Holes, and Geraldine Ferraro”

[...]

Donald B. Rubin

01 Jan 1989-Journal of Educational and Behavioral Statistics

2 citations