Bayesian Model Averaging for Generalized Linear Models with Missing Covariates

Open AccessPosted Content

Bayesian Model Averaging for Generalized Linear Models with Missing Covariates

Valentino Dardanoni, +3 more

- 01 May 2013 -

Research Papers in Economics

Chats0

TLDR

In this article, the authors address the problem of estimating generalized linear models (GLMs) when the outcome of interest is always observed, the values of some covariates are missing for some observations, but imputations are available to fill-in the missing values.

Abstract:

We address the problem of estimating generalized linear models (GLMs) when the outcome of interest is always observed, the values of some covariates are missing for some observations, but imputations are available to fill-in the missing values. Under certain conditions on the missing-data mechanism and the imputation model, this situation generates a trade-off between bias and precision in the estimation of the parameters of interest. The complete cases are often too few, so precision is lost, but just filling-in the missing values with the imputations may lead to bias when the imputation model is either incorrectly specified or uncongenial. Following the generalized missing-indicator approach originally proposed by Dardanoni et al. (2011) for linear regression models, we characterize this bias-precision trade- off in terms of model uncertainty regarding which covariates should be dropped from an augmented GLM for the full sample of observed and imputed data. This formulation is attractive because model uncertainty can then be handled very naturally through Bayesian model averaging (BMA). In addition to applying the generalized missing-indicator method to the wider class of GLMs, we make two extensions. First, we propose a block-BMA strategy that incorporates information on the available missing-data patterns and has the advantage of being computationally simple. Second, we allow the observed outcome to be multivariate, thus covering the case of seemingly unrelated regression equations models, and ordered, multinomial or conditional logit and probit models. Our approach is illustrated through an empirical application using the first wave of the Survey on Health, Aging and Retirement in Europe (SHARE).

Bayesian Model Averaging for Generalized Linear Models with Missing Covariates

Citations

Journal of the American Statistical Association: William S. Cleveland, Marylyn E. McGill and Robert McGill, “The shape parameter for a two variable graph” 83 (1988) 289–300

Imputation of Missing Data in Waves 1 and 2 of SHARE

Frequentist Model Averaging

References

The risk inflation criterion for multiple regression

Correction: Consistency and Asymptotic Normality of the Maximum Likelihood Estimator in Generalized Linear Models

Journal of the American Statistical Association: William S. Cleveland, Marylyn E. McGill and Robert McGill, “The shape parameter for a two variable graph” 83 (1988) 289–300

Bayesian Model Selection in Structural Equation Models

A comparison of two model averaging techniques with an application to growth empirics

Related Papers (5)

A Note on the Use of Missing Auxiliary Variables in Full Information Maximum Likelihood-Based Structural Equation Models

Multiple Imputation Strategies for Multiple Group Structural Equation Models

Review of Zero-Inflated Models with Missing Data

Analyzing Incomplete Discrete Longitudinal Clinical Trial Data

A Class of Pattern-Mixture Models for Normal Incomplete Data