scispace - formally typeset
Journal ArticleDOI

Model averaging and muddled multimodel inferences

Brian S. Cade
- 01 Sep 2015 - 
- Vol. 96, Iss: 9, pp 2370-2382
Reads0
Chats0
TLDR
Three flawed practices associated with model averaging coefficients for predictor variables in regression models commonly occur when making multimodel inferences in analyses of ecological data and ought to be discontinued if the authors are to make effective scientific contributions to ecological knowledge and conservation of natural resources.
Abstract
Three flawed practices associated with model averaging coefficients for predictor variables in regression models commonly occur when making multimodel inferences in analyses of ecological data. Model-averaged regression coefficients based on Akaike information criterion (AIC) weights have been recommended for addressing model uncertainty but they are not valid, interpretable estimates of partial effects for individual predictors when there is multicollinearity among the predictor variables. Multicollinearity implies that the scaling of units in the denominators of the regression coefficients may change across models such that neither the parameters nor their estimates have common scales, therefore averaging them makes no sense. The associated sums of AIC model weights recommended to assess relative importance of individual predictors are really a measure of relative importance of models, with little information about contributions by individual predictors compared to other measures of relative importance based on effects size or variance reduction. Sometimes the model-averaged regression coefficients for predictor variables are incorrectly used to make model-averaged predictions of the response variable when the models are not linear in the parameters. I demonstrate the issues with the first two practices using the college grade point average example extensively analyzed by Burnham and Anderson. I show how partial standard deviations of the predictor variables can be used to detect changing scales of their estimates with multicollinearity. Standardizing estimates based on partial standard deviations for their variables can be used to make the scaling of the estimates commensurate across models, a necessary but not sufficient condition for model averaging of the estimates to be sensible. A unimodal distribution of estimates and valid interpretation of individual parameters are additional requisite conditions. The standardized estimates or equivalently the t statistics on unstandardized estimates also can be used to provide more informative measures of relative importance than sums of AIC weights. Finally, I illustrate how seriously compromised statistical interpretations and predictions can be for all three of these flawed practices by critiquing their use in a recent species distribution modeling technique developed for predicting Greater Sage-Grouse (Centrocercus urophasianus) distribution in Colorado, USA. These model averaging issues are common in other ecological literature and ought to be discontinued if we are to make effective scientific contributions to ecological knowledge and conservation of natural resources.

read more

Citations
More filters
Journal ArticleDOI

A brief introduction to mixed effects modelling and multi-model inference in ecology.

TL;DR: This overview should serve as a widely accessible code of best practice for applying LMMs to complex biological problems and model structures, and in doing so improve the robustness of conclusions drawn from studies investigating ecological and evolutionary questions.
Journal ArticleDOI

Doses of neighborhood nature: The benefits for mental health of living with nature

TL;DR: In this paper, the authors demonstrate quantifiable associations of mental health with the characteristics of nearby nature that people actually experience and demonstrate that vegetation cover and afternoon bird abundances were positively associated with a lower prevalence of depression, anxiety, and stress.
Journal ArticleDOI

Model averaging in ecology: a review of Bayesian, information-theoretic, and tactical approaches for predictive inference

TL;DR: In this article, the authors review the mathematical foundations of model averaging along with the diversity of approaches available and stress the importance of non-parametric methods such as cross-validation for a reliable uncertainty quantification of model-averaged predictions.
Journal ArticleDOI

The relative performance of AIC, AICC and BIC in the presence of unobserved heterogeneity

TL;DR: It is found that the relative predictive performance of model selection by different information criteria is heavily dependent on the degree of unobserved heterogeneity between data sets, and that the choice of information criterion should ideally be based upon hypothesized properties of the population of data sets from which a given data set could have arisen.
Journal ArticleDOI

Identifying the best climatic predictors in ecology and evolution

TL;DR: A four-step approach is proposed which allows for more rigorous identification and quantification of weather signals and any other predictor variable for which data is available at high temporal resolution, easily implementable with the new R package ‘climwin’ and provides a benchmark performance to compare other approaches to.
References
More filters
Journal ArticleDOI

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Book

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).
Journal ArticleDOI

Multimodel Inference Understanding AIC and BIC in Model Selection

TL;DR: Various facets of such multimodel inference are presented here, particularly methods of model averaging, which can be derived as a non-Bayesian result.
BookDOI

Regression modeling strategies : with applications to linear models, logistic regression, and survival analysis

TL;DR: In this article, the authors present a case study in least squares fitting and interpretation of a linear model, where they use nonparametric transformations of X and Y to fit a linear regression model.
Journal ArticleDOI

Collinearity: a review of methods to deal with it and a simulation study evaluating their performance

TL;DR: It was found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection and the value of GLM in combination with penalised methods and thresholds when omitted variables are considered in the final interpretation.
Related Papers (5)