Multimodel Inference Understanding AIC and BIC in Model Selection

doi:10.1177/0049124104268644

Home
/
Papers
/
Multimodel Inference Understanding AIC and BIC in Model Selection

Journal Article•DOI•

Multimodel Inference Understanding AIC and BIC in Model Selection

Kenneth P. Burnham¹, David E. Anderson¹•Institutions (1)

Colorado State University¹

01 Nov 2004-Sociological Methods & Research (SAGE Publications)-Vol. 33, Iss: 2, pp 261-304

TL;DR: Various facets of such multimodel inference are presented here, particularly methods of model averaging, which can be derived as a non-Bayesian result.

read less

Abstract: The model selection literature has been generally poor at reflecting the deep foundations of the Akaike information criterion (AIC) and at making appropriate comparisons to the Bayesian information...

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

AIC model selection and multimodel inference in behavioral ecology: some background, observations, and comparisons

[...]

Kenneth P. Burnham¹, David E. Anderson¹, Kathryn P. Huyvaert¹•Institutions (1)

Colorado State University¹

01 Feb 2011-Behavioral Ecology and Sociobiology

TL;DR: The information-theoretic (I-T) approaches to valid inference are outlined including a review of some simple methods for making formal inference from all the hypotheses in the model set (multimodel inference).

...read moreread less

Abstract: We briefly outline the information-theoretic (I-T) approaches to valid inference including a review of some simple methods for making formal inference from all the hypotheses in the model set (multimodel inference). The I-T approaches can replace the usual t tests and ANOVA tables that are so inferentially limited, but still commonly used. The I-T methods are easy to compute and understand and provide formal measures of the strength of evidence for both the null and alternative hypotheses, given the data. We give an example to highlight the importance of deriving alternative hypotheses and representing these as probability models. Fifteen technical issues are addressed to clarify various points that have appeared incorrectly in the recent literature. We offer several remarks regarding the future of empirical science and data analysis under an I-T framework.

...read moreread less

3,105 citations

Journal Article•DOI•

The effect of human mobility and control measures on the COVID-19 epidemic in China.

[...]

Moritz U. G. Kraemer¹, Moritz U. G. Kraemer², Moritz U. G. Kraemer³, Chia-Hung Yang⁴, Bernardo Gutierrez¹, Bernardo Gutierrez⁵, Chieh-Hsi Wu⁶, Brennan Klein⁴, David M. Pigott⁷, Louis du Plessis¹, Nuno R. Faria¹, Ruoran Li², William P. Hanage², John S. Brownstein², John S. Brownstein³, Maylis Layan⁸, Maylis Layan⁹, Alessandro Vespignani¹⁰, Alessandro Vespignani⁴, Huaiyu Tian¹¹, Christopher Dye¹, Oliver G. Pybus¹, Oliver G. Pybus¹², Samuel V. Scarpino⁴ - Show less +20 more•Institutions (12)

University of Oxford¹, Harvard University², Boston Children's Hospital³, Northeastern University⁴, Universidad San Francisco de Quito⁵, University of Southampton⁶, Institute for Health Metrics and Evaluation⁷, University of Paris⁸, Pasteur Institute⁹, Institute for Scientific Interchange¹⁰, Beijing Normal University¹¹, Royal Veterinary College¹²

01 May 2020-Science

TL;DR: Real-time mobility data from Wuhan and detailed case data including travel history are used to elucidate the role of case importation in transmission in cities across China and to ascertain the impact of control measures.

...read moreread less

Abstract: The ongoing coronavirus disease 2019 (COVID-19) outbreak expanded rapidly throughout China. Major behavioral, clinical, and state interventions were undertaken to mitigate the epidemic and prevent the persistence of the virus in human populations in China and worldwide. It remains unclear how these unprecedented interventions, including travel restrictions, affected COVID-19 spread in China. We used real-time mobility data from Wuhan and detailed case data including travel history to elucidate the role of case importation in transmission in cities across China and to ascertain the impact of control measures. Early on, the spatial distribution of COVID-19 cases in China was explained well by human mobility data. After the implementation of control measures, this correlation dropped and growth rates became negative in most locations, although shifts in the demographics of reported cases were still indicative of local chains of transmission outside of Wuhan. This study shows that the drastic control measures implemented in China substantially mitigated the spread of COVID-19.

...read moreread less

2,362 citations

Journal Article•DOI•

An Introduction to Recursive Partitioning: Rationale, Application and Characteristics of Classification and Regression Trees, Bagging and Random Forests

[...]

Carolin Strobl¹, James D. Malley¹, Gerhard Tutz²•Institutions (2)

Ludwig Maximilian University of Munich¹, Center for Information Technology²

01 Dec 2009-Psychological Methods

TL;DR: The aim of this work is to introduce the principles of the standard recursive partitioning methods as well as recent methodological improvements, to illustrate their usage for low and high-dimensional data exploration, but also to point out limitations of the methods and potential pitfalls in their practical application.

...read moreread less

Abstract: Recursive partitioning methods have become popular and widely used tools for nonparametric regression and classification in many scientific fields. Especially random forests, which can deal with large numbers of predictor variables even in the presence of complex interactions, have been applied successfully in genetics, clinical medicine, and bioinformatics within the past few years. High-dimensional problems are common not only in genetics, but also in some areas of psychological research, where only a few subjects can be measured because of time or cost constraints, yet a large amount of data is generated for each subject. Random forests have been shown to achieve a high prediction accuracy in such applications and to provide descriptive variable importance measures reflecting the impact of each variable in both main effects and interactions. The aim of this work is to introduce the principles of the standard recursive partitioning methods as well as recent methodological improvements, to illustrate their usage for low and high-dimensional data exploration, but also to point out limitations of the methods and potential pitfalls in their practical application. Application of the methods is illustrated with freely available implementations in the R system for statistical computing.

...read moreread less

2,001 citations

Additional excerpts

...For a detailed discussion of approaches that account for the complexity of parametric models, see Burnham and Anderson (2002) or Burnham and Anderson (2004)....
[...]

Journal Article•DOI•

A brief guide to model selection, multimodel inference and model averaging in behavioural ecology using Akaike’s information criterion.

[...]

Matthew R. E. Symonds¹, Adnan Moussalli²•Institutions (2)

University of Melbourne¹, Museum Victoria²

01 Jan 2011-Behavioral Ecology and Sociobiology

TL;DR: Akaike’s information criterion is provided, using recent examples from the behavioural ecology literature, a simple introductory guide to AIC: what it is, how and when to apply it and what it achieves.

...read moreread less

Abstract: Akaike’s information criterion (AIC) is increasingly being used in analyses in the field of ecology. This measure allows one to compare and rank multiple competing models and to estimate which of them best approximates the “true” process underlying the biological phenomenon under study. Behavioural ecologists have been slow to adopt this statistical tool, perhaps because of unfounded fears regarding the complexity of the technique. Here, we provide, using recent examples from the behavioural ecology literature, a simple introductory guide to AIC: what it is, how and when to apply it and what it achieves. We discuss multimodel inference using AIC—a procedure which should be used where no one model is strongly supported. Finally, we highlight a few of the pitfalls and problems that can be encountered by novice practitioners.

...read moreread less

1,946 citations

Book•

Ecological Models and Data in R

[...]

Benjamin M. Bolker

21 Jul 2008

TL;DR: In step-by-step detail, Benjamin Bolker teaches ecology graduate students and researchers everything they need to know in order to use maximum likelihood, information-theoretic, and Bayesian techniques to analyze their own data using the programming language R.

...read moreread less

Abstract: Ecological Models and Data in R is the first truly practical introduction to modern statistical methods for ecology. In step-by-step detail, the book teaches ecology graduate students and researchers everything they need to know in order to use maximum likelihood, information-theoretic, and Bayesian techniques to analyze their own data using the programming language R. Drawing on extensive experience teaching these techniques to graduate students in ecology, Benjamin Bolker shows how to choose among and construct statistical models for data, estimate their parameters and confidence limits, and interpret the results. The book also covers statistical frameworks, the philosophy of statistical modeling, and critical mathematical functions and probability distributions. It requires no programming background--only basic calculus and statistics.

...read moreread less

1,626 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

A new look at the statistical model identification

[...]

Hirotugu Akaike

01 Dec 1974-IEEE Transactions on Automatic Control

TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.

...read moreread less

Abstract: The history of the development of statistical hypothesis testing in time series analysis is reviewed briefly and it is pointed out that the hypothesis testing procedure is not adequately defined as the procedure for statistical model identification. The classical maximum likelihood estimation procedure is reviewed and a new estimate minimum information theoretical criterion (AIC) estimate (MAICE) which is designed for the purpose of statistical identification is introduced. When there are several competing models the MAICE is defined by the model and the maximum likelihood estimates of the parameters which give the minimum of AIC defined by AIC = (-2)log-(maximum likelihood) + 2(number of independently adjusted parameters within the model). MAICE provides a versatile procedure for statistical model identification which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure. The practical utility of MAICE in time series analysis is demonstrated with some numerical examples.

...read moreread less

47,133 citations

Journal Article•DOI•

Estimating the Dimension of a Model

[...]

Gideon Schwarz

01 Mar 1978-Annals of Statistics

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

...read moreread less

Abstract: The problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion. These terms are a valid large-sample criterion beyond the Bayesian context, since they do not depend on the a priori distribution.

...read moreread less

38,681 citations

Book•

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

[...]

Kenneth P. Burnham, David E. Anderson

19 Jun 2013

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).

...read moreread less

Abstract: Introduction * Information and Likelihood Theory: A Basis for Model Selection and Inference * Basic Use of the Information-Theoretic Approach * Formal Inference From More Than One Model: Multi-Model Inference (MMI) * Monte Carlo Insights and Extended Examples * Statistical Theory and Numerical Results * Summary

...read moreread less

36,993 citations

Estimating the dimension of a model

[...]

Gideon Schwarz

01 Jan 2005

...read moreread less

36,760 citations

"Multimodel Inference Understanding ..." refers methods in this paper

...UNDERSTANDING BIC Schwarz (1978) derived the Bayesian information criterion as BIC 2ln( ) log( ) .œ _ K n As usually used one computes BIC for each model and selects the model with the smallest criterion value....
[...]

Proceedings Article•

Information Theory and an Extention of the Maximum Likelihood Principle

[...]

H. Akaike

01 Jan 1973

TL;DR: The classical maximum likelihood principle can be considered to be a method of asymptotic realization of an optimum estimate with respect to a very general information theoretic criterion to provide answers to many practical problems of statistical model fitting.

...read moreread less

Abstract: In this paper it is shown that the classical maximum likelihood principle can be considered to be a method of asymptotic realization of an optimum estimate with respect to a very general information theoretic criterion. This observation shows an extension of the principle to provide answers to many practical problems of statistical model fitting.

...read moreread less

18,539 citations