Maximum entropy modeling of species geographic distributions

doi:10.1016/J.ECOLMODEL.2005.03.026

Journal Article•DOI•

Steven J. Phillips¹, Robert P. Anderson²,³, Robert E. Schapire⁴

AT&T Labs¹, City University of New York², American Museum of Natural History³, Princeton University⁴

25 Jan 2006-Ecological Modelling (Elsevier)-Vol. 190, Iss: 3, pp 231-259

TL;DR: This paper introduces the maximum entropy method (Maxent), a general-purpose machine learning method with a simple and precise mathematical formulation, for modeling species geographic distributions with presence-only data.

About: This article was published in Ecological Modelling on 2006-01-25 and is currently open access. It has received 13,120 citations to date. The article focuses on the topics: Environmental niche modelling & Species distribution.

Citations

Journal Article•DOI•

Novel methods improve prediction of species' distributions from occurrence data

Jane Elith¹, Catherine H. Graham², Robert P. Anderson³, Miroslav Dudík⁴, Simon Ferrier, Antoine Guisan⁵, Robert J. Hijmans⁶, Falk Huettmann⁷, John R. Leathwick⁸, Anthony Lehmann, Jin Li⁹, Lúcia G. Lohmann¹⁰, Bette A. Loiselle¹¹, Glenn Manion, Craig Moritz⁶, Miguel Nakamura¹², Yoshinori Nakazawa¹³, Jacob C. M. Mc Overton¹⁴, A. Townsend Peterson¹³, Steven J. Phillips¹⁵, Karen Richardson¹⁶, Ricardo Scachetti-Pereira, Robert E. Schapire, Jorge Soberón¹³, Stephen E. Williams¹⁷, Mary S. Wisz, Niklaus E. Zimmermann¹⁸

University of Melbourne¹, Stony Brook University², City University of New York³, Princeton University⁴, University of Lausanne⁵, University of California, Berkeley⁶, University of Alaska Fairbanks⁷, National Institute of Water and Atmospheric Research⁸, Commonwealth Scientific and Industrial Research Organisation⁹, University of São Paulo¹⁰, University of Missouri¹¹, Consejo Nacional de Ciencia y Tecnología¹², University of Kansas¹³, Landcare Research¹⁴, AT&T¹⁵, McGill University¹⁶, James Cook University¹⁷, Swiss Federal Institute for Forest, Snow and Landscape Research¹⁸

01 Apr 2006-Ecography

TL;DR: This work compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date and found that presence-only data were effective for modelling species' distributions for many species and regions.

Abstract: Prediction of species' distributions is central to diverse applications in ecology, evolution and conservation science. There is increasing electronic access to vast sets of occurrence records in museums and herbaria, yet little effective guidance on how best to use this information in the context of numerous approaches for modelling distributions. To meet this need, we compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date. We used presence-only data to fit models, and independent presence-absence data to evaluate the predictions. Along with well-established modelling methods such as generalised additive models and GARP and BIOCLIM, we explored methods that either have been developed recently or have rarely been applied to modelling species' distributions. These include machine-learning methods and community models, both of which have features that may make them particularly well suited to noisy or sparse information, as is typical of species' occurrence data. Presence-only data were effective for modelling species' distributions for many species and regions. The novel methods consistently outperformed more established methods. The results of our analysis are promising for the use of data from museums and herbaria, especially as methods suited to the noise inherent in such data improve.

7,589 citations

Journal Article•DOI•

Predicting species distribution: offering more than simple habitat models.

Antoine Guisan¹, Wilfried Thuiller²

University of Lausanne¹, University of Évora²

01 Sep 2005-Ecology Letters

TL;DR: An overview of recent advances in species distribution models, and new avenues for incorporating species migration, population dynamics, biotic interactions and community ecology into SDMs at multiple spatial scales are suggested.

Abstract: In the last two decades, interest in species distribution models (SDMs) of plants and animals has grown dramatically. Recent advances in SDMs allow us to potentially forecast anthropogenic effects on patterns of biodiversity at different spatial scales. However, some limitations still preclude the use of SDMs in many theoretical and practical applications. Here, we provide an overview of recent advances in this field, discuss the ecological principles and assumptions underpinning SDMs, and highlight critical limitations and decisions inherent in the construction and evaluation of SDMs. Particular emphasis is given to the use of SDMs for the assessment of climate change impacts and conservation management issues. We suggest new avenues for incorporating species migration, population dynamics, biotic interactions and community ecology into SDMs at multiple spatial scales. Addressing all these issues requires a better integration of SDMs with ecological theory.

5,620 citations

Journal Article•DOI•

Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation

Steven J. Phillips¹, Miroslav Dudík

AT&T¹

01 Apr 2008-Ecography

TL;DR: This paper presents a tuning method that uses presence-only data for parameter tuning, and introduces several concepts that improve the predictive accuracy and running time of Maxent and describes a new logistic output format that gives an estimate of probability of presence.

Abstract: Accurate modeling of geographic distributions of species is crucial to various applications in ecology and conservation. The best performing techniques often require some parameter tuning, which may be prohibitively time-consuming to do separately for each species, or unreliable for small or biased datasets. Additionally, even with the abundance of good quality data, users interested in the application of species models need not have the statistical knowledge required for detailed tuning. In such cases, it is desirable to use "default settings", tuned and validated on diverse datasets. Maxent is a recently introduced modeling technique, achieving high predictive accuracy and enjoying several additional attractive properties. The performance of Maxent is influenced by a moderate number of parameters. The first contribution of this paper is the empirical tuning of these parameters. Since many datasets lack information about species absence, we present a tuning method that uses presence-only data. We evaluate our method on independently collected high-quality presence-absence data. In addition to tuning, we introduce several concepts that improve the predictive accuracy and running time of Maxent. We introduce "hinge features" that model more complex relationships in the training data; we describe a new logistic output format that gives an estimate of probability of presence; finally we explore "background sampling" strategies that cope with sample selection bias and decrease model-building time. 
Our evaluation, based on a diverse dataset of 226 species from 6 regions, shows: 1) default settings tuned on presence-only data achieve performance which is almost as good as if they had been tuned on the evaluation data itself; 2) hinge features substantially improve model performance; 3) logistic output improves model calibration, so that large differences in output values correspond better to large differences in suitability; 4) "target-group" background sampling can give much better predictive performance than random background sampling; 5) random background sampling results in a dramatic decrease in running time, with no decrease in model performance.
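
To make the "hinge features" named in the abstract above more concrete, here is a minimal sketch and not Maxent's actual implementation. It assumes a forward hinge that is zero below a knot and rises linearly to 1 at the variable's maximum; the function name `forward_hinge`, the knot, and the sample values are hypothetical.

```python
def forward_hinge(x, knot, x_max):
    """Hypothetical forward hinge: 0 up to the knot, then linear, reaching 1 at x_max."""
    if x_max <= knot:
        raise ValueError("x_max must be greater than knot")
    if x <= knot:
        return 0.0
    return (x - knot) / (x_max - knot)


# Illustrative values only (for example, annual precipitation in mm).
values = [200.0, 500.0, 800.0, 1100.0]
features = [forward_hinge(v, knot=500.0, x_max=1100.0) for v in values]
print(features)
```

A weighted sum of several such hinges with different knots gives a piecewise-linear response to the variable, which is one way to read the abstract's claim that hinge features "model more complex relationships".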

5,314 citations

Cites methods from "Maximum entropy modeling of species..."

...Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation...
[...]

Journal Article•DOI•

Species Distribution Models: Ecological Explanation and Prediction Across Space and Time

Jane Elith¹, John R. Leathwick²

University of Melbourne¹, National Institute of Water and Atmospheric Research²

06 Feb 2009-Annual Review of Ecology, Evolution, and Systematics

TL;DR: Species distribution models (SDMs) are numerical tools that combine observations of species occurrence or abundance with environmental estimates; they are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time.

Abstract: Species distribution models (SDMs) are numerical tools that combine observations of species occurrence or abundance with environmental estimates. They are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time. SDMs are now widely used across terrestrial, freshwater, and marine realms. Differences in methods between disciplines reflect both differences in species mobility and in “established use.” Model realism and robustness is influenced by selection of relevant predictors and modeling method, consideration of scale, how the interplay between environmental and geographic factors is handled, and the extent of extrapolation. Current linkages between SDM practice and ecological theory are often weak, hindering progress. Remaining challenges include: improvement of methods for modeling presence-only data and for model selection and evaluation; accounting for biotic interactions; and assessing model uncertainty.

5,076 citations

Cites background or methods from "Maximum entropy modeling of species..."

...…Frescino 2002), classification and regression trees and ensembles of trees (random forests: Prasad et al. 2006; boosted regression trees: Elith et al. 2008), genetic algorithms (Stockwell & Peters 1999), support vector machines (Drake et al. 2006), and maximum entropy models (Phillips et al. 2006)....
[...]
...Where analytical methods were once restricted to envelopes and distance measures, comparison of presence records with background or pseudoabsence points is now common (e.g., using GARP, ENFA, MaxEnt, and regression methods)....
[...]
...In machine learning these ideas of model selection and tuning are termed “regularization,” i.e., making the fitted surface more regular or smooth by controlling overfitting (e.g., used in MaxEnt, Phillips et al. 2006)....
[...]
...The key structural features of GLMs (non-normal error distributions, additive terms, nonlinear fitted functions) continue to be useful and are part of many current methods including RSFs (Manly et al. 2002) and maximum entropy models (MaxEnt; Phillips et al. 2006)....
[...]

Journal Article•DOI•

A statistical explanation of MaxEnt for ecologists

Jane Elith¹, Steven J. Phillips², Trevor Hastie³, Miroslav Dudík⁴, Yung En Chee¹, Colin J. Yates⁵

University of Melbourne¹, AT&T Labs², Stanford University³, Yahoo!⁴, Department of Environment and Conservation⁵

01 Jan 2011-Diversity and Distributions

TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.

Abstract: MaxEnt is a program for modelling species distributions from presence-only species records. This paper is written for ecologists and describes the MaxEnt model from a statistical perspective, making explicit links between the structure of the model, decisions required in producing a modelled distribution, and knowledge about the species and the data that might affect those decisions. To begin we discuss the characteristics of presence-only data, highlighting implications for modelling distributions. We particularly focus on the problems of sample bias and lack of information on species prevalence. The keystone of the paper is a new statistical explanation of MaxEnt which shows that the model minimizes the relative entropy between two probability densities (one estimated from the presence data and one, from the landscape) defined in covariate space. For many users, this viewpoint is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts. We then step through a detailed explanation of MaxEnt describing key components (e.g. covariates and features, and definition of the landscape extent), the mechanics of model fitting (e.g. feature selection, constraints and regularization) and outputs. Using case studies for a Banksia species native to south-west Australia and a riverine fish, we fit models and interpret them, exploring why certain choices affect the result and what this means. The fish example illustrates use of the model with vector data for linear river segments rather than raster (gridded) data. Appropriate treatments for survey bias, unprojected data, locally restricted species, and predicting to environments outside the range of the training data are demonstrated, and new capabilities discussed. Online appendices include additional details of the model and the mathematical links between previous explanations and this one, example code and data, and further information on the case studies.

4,621 citations

Cites background or methods from "Maximum entropy modeling of species..."

...MaxEnt (Phillips et al., 2006; Phillips & Dudík, 2008) is one such method and is the focus of this paper....
[...]
...…or coefficients: These are the parameters of the model that weight the contribution of each feature. λ in previous papers*, β in this paper. *Phillips et al. (2006), Phillips & Dudík (2008) tuning parameter λ.…...
[...]
...Note also that the AUC in this case is calculated on presence vs. background data (Phillips et al., 2006)....
[...]
...This was called the “raw” distribution (Phillips et al., 2006), and gave the probability, given the species is present, that it is found at pixel x. Maximizing the entropy of the raw distribution is equivalent to minimizing the relative entropy of f1(z) relative to f(z), so the two formulations…...
[...]
...The MaxEnt model – a short overview: Previous papers have described MaxEnt as estimating a distribution across geographic space (Phillips et al., 2006; Phillips & Dudík, 2008)....
[...]
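
The equivalence quoted in the excerpts above, between maximizing entropy and minimizing relative entropy, has a simple discrete analogue that can be checked by hand. The sketch below is an illustration, not the paper's derivation: it assumes a distribution p over N pixels and a uniform reference distribution u(x) = 1/N, whereas the paper works with densities f1(z) and f(z) in covariate space.

```latex
\begin{aligned}
D(p \,\|\, u) &= \sum_{x} p(x) \log \frac{p(x)}{u(x)}
              = \sum_{x} p(x) \log p(x) + \log N \sum_{x} p(x) \\
              &= \log N - H(p),
\qquad \text{where } H(p) = -\sum_{x} p(x) \log p(x).
\end{aligned}
```

Because log N is a constant, the distribution p that maximizes H(p) under a set of constraints is the same one that minimizes D(p || u) under those constraints.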

References

Journal Article•DOI•

A mathematical theory of communication

Claude E. Shannon

01 Jul 1948-Bell System Technical Journal

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.

Abstract: In this final installment of the paper we consider the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now. To a considerable extent the continuous case can be obtained through a limiting process from the discrete case by dividing the continuum of messages and signals into a large but finite number of small regions and calculating the various parameters involved on a discrete basis. As the size of the regions is decreased these parameters in general approach as limits the proper values for the continuous case. There are, however, a few new effects that appear and also a general change of emphasis in the direction of specialization of the general results to particular cases.

65,425 citations

Journal Article•DOI•

The meaning and use of the area under a receiver operating characteristic (ROC) curve.

James A. Hanley, Barbara J. McNeil

01 Apr 1982-Radiology

TL;DR: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented, and it is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a randomly chosen non-diseased subject.

Abstract: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented. It is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a randomly chosen non-diseased subject. Moreover, this probability of a correct ranking is the same quantity that is estimated by the already well-studied nonparametric Wilcoxon statistic. These two relationships are exploited to (a) provide rapid closed-form expressions for the approximate magnitude of the sampling variability, i.e., standard error that one uses to accompany the area under a smoothed ROC curve, (b) guide in determining the size of the sample required to provide a sufficiently reliable estimate of this area, and (c) determine how large sample sizes should be to ensure that one can statistically detect difference...
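
The ranking interpretation of the area under the ROC curve described in this abstract can be computed directly by comparing every (diseased, non-diseased) pair. The sketch below is illustrative only: the function name `auc_by_ranking` and the scores are hypothetical, and ties are counted as half a correct ranking, as in the Wilcoxon (Mann-Whitney) statistic.

```python
def auc_by_ranking(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as 1/2."""
    if not pos_scores or not neg_scores:
        raise ValueError("need at least one score in each group")
    correct = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos_scores) * len(neg_scores))


# Hypothetical suspicion scores for diseased and non-diseased subjects.
diseased = [0.9, 0.8, 0.4]
healthy = [0.7, 0.3, 0.4, 0.1]
auc = auc_by_ranking(diseased, healthy)
print(auc)
```

The pairwise loop costs time proportional to the product of the two group sizes; it is written this way to mirror the probabilistic definition, not for efficiency.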

19,398 citations

"Maximum entropy modeling of species..." refers methods in this paper

...ROC analysis was developed in signal processing and is widely used in clinical medicine (Hanley and McNeil, 1982, 1983; Zweig and Campbell, 1993)....
[...]
...Each partition was created by randomly selecting 70% of the occurrence localities as training data, with the remaining 30% reserved for testing the resulting models....
[...]
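
The random 70%/30% partition mentioned in the excerpt above can be sketched as follows. This is an assumption-laden illustration, not the authors' code: the function name `split_localities`, the fixed seed, and the coordinate values are all hypothetical.

```python
import random


def split_localities(localities, train_fraction=0.7, seed=0):
    """Randomly partition occurrence localities into training and test sets."""
    rng = random.Random(seed)      # fixed seed so the partition is repeatable
    shuffled = list(localities)    # copy, leaving the caller's list untouched
    rng.shuffle(shuffled)
    n_train = round(train_fraction * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]


# Hypothetical occurrence localities as (longitude, latitude) pairs.
points = [(i * 0.1, -i * 0.2) for i in range(20)]
train, test = split_localities(points)
print(len(train), len(test))
```

Calling the function with different seeds yields different random partitions of the same localities, which matches the excerpt's reference to "each partition".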

Journal Article•DOI•

Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.

Elizabeth R. DeLong¹, David M. DeLong¹, Daniel L. Clarke-Pearson

Quintiles¹

01 Sep 1988-Biometrics

TL;DR: A nonparametric approach to the analysis of areas under correlated ROC curves is presented, by using the theory on generalized U-statistics to generate an estimated covariance matrix.

Abstract: Methods of evaluating and comparing the performance of diagnostic tests are of increasing importance as new tests are developed and marketed. When a test is based on an observed variable that lies on a continuous or graded scale, an assessment of the overall value of the test can be made through the use of a receiver operating characteristic (ROC) curve. The curve is constructed by varying the cutpoint used to determine which values of the observed variable will be considered abnormal and then plotting the resulting sensitivities against the corresponding false positive rates. When two or more empirical curves are constructed based on tests performed on the same individuals, statistical analysis on differences between curves must take into account the correlated nature of the data. This paper presents a nonparametric approach to the analysis of areas under correlated ROC curves, by using the theory on generalized U-statistics to generate an estimated covariance matrix.

16,496 citations

"Maximum entropy modeling of species..." refers methods in this paper

...It uses a non-parametric test (DeLong et al., 1988) to determine whether one prediction is significantly better than another when using correlated samples (i.e., with both predictions evaluated on the same test instances), and reports the result as a χ² statistic and corresponding p value....
[...]

Journal Article•DOI•

Information Theory and Statistical Mechanics. II

E. T. Jaynes¹

Stanford University¹

15 Oct 1957-Physical Review

TL;DR: In this article, the authors consider statistical mechanics as a form of statistical inference rather than as a physical theory, and show that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle.

Abstract: Information theory provides a constructive criterion for setting up probability distributions on the basis of partial knowledge, and leads to a type of statistical inference which is called the maximum-entropy estimate. It is the least biased estimate possible on the given information; i.e., it is maximally noncommittal with regard to missing information. If one considers statistical mechanics as a form of statistical inference rather than as a physical theory, it is found that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle. In the resulting "subjective statistical mechanics," the usual rules are thus justified independently of any physical argument, and in particular independently of experimental verification; whether or not the results agree with experiment, they still represent the best estimates that could have been made on the basis of the information available. It is concluded that statistical mechanics need not be regarded as a physical theory dependent for its validity on the truth of additional assumptions not contained in the laws of mechanics (such as ergodicity, metric transitivity, equal a priori probabilities, etc.). Furthermore, it is possible to maintain a sharp distinction between its physical and statistical aspects. The former consists only of the correct enumeration of the states of a system and their properties; the latter is a straightforward example of statistical inference.
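
As a small worked instance of the maximum-entropy estimate described in this abstract, the sketch below finds the highest-entropy distribution over the faces of a die whose mean is constrained to 4.5. It relies on the standard result that the solution has the exponential (Gibbs) form p_i ∝ exp(λ·v_i), with λ chosen to satisfy the constraint. The function name `maxent_with_mean`, the bisection bracket of [-50, 50], and the die example are assumptions made for illustration.

```python
import math


def maxent_with_mean(values, target_mean, tol=1e-12):
    """Maximum-entropy probabilities over `values` subject to a fixed mean."""
    if not min(values) < target_mean < max(values):
        raise ValueError("target_mean must lie strictly inside the value range")

    def gibbs(lam):
        # Subtract the largest exponent before exponentiating to avoid overflow.
        shift = max(lam * v for v in values)
        weights = [math.exp(lam * v - shift) for v in values]
        total = sum(weights)
        return [w / total for w in weights]

    def mean_under(lam):
        return sum(p * v for p, v in zip(gibbs(lam), values))

    # The constrained mean increases with lam, so bisection finds the multiplier.
    lo, hi = -50.0, 50.0  # assumed bracket; wide enough for small integer values
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if mean_under(mid) < target_mean:
            lo = mid
        else:
            hi = mid
    lam = (lo + hi) / 2.0
    return lam, gibbs(lam)


faces = [1, 2, 3, 4, 5, 6]
lam, probs = maxent_with_mean(faces, target_mean=4.5)
print(round(lam, 4), [round(p, 4) for p in probs])
```

With a target mean of 3.5 the multiplier is essentially zero and the result is the uniform distribution, which is the "maximally noncommittal" answer when the constraint adds no information.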

12,099 citations

"Maximum entropy modeling of species..." refers background in this paper

...Its origins lie in statistical mechanics (Jaynes, 1957), and it remains an active area of research with an Annual Conference, Maximum Entropy and Bayesian Methods, that explores applications in diverse areas such as astronomy, portfolio optimization, image reconstruction, statistical physics and signal processing....
[...]
...E.T. Jaynes gave a general answer to this question: the best approach is to ensure that the approximation satisfies any constraints on the unknown distribution that we are aware of, and that subject to those constraints, the distribution should have maximum entropy (Jaynes, 1957)....
[...]

Journal Article•DOI•

Extinction risk from climate change

Chris D. Thomas¹, Alison Cameron¹, Rhys E. Green²,³, Michel Bakkenes, Linda J. Beaumont⁴, Yvonne C. Collingham⁵, Barend F.N. Erasmus⁶, Marinez Ferreira de Siqueira, Alan Grainger¹, Lee Hannah⁷, Lesley Hughes⁴, Brian Huntley⁵, Albert S. van Jaarsveld⁸, Guy F. Midgley, Lera Miles¹,⁹, Miguel A. Ortega-Huerta¹⁰, A. Townsend Peterson¹¹, Oliver L. Phillips¹, Stephen E. Williams¹²

University of Leeds¹, University of Cambridge², Royal Society for the Protection of Birds³, Macquarie University⁴, Durham University⁵, University of the Witwatersrand⁶, Conservation International⁷, Stellenbosch University⁸, World Conservation Monitoring Centre⁹, National Autonomous University of Mexico¹⁰, University of Kansas¹¹, James Cook University¹²

08 Jan 2004-Nature

TL;DR: Estimates of extinction risks for sample regions that cover some 20% of the Earth's terrestrial surface show the importance of rapid implementation of technologies to decrease greenhouse gas emissions and strategies for carbon sequestration.

Abstract: Climate change over the past approximately 30 years has produced numerous shifts in the distributions and abundances of species and has been implicated in one species-level extinction. Using projections of species' distributions for future climate scenarios, we assess extinction risks for sample regions that cover some 20% of the Earth's terrestrial surface. Exploring three approaches in which the estimated probability of extinction shows a power-law relationship with geographical range size, we predict, on the basis of mid-range climate-warming scenarios for 2050, that 15-37% of species in our sample of regions and taxa will be 'committed to extinction'. When the average of the three methods and two dispersal scenarios is taken, minimal climate-warming scenarios produce lower projections of species committed to extinction (approximately 18%) than mid-range (approximately 24%) and maximum-change (approximately 35%) scenarios. These estimates show the importance of rapid implementation of technologies to decrease greenhouse gas emissions and strategies for carbon sequestration.

7,089 citations

"Maximum entropy modeling of species..." refers background in this paper

...This is important for applications such as invasive-species management (e.g., Peterson and Robins, 2003) and predicting the impact of climate change (e.g., Thomas et al., 2004)....
[...]