scispace - formally typeset
Open AccessJournal ArticleDOI

A statistical explanation of MaxEnt for ecologists

Reads0
Chats0
TLDR
A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.
Abstract
MaxEnt is a program for modelling species distributions from presence-only species records. This paper is written for ecologists and describes the MaxEnt model from a statistical perspective, making explicit links between the structure of the model, decisions required in producing a modelled distribution, and knowledge about the species and the data that might affect those decisions. To begin we discuss the characteristics of presence-only data, highlighting implications for modelling distributions. We particularly focus on the problems of sample bias and lack of information on species prevalence. The keystone of the paper is a new statistical explanation of MaxEnt which shows that the model minimizes the relative entropy between two probability densities (one estimated from the presence data and one, from the landscape) defined in covariate space. For many users, this viewpoint is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts. We then step through a detailed explanation of MaxEnt describing key components (e.g. covariates and features, and definition of the landscape extent), the mechanics of model fitting (e.g. feature selection, constraints and regularization) and outputs. Using case studies for a Banksia species native to south-west Australia and a riverine fish, we fit models and interpret them, exploring why certain choices affect the result and what this means. The fish example illustrates use of the model with vector data for linear river segments rather than raster (gridded) data. Appropriate treatments for survey bias, unprojected data, locally restricted species, and predicting to environments outside the range of the training data are demonstrated, and new capabilities discussed. Online appendices include additional details of the model and the mathematical links between previous explanations and this one, example code and data, and further information on the case studies.

read more

Citations
More filters
Journal ArticleDOI

A practical guide to MaxEnt for modeling species' distributions: what it does, and why inputs and settings matter

TL;DR: A detailed explanation of how MaxEnt works and a prospectus on modeling options are provided to enable users to make informed decisions when preparing data, choosing settings and interpreting output to highlight the need for making biologically motivated modeling decisions.
Journal ArticleDOI

Opening the black box: an open-source release of Maxent

TL;DR: A new open-source release of the Maxent software for modeling species distributions from occurrence records and environmental data is announced, and a new R package for fitting Maxent models using the glmnet package for regularized generalized linear models is described.
Journal ArticleDOI

ENMeval: An R package for conducting spatially independent evaluations and estimating optimal model complexity for Maxent ecological niche models

TL;DR: ENMeval as mentioned in this paper is an R package that creates data sets for k-fold cross-validation using one of several methods for partitioning occurrence data (including options for spatially independent partitions), builds a series of candidate models using Maxent with a variety of user-defined settings and provides multiple evaluation metrics to aid in selecting optimal model settings.
Journal ArticleDOI

Making better Maxent models of species distributions: complexity, overfitting and evaluation

TL;DR: In this paper, the authors integrate solutions to these issues for Maxent models, using the Caribbean spiny pocket mouse, Heteromys anomalus, as an example, by selecting appropriate evaluation data, detecting overfitting and tuning program settings to approximate optimal model complexity.
Journal ArticleDOI

SDMtoolbox: a python-based GIS toolkit for landscape genetic, biogeographic and species distribution model analyses

TL;DR: The toolkit simplifies many GIS analyses required for species distribution modelling and other analyses, alleviating the need for repetitive and time-consuming climate data pre-processing and post-SDM analyses.
References
More filters
Journal ArticleDOI

A new look at the statistical model identification

TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Journal ArticleDOI

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
Journal ArticleDOI

Maximum entropy modeling of species geographic distributions

TL;DR: In this paper, the use of the maximum entropy method (Maxent) for modeling species geographic distributions with presence-only data was introduced, which is a general-purpose machine learning method with a simple and precise mathematical formulation.

Contribution of Working Group II to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change

TL;DR: Drafting Authors: Neil Adger, Pramod Aggarwal, Shardul Agrawala, Joseph Alcamo, Abdelkader Allali, Oleg Anisimov, Nigel Arnell, Michel Boko, Osvaldo Canziani, Timothy Carter, Gino Casassa, Ulisses Confalonieri, Rex Victor Cruz, Edmundo de Alba Alcaraz, William Easterling, Christopher Field, Andreas Fischlin, Blair Fitzharris.
Related Papers (5)