Author

Ricardo Scachetti-Pereira

Bio: Ricardo Scachetti-Pereira is an academic researcher from the University of Kansas. The author has contributed to research on the topics of environmental niche modelling and ecological niche. The author has an h-index of 5 and has co-authored 5 publications receiving 6,891 citations.

Papers

Journal Article•DOI•

Novel methods improve prediction of species' distributions from occurrence data

Jane Elith¹, Catherine H. Graham², Robert P. Anderson³, Miroslav Dudík⁴, Simon Ferrier, Antoine Guisan⁵, Robert J. Hijmans⁶, Falk Huettmann⁷, John R. Leathwick⁸, Anthony Lehmann, Jin Li⁹, Lúcia G. Lohmann¹⁰, Bette A. Loiselle¹¹, Glenn Manion, Craig Moritz⁶, Miguel Nakamura¹², Yoshinori Nakazawa¹³, Jacob C. M. Mc Overton¹⁴, A. Townsend Peterson¹³, Steven J. Phillips¹⁵, Karen Richardson¹⁶, Ricardo Scachetti-Pereira, Robert E. Schapire, Jorge Soberón¹³, Stephen E. Williams¹⁷, Mary S. Wisz, Niklaus E. Zimmermann¹⁸

University of Melbourne¹, Stony Brook University², City University of New York³, Princeton University⁴, University of Lausanne⁵, University of California, Berkeley⁶, University of Alaska Fairbanks⁷, National Institute of Water and Atmospheric Research⁸, Commonwealth Scientific and Industrial Research Organisation⁹, University of São Paulo¹⁰, University of Missouri¹¹, Consejo Nacional de Ciencia y Tecnología¹², University of Kansas¹³, Landcare Research¹⁴, AT&T¹⁵, McGill University¹⁶, James Cook University¹⁷, Swiss Federal Institute for Forest, Snow and Landscape Research¹⁸

01 Apr 2006-Ecography

TL;DR: This work compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date and found that presence-only data were effective for modelling species' distributions for many species and regions.

Abstract: Prediction of species' distributions is central to diverse applications in ecology, evolution and conservation science. There is increasing electronic access to vast sets of occurrence records in museums and herbaria, yet little effective guidance on how best to use this information in the context of numerous approaches for modelling distributions. To meet this need, we compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date. We used presence-only data to fit models, and independent presence-absence data to evaluate the predictions. Along with well-established modelling methods such as generalised additive models and GARP and BIOCLIM, we explored methods that either have been developed recently or have rarely been applied to modelling species' distributions. These include machine-learning methods and community models, both of which have features that may make them particularly well suited to noisy or sparse information, as is typical of species' occurrence data. Presence-only data were effective for modelling species' distributions for many species and regions. The novel methods consistently outperformed more established methods. The results of our analysis are promising for the use of data from museums and herbaria, especially as methods suited to the noise inherent in such data improve.

7,589 citations

Journal Article•DOI•

Predicting invasions of North American basses in Japan using native range data and a genetic algorithm

Kei'ichiro Iguchi, Keiichi Matsuura, Kristina M. McNyset, A. Townsend Peterson, Ricardo Scachetti-Pereira, Katherine A. Powers, David Vieglais, Edward O. Wiley, Taiga Yodo

01 Jul 2004-Transactions of The American Fisheries Society

TL;DR: This paper applies ecological niche modeling with the genetic algorithm for rule-set prediction (GARP) to predict the potential distributions of largemouth bass and smallmouth bass in Japan, and finds that the predictions were statistically significant for both species.

Abstract: Largemouth bass Micropterus salmoides and smallmouth bass M. dolomieu have been introduced into freshwater habitats in Japan, with potentially serious consequences for native fish populations. In this paper we apply the technique of ecological niche modeling using the genetic algorithm for rule-set prediction (GARP) to predict the potential distributions of these two species in Japan. This algorithm constructs a niche model based on point occurrence records and ecological coverages. The model can be visualized in geographic space, yielding a prediction of potential geographic range. The model can then be tested by determining how well independent point occurrence data are predicted according to the criteria of sensitivity and specificity provided by receiver–operator curve analysis. We ground-truthed GARP's ability to forecast the geographic occurrence of each species in its native range. The predictions were statistically significant for both species (P < 0.001). We projected the niche models on...

108 citations
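
The evaluation step described in the abstract, scoring predictions by sensitivity and specificity, can be sketched as follows. This is a minimal illustration and not the authors' code: the function name `sensitivity_specificity` and the presence/absence values are hypothetical, and it scores a single thresholded prediction rather than a full receiver-operator curve.

```python
def sensitivity_specificity(observed, predicted):
    """Return (sensitivity, specificity) for paired binary presence/absence lists.

    observed and predicted hold 1 (present) or 0 (absent) for the same sites.
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    pairs = list(zip(observed, predicted))
    tp = sum(1 for o, p in pairs if o == 1 and p == 1)  # presences predicted present
    fn = sum(1 for o, p in pairs if o == 1 and p == 0)  # presences missed (omission)
    tn = sum(1 for o, p in pairs if o == 0 and p == 0)  # absences predicted absent
    fp = sum(1 for o, p in pairs if o == 0 and p == 1)  # absences predicted present (commission)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one observed presence and one observed absence")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical independent test sites (not data from the paper).
observed = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
sens, spec = sensitivity_specificity(observed, predicted)
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

Under these definitions the omission rate is one minus sensitivity; a receiver-operator curve would repeat the calculation across a range of thresholds on a continuous model output.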

Journal Article•DOI•

Distribution of Capybaras in an Agroecosystem, Southeastern Brazil, Based on Ecological Niche Modeling

Katia Maria Paschoaletto Micchi de Barros Ferraz¹, Townsend Peterson², Ricardo Scachetti-Pereira², Carlos Alberto Vettorazzi¹, Luciano Martins Verdade¹

University of São Paulo¹, University of Kansas²

12 Feb 2009-Journal of Mammalogy

TL;DR: In this paper, the authors developed predictive models of ecological and spatial distributions of capybaras (Hydrochoerus hydrochaeris) using ecological niche modeling, and found that most occurrences of the animals were in flat areas with water bodies surrounded by sugarcane and pasture.

Abstract: Southeastern Brazil has seen dramatic landscape modifications in recent decades, due to expansion of agriculture and urban areas; these changes have influenced the distribution and abundance of vertebrates. We developed predictive models of ecological and spatial distributions of capybaras (Hydrochoerus hydrochaeris) using ecological niche modeling. Most occurrences of capybaras were in flat areas with water bodies surrounded by sugarcane and pasture. More than 75% of the Piracicaba River basin was estimated as potentially habitable by capybara. The models had low omission error (2.3–3.4%), but higher commission error (91.0–98.5%); these “model failures” seem to be more related to local habitat characteristics than to spatial ones. The potential distribution of capybaras in the basin is associated with anthropogenic habitats, particularly with intensive land use for agriculture.

31 citations

Journal Article•DOI•

Assessment of invasive potential of Homalodisca coagulata in western North America and South America

A. Townsend Peterson¹, Ricardo Scachetti-Pereira, Daniel A. Kluza¹

University of Kansas¹

01 Jan 2003-Biota Neotropica

TL;DR: The potential of Homalodisca coagulata to invade South America is a question of economic importance, given its potential impact as a disease vector for several crops.

Abstract: The potential of Homalodisca coagulata to invade South America is a question of economic importance, given its potential impact as a disease vector for several crops. We developed ecological niche models for the species on its native geographic distribution in the southeastern United States; we tested the predictivity of the models both on the native distributional area and via projections to California, where the species has long been present as an invasive species. In both cases, tests indicated high statistical significance of predictions. Projection of models to South America indicated little possibility of invasion of southeastern Brazil, where citrus diseases were of concern. However, all models agree in predicting great risk of establishment in the wine-growing regions of northern Argentina and extreme southern Brazil; great precaution is thus to be recommended when any movements of bio-materials are made from infected areas to this region.

23 citations

Journal Article•DOI•

Predicting invasive potential of smooth crotalaria (Crotalaria pallida) in Brazilian national parks based on African records

Rafael Luís Fonseca¹, Paulo R. Guimarães², Sérgio Rodrigues Morbiolo², Ricardo Scachetti-Pereira³, A. Townsend Peterson³

Conservation International¹, State University of Campinas², University of Kansas³

01 May 2006

TL;DR: This work investigated the potential geographic range of the invasive paleotropical weed, smooth crotalaria, in protected natural areas across Brazil and found it appears more likely to occur in open and highly fragmented areas than in extensive closed forests.

Abstract: Alien weed species rank among the most important threats to conservation of biodiversity, making understanding the extent to which protected natural areas are vulnerable to invasion by weeds pivotal in long-term maintenance and conservation of biodiversity. We investigated the potential geographic range of the invasive paleotropical weed, smooth crotalaria, in protected natural areas across Brazil. The ecological niche dimensions of smooth crotalaria in Africa (its putative original distribution) were modeled using a genetic algorithm. Models for the native range and their projections to South America showed good predictive ability when challenged with independent occurrence data. All Brazilian protected natural areas were predicted as highly vulnerable to invasion by this species. However, smooth crotalaria appears more likely to occur in open (savanna-like vegetation, such as cerrado and pantanal) and highly fragmented (Atlantic forest) areas than in extensive closed forests (Amazon). Managemen...

16 citations

Cited by

Journal Article•DOI•

The global distribution and burden of dengue

Samir Bhatt¹, Peter W. Gething¹, Oliver J. Brady¹, Jane P. Messina¹, Andrew Farlow¹, Catherine L. Moyes¹, John M. Drake², John M. Drake¹, John S. Brownstein³, Anne G. Hoen⁴, Osman Sankoh⁵, Osman Sankoh⁶, Monica F. Myers¹, Dylan B. George⁷, Thomas Jaenisch⁶, G. R. William Wint¹, Cameron P. Simmons¹, Thomas W. Scott⁸, Thomas W. Scott⁷, Jeremy Farrar¹, Jeremy Farrar⁹, Simon I. Hay⁷, Simon I. Hay¹

University of Oxford¹, University of Georgia², Boston Children's Hospital³, Dartmouth College⁴, University of the Witwatersrand⁵, Heidelberg University⁶, National Institutes of Health⁷, University of California, Davis⁸, National University of Singapore⁹

25 Apr 2013-Nature

TL;DR: These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.

Abstract: Dengue is a systemic viral infection transmitted between humans by Aedes mosquitoes. For some patients, dengue is a life-threatening illness. There are currently no licensed vaccines or specific therapeutics, and substantial vector control efforts have not stopped its rapid emergence and global spread. The contemporary worldwide distribution of the risk of dengue virus infection and its public health burden are poorly known. Here we undertake an exhaustive assembly of known records of dengue occurrence worldwide, and use a formal modelling framework to map the global distribution of dengue risk. We then pair the resulting risk map with detailed longitudinal information from dengue cohort studies and population surfaces to infer the public health burden of dengue in 2010. We predict dengue to be ubiquitous throughout the tropics, with local spatial variations in risk influenced strongly by rainfall, temperature and the degree of urbanization. Using cartographic approaches, we estimate there to be 390 million (95% credible interval 284-528) dengue infections per year, of which 96 million (67-136) manifest apparently (any level of disease severity). This infection total is more than three times the dengue burden estimate of the World Health Organization. Stratification of our estimates by country allows comparison with national dengue reporting, after taking into account the probability of an apparent infection being formally reported. The most notable differences are discussed. These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue. We anticipate that they will provide a starting point for a wider discussion about the global impact of this disease and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.

7,238 citations

Journal Article•DOI•

Collinearity: a review of methods to deal with it and a simulation study evaluating their performance

Carsten F. Dormann¹, Jane Elith¹, Sven Bacher¹, Carsten M. Buchmann¹, Gudrun Carl¹, Gabriel Carré¹, Jaime Ricardo García Márquez¹, Bernd Gruber¹, Bruno Lafourcade¹, Pedro J. Leitão¹, Tamara Münkemüller¹, Colin J. McClean¹, Patrick E. Osborne¹, Björn Reineking¹, Boris Schröder¹, Andrew K. Skidmore¹, Damaris Zurell¹, Sven Lautenbach¹

Helmholtz Centre for Environmental Research - UFZ¹

01 Jan 2013-Ecography

TL;DR: It was found that methods specifically designed for collinearity, such as latent variable methods and tree-based models, did not outperform the traditional GLM and threshold-based pre-selection. The results highlight the value of GLM in combination with penalised methods and threshold-based pre-selection when omitted variables are considered in the final interpretation.

Abstract: Collinearity refers to the non-independence of predictor variables, usually in a regression-type analysis. It is a common feature of any descriptive ecological data set and can be a problem for parameter estimation because it inflates the variance of regression parameters and hence potentially leads to the wrong identification of relevant predictors in a statistical model. Collinearity is a severe problem when a model is trained on data from one region or time, and predicted to another with a different or unknown structure of collinearity. To demonstrate the reach of the problem of collinearity in ecology, we show how relationships among predictors differ between biomes, change over spatial scales and through time. Across disciplines, different approaches to addressing collinearity problems have been developed, ranging from clustering of predictors, threshold-based pre-selection, through latent variable methods, to shrinkage and regularisation. Using simulated data with five predictor-response relationships of increasing complexity and eight levels of collinearity we compared ways to address collinearity with standard multiple regression and machine-learning approaches. We assessed the performance of each approach by testing its impact on prediction to new data. In the extreme, we tested whether the methods were able to identify the true underlying relationship in a training dataset with strong collinearity by evaluating its performance on a test dataset without any collinearity. We found that methods specifically designed for collinearity, such as latent variable methods and tree-based models, did not outperform the traditional GLM and threshold-based pre-selection. Our results highlight the value of GLM in combination with penalised methods (particularly ridge) and threshold-based pre-selection when omitted variables are considered in the final interpretation. However, all approaches tested yielded degraded predictions under change in collinearity structure and the ‘folk lore’-thresholds of correlation coefficients between predictor variables of |r| > 0.7 was an appropriate indicator for when collinearity begins to severely distort model estimation and subsequent prediction. The use of ecological understanding of the system in pre-analysis variable selection and the choice of the least sensitive statistical approaches reduce the problems of collinearity, but cannot ultimately solve them.

6,199 citations
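
The threshold-based pre-selection and the |r| > 0.7 rule of thumb mentioned in the abstract can be illustrated with a short sketch. It is only one plausible reading: the greedy "keep the first, drop later correlated ones" order, the function names `pearson_r` and `preselect`, and the predictor values are assumptions for illustration, not the procedure or data used in the paper.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def preselect(predictors, threshold=0.7):
    """Greedily keep predictors whose |r| with every kept predictor is <= threshold.

    Predictors are visited in dictionary order, so earlier entries take priority.
    """
    kept = []
    for name, values in predictors.items():
        if all(abs(pearson_r(values, predictors[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical predictor values at six sites.
predictors = {
    "temperature": [10.0, 12.0, 14.0, 16.0, 18.0, 20.0],
    "elevation": [900.0, 800.0, 650.0, 500.0, 420.0, 300.0],  # falls as temperature rises
    "rainfall": [80.0, 120.0, 60.0, 110.0, 70.0, 100.0],
}
kept = preselect(predictors)
print(kept)
```

In practice the choice of which member of a correlated pair to keep is usually guided by ecological understanding rather than by input order, in line with the abstract's closing remark.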

Journal Article•DOI•

Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation

Steven J. Phillips¹, Miroslav Dudík

AT&T¹

01 Apr 2008-Ecography

TL;DR: This paper presents a tuning method that uses presence-only data for parameter tuning, and introduces several concepts that improve the predictive accuracy and running time of Maxent and describes a new logistic output format that gives an estimate of probability of presence.

Abstract: Accurate modeling of geographic distributions of species is crucial to various applications in ecology and conservation. The best performing techniques often require some parameter tuning, which may be prohibitively time-consuming to do separately for each species, or unreliable for small or biased datasets. Additionally, even with the abundance of good quality data, users interested in the application of species models need not have the statistical knowledge required for detailed tuning. In such cases, it is desirable to use "default settings", tuned and validated on diverse datasets. Maxent is a recently introduced modeling technique, achieving high predictive accuracy and enjoying several additional attractive properties. The performance of Maxent is influenced by a moderate number of parameters. The first contribution of this paper is the empirical tuning of these parameters. Since many datasets lack information about species absence, we present a tuning method that uses presence-only data. We evaluate our method on independently collected high-quality presence-absence data. In addition to tuning, we introduce several concepts that improve the predictive accuracy and running time of Maxent. We introduce "hinge features" that model more complex relationships in the training data; we describe a new logistic output format that gives an estimate of probability of presence; finally we explore "background sampling" strategies that cope with sample selection bias and decrease model-building time. Our evaluation, based on a diverse dataset of 226 species from 6 regions, shows: 1) default settings tuned on presence-only data achieve performance which is almost as good as if they had been tuned on the evaluation data itself; 2) hinge features substantially improve model performance; 3) logistic output improves model calibration, so that large differences in output values correspond better to large differences in suitability; 4) "target-group" background sampling can give much better predictive performance than random background sampling; 5) random background sampling results in a dramatic decrease in running time, with no decrease in model performance.

5,314 citations

Journal Article•DOI•

Species Distribution Models: Ecological Explanation and Prediction Across Space and Time

Jane Elith¹, John R. Leathwick²

University of Melbourne¹, National Institute of Water and Atmospheric Research²

06 Feb 2009-Annual Review of Ecology, Evolution, and Systematics

TL;DR: Species distribution models (SDMs) are numerical tools that combine observations of species occurrence or abundance with environmental estimates, and are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time.

Abstract: Species distribution models (SDMs) are numerical tools that combine observations of species occurrence or abundance with environmental estimates. They are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time. SDMs are now widely used across terrestrial, freshwater, and marine realms. Differences in methods between disciplines reflect both differences in species mobility and in “established use.” Model realism and robustness is influenced by selection of relevant predictors and modeling method, consideration of scale, how the interplay between environmental and geographic factors is handled, and the extent of extrapolation. Current linkages between SDM practice and ecological theory are often weak, hindering progress. Remaining challenges include: improvement of methods for modeling presence-only data and for model selection and evaluation; accounting for biotic interactions; and assessing model uncertainty.

5,076 citations

Journal Article•DOI•

A working guide to boosted regression trees

Jane Elith¹, John R. Leathwick², Trevor Hastie³

University of Melbourne¹, National Institute of Water and Atmospheric Research², Stanford University³

01 Jul 2008-Journal of Animal Ecology

TL;DR: This study provides a working guide to boosted regression trees (BRT), an ensemble method for fitting statistical models that differs fundamentally from conventional techniques that aim to fit a single parsimonious model.

Abstract: Summary: (1) Ecologists use statistical models for both explanation and prediction, and need techniques that are flexible enough to express typical features of their data, such as nonlinearities and interactions. (2) This study provides a working guide to boosted regression trees (BRT), an ensemble method for fitting statistical models that differs fundamentally from conventional techniques that aim to fit a single parsimonious model. Boosted regression trees combine the strengths of two algorithms: regression trees (models that relate a response to their predictors by recursive binary splits) and boosting (an adaptive method for combining many simple models to give improved predictive performance). The final BRT model can be understood as an additive regression model in which individual terms are simple trees, fitted in a forward, stagewise fashion. (3) Boosted regression trees incorporate important advantages of tree-based methods, handling different types of predictor variables and accommodating missing data. They have no need for prior data transformation or elimination of outliers, can fit complex nonlinear relationships, and automatically handle interaction effects between predictors. Fitting multiple trees in BRT overcomes the biggest drawback of single tree models: their relatively poor predictive performance. Although BRT models are complex, they can be summarized in ways that give powerful ecological insight, and their predictive performance is superior to most traditional modelling methods. (4) The unique features of BRT raise a number of practical issues in model fitting. We demonstrate the practicalities and advantages of using BRT through a distributional analysis of the short-finned eel (Anguilla australis Richardson), a native freshwater fish of New Zealand. We use a data set of over 13 000 sites to illustrate effects of several settings, and then fit and interpret a model using a subset of the data. We provide code and a tutorial to enable the wider use of BRT by ecologists.

4,787 citations
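
The abstract's description of a BRT model as an additive model of simple trees fitted in a forward, stagewise fashion can be made concrete with a toy sketch. It assumes a single predictor, squared-error loss, and single-split trees (stumps); the names `fit_stump`, `fit_brt`, `predict_brt`, the learning rate, and the data are all hypothetical, and this is not the code or tutorial the authors provide.

```python
def fit_stump(x, residuals):
    """Find the single binary split on x that minimises squared error.

    Assumes x contains at least two distinct values.
    Returns (threshold, mean_left, mean_right).
    """
    best = None
    for threshold in sorted(set(x))[:-1]:
        left = [r for xi, r in zip(x, residuals) if xi <= threshold]
        right = [r for xi, r in zip(x, residuals) if xi > threshold]
        mean_left, mean_right = sum(left) / len(left), sum(right) / len(right)
        sse = sum((r - mean_left) ** 2 for r in left) + sum((r - mean_right) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, mean_left, mean_right)
    return best[1:]


def predict_stump(stump, xi):
    threshold, mean_left, mean_right = stump
    return mean_left if xi <= threshold else mean_right


def fit_brt(x, y, n_trees=200, learning_rate=0.1):
    """Forward stagewise fit: each stump is fitted to the current residuals."""
    intercept = sum(y) / len(y)
    fitted = [intercept] * len(y)
    stumps = []
    for _ in range(n_trees):
        residuals = [yi - fi for yi, fi in zip(y, fitted)]
        stump = fit_stump(x, residuals)
        stumps.append(stump)
        # Shrink each tree's contribution by the learning rate before adding it.
        fitted = [fi + learning_rate * predict_stump(stump, xi) for xi, fi in zip(x, fitted)]
    return intercept, learning_rate, stumps


def predict_brt(model, xi):
    intercept, learning_rate, stumps = model
    return intercept + learning_rate * sum(predict_stump(s, xi) for s in stumps)


# Hypothetical nonlinear (step-like) response along one environmental gradient.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
y = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 4.0, 4.0, 4.0]
model = fit_brt(x, y)

mean_y = sum(y) / len(y)
mse_mean = sum((yi - mean_y) ** 2 for yi in y) / len(y)
mse_brt = sum((yi - predict_brt(model, xi)) ** 2 for xi, yi in zip(x, y)) / len(y)
print(f"mean-only MSE={mse_mean:.4f}, boosted MSE={mse_brt:.4f}")
```

Full BRT implementations grow deeper trees over many predictors and typically fit each tree to a random subsample of the data; the sketch keeps only the stagewise additive structure.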
