scispace - formally typeset
Search or ask a question
Author

Colin J. McClean

Bio: Colin J. McClean is an academic researcher from University of York. The author has contributed to research in topics: Land cover & Biodiversity. The author has an hindex of 27, co-authored 70 publications receiving 8874 citations. Previous affiliations of Colin J. McClean include Helmholtz Centre for Environmental Research - UFZ & Agricultural & Applied Economics Association.


Papers
More filters
Journal ArticleDOI
TL;DR: It was found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection and the value of GLM in combination with penalised methods and thresholds when omitted variables are considered in the final interpretation.
Abstract: Collinearity refers to the non independence of predictor variables, usually in a regression-type analysis. It is a common feature of any descriptive ecological data set and can be a problem for parameter estimation because it inflates the variance of regression parameters and hence potentially leads to the wrong identification of relevant predictors in a statistical model. Collinearity is a severe problem when a model is trained on data from one region or time, and predicted to another with a different or unknown structure of collinearity. To demonstrate the reach of the problem of collinearity in ecology, we show how relationships among predictors differ between biomes, change over spatial scales and through time. Across disciplines, different approaches to addressing collinearity problems have been developed, ranging from clustering of predictors, threshold-based pre-selection, through latent variable methods, to shrinkage and regularisation. Using simulated data with five predictor-response relationships of increasing complexity and eight levels of collinearity we compared ways to address collinearity with standard multiple regression and machine-learning approaches. We assessed the performance of each approach by testing its impact on prediction to new data. In the extreme, we tested whether the methods were able to identify the true underlying relationship in a training dataset with strong collinearity by evaluating its performance on a test dataset without any collinearity. We found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection. Our results highlight the value of GLM in combination with penalised methods (particularly ridge) and threshold-based pre-selection when omitted variables are considered in the final interpretation. However, all approaches tested yielded degraded predictions under change in collinearity structure and the ‘folk lore’-thresholds of correlation coefficients between predictor variables of |r| >0.7 was an appropriate indicator for when collinearity begins to severely distort model estimation and subsequent prediction. The use of ecological understanding of the system in pre-analysis variable selection and the choice of the least sensitive statistical approaches reduce the problems of collinearity, but cannot ultimately solve them.

6,199 citations

Journal ArticleDOI
15 Feb 2002-Science
TL;DR: Coral reefs are the most biologically diverse of shallow water marine ecosystems but are being degraded worldwide by human activities and climate warming, and conservation efforts targeted toward them could help avert the loss of tropical reef biodiversity.
Abstract: Coral reefs are the most biologically diverse of shallow water marine ecosystems but are being degraded worldwide by human activities and climate warming. Analyses of the geographic ranges of 3235 species of reef fish, corals, snails, and lobsters revealed that between 7.2% and 53.6% of each taxon have highly restricted ranges, rendering them vulnerable to extinction. Restricted-range species are clustered into centers of endemism, like those described for terrestrial taxa. The 10 richest centers of endemism cover 15.8% of the world's coral reefs (0.012% of the oceans) but include between 44.8 and 54.2% of the restricted-range species. Many occur in regions where reefs are being severely affected by people, potentially leading to numerous extinctions. Threatened centers of endemism are major biodiversity hotspots, and conservation efforts targeted toward them could help avert the loss of tropical reef biodiversity.

1,491 citations

Journal ArticleDOI
TL;DR: In this paper, the authors investigated the impact that the choice of model can have on predictions, identify key reasons why model output may differ and discuss the implications that model uncertainty has for policy-guiding applications.
Abstract: Aim Many attempts to predict the potential range of species rely on environmental niche (or 'bioclimate envelope') modelling, yet the effects of using different niche-based methodologies require further investigation. Here we investigate the impact that the choice of model can have on predictions, identify key reasons why model output may differ and discuss the implications that model uncertainty has for policy-guiding applications. Location The Western Cape of South Africa. Methods We applied nine of the most widely used modelling techniques to model potential distributions under current and predicted future climate for four species (including two subspecies) of Proteaceae. Each model was built using an identical set of five input variables and distribution data for 3996 sampled sites. We compare model predictions by testing agreement between observed and simulated distributions for the present day (using the area under the receiver operating characteristic curve (AUC) and kappa statistics) and by assessing consistency in predictions of range size changes under future climate (using cluster analysis). Results Our analyses show significant differences between predictions from different models, with predicted changes in range size by 2030 differing in both magnitude and direction (e.g. from 92% loss to 322% gain). We explain differences with reference to two characteristics of the modelling techniques: data input requirements (presence/absence vs. presence-only approaches) and assumptions made by each algorithm when extrapolating beyond the range of data used to build the model. The effects of these factors should be carefully considered when using this modelling approach to predict species ranges. Main Conclusions We highlight an important source of uncertainty in assessments of the impacts of climate change on biodiversity and emphasize that model predictions should be interpreted in policy-guiding applications along with a full appreciation of uncertainty.

895 citations

Journal ArticleDOI
TL;DR: An estimate of the costs of a global MPA network is provided, based on a survey of the running costs of 83 MPAs worldwide, that a global network meeting the World Parks Congress target of conserving 20-30% of the world's seas might cost between 5 and 19 billion US dollars annually to run and would probably create around one million jobs.
Abstract: Declines in marine harvests, wildlife, and habitats have prompted calls at both the 2002 World Summit on Sustainable Development and the 2003 World Parks Congress for the establishment of a global system of marine protected areas (MPAs). MPAs that restrict fishing and other human activities conserve habitats and populations and, by exporting biomass, may sustain or increase yields of nearby fisheries. Here we provide an estimate of the costs of a global MPA network, based on a survey of the running costs of 83 MPAs worldwide. Annual running costs per unit area spanned six orders of magnitude, and were higher in MPAs that were smaller, closer to coasts, and in high-cost, developed countries. Models extrapolating these findings suggest that a global MPA network meeting the World Parks Congress target of conserving 20–30% of the world's seas might cost between $5 billion and $19 billion annually to run and would probably create around one million jobs. Although substantial, gross network costs are less than current government expenditures on harmful subsidies to industrial fisheries. They also ignore potential private gains from improved fisheries and tourism and are dwarfed by likely social gains from increasing the sustainability of fisheries and securing vital ecosystem services.

321 citations

Journal ArticleDOI
10 Jan 2011-PLOS ONE
TL;DR: This forecast is the first approximation of the potential impacts of climate change on fasciolosis risk in the UK and can be used as a basis for indicating where active disease surveillance should be targeted and where the development of improved mitigation or adaptation measures is likely to bring the greatest benefits.
Abstract: Fasciola hepatica (liver fluke) is a physically and economically devastating parasitic trematode whose rise in recent years has been attributed to climate change. Climate has an impact on the free-living stages of the parasite and its intermediate host Lymnaea truncatula, with the interactions between rainfall and temperature having the greatest influence on transmission efficacy. There have been a number of short term climate driven forecasts developed to predict the following season's infection risk, with the Ollerenshaw index being the most widely used. Through the synthesis of a modified Ollerenshaw index with the UKCP09 fine scale climate projection data we have developed long term seasonal risk forecasts up to 2070 at a 25 km square resolution. Additionally UKCIP gridded datasets at 5 km square resolution from 1970-2006 were used to highlight the climate-driven increase to date. The maps show unprecedented levels of future fasciolosis risk in parts of the UK, with risk of serious epidemics in Wales by 2050. The seasonal risk maps demonstrate the possible change in the timing of disease outbreaks due to increased risk from overwintering larvae. Despite an overall long term increase in all regions of the UK, spatio-temporal variation in risk levels is expected. Infection risk will reduce in some areas and fluctuate greatly in others with a predicted decrease in summer infection for parts of the UK due to restricted water availability. This forecast is the first approximation of the potential impacts of climate change on fasciolosis risk in the UK. It can be used as a basis for indicating where active disease surveillance should be targeted and where the development of improved mitigation or adaptation measures is likely to bring the greatest benefits.

210 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Preface to the Princeton Landmarks in Biology Edition vii Preface xi Symbols used xiii 1.
Abstract: Preface to the Princeton Landmarks in Biology Edition vii Preface xi Symbols Used xiii 1. The Importance of Islands 3 2. Area and Number of Speicies 8 3. Further Explanations of the Area-Diversity Pattern 19 4. The Strategy of Colonization 68 5. Invasibility and the Variable Niche 94 6. Stepping Stones and Biotic Exchange 123 7. Evolutionary Changes Following Colonization 145 8. Prospect 181 Glossary 185 References 193 Index 201

14,171 citations

Journal ArticleDOI
25 Apr 2013-Nature
TL;DR: These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.
Abstract: Dengue is a systemic viral infection transmitted between humans by Aedes mosquitoes. For some patients, dengue is a life-threatening illness. There are currently no licensed vaccines or specific therapeutics, and substantial vector control efforts have not stopped its rapid emergence and global spread. The contemporary worldwide distribution of the risk of dengue virus infection and its public health burden are poorly known. Here we undertake an exhaustive assembly of known records of dengue occurrence worldwide, and use a formal modelling framework to map the global distribution of dengue risk. We then pair the resulting risk map with detailed longitudinal information from dengue cohort studies and population surfaces to infer the public health burden of dengue in 2010. We predict dengue to be ubiquitous throughout the tropics, with local spatial variations in risk influenced strongly by rainfall, temperature and the degree of urbanization. Using cartographic approaches, we estimate there to be 390 million (95% credible interval 284-528) dengue infections per year, of which 96 million (67-136) manifest apparently (any level of disease severity). This infection total is more than three times the dengue burden estimate of the World Health Organization. Stratification of our estimates by country allows comparison with national dengue reporting, after taking into account the probability of an apparent infection being formally reported. The most notable differences are discussed. These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue. We anticipate that they will provide a starting point for a wider discussion about the global impact of this disease and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.

7,238 citations

Journal ArticleDOI
15 Feb 2008-Science
TL;DR: This article developed an ecosystem-specific, multiscale spatial model to synthesize 17 global data sets of anthropogenic drivers of ecological change for 20 marine ecosystems and found that no area is unaffected by human influence and that a large fraction (41%) is strongly affected by multiple drivers.
Abstract: The management and conservation of the world's oceans require synthesis of spatial data on the distribution and intensity of human activities and the overlap of their impacts on marine ecosystems. We developed an ecosystem-specific, multiscale spatial model to synthesize 17 global data sets of anthropogenic drivers of ecological change for 20 marine ecosystems. Our analysis indicates that no area is unaffected by human influence and that a large fraction (41%) is strongly affected by multiple drivers. However, large areas of relatively little human impact remain, particularly near the poles. The analytical process and resulting maps provide flexible tools for regional and global efforts to allocate conservation resources; to implement ecosystem-based management; and to inform marine spatial planning, education, and basic research.

5,365 citations

Journal ArticleDOI
TL;DR: This paper presents a tuning method that uses presence-only data for parameter tuning, and introduces several concepts that improve the predictive accuracy and running time of Maxent and describes a new logistic output format that gives an estimate of probability of presence.
Abstract: Accurate modeling of geographic distributions of species is crucial to various applications in ecology and conservation. The best performing techniques often require some parameter tuning, which may be prohibitively time-consuming to do separately for each species, or unreliable for small or biased datasets. Additionally, even with the abundance of good quality data, users interested in the application of species models need not have the statistical knowledge required for detailed tuning. In such cases, it is desirable to use "default settings", tuned and validated on diverse datasets. Maxent is a recently introduced modeling technique, achieving high predictive accuracy and enjoying several additional attractive properties. The performance of Maxent is influenced by a moderate number of parameters. The first contribution of this paper is the empirical tuning of these parameters. Since many datasets lack information about species absence, we present a tuning method that uses presence-only data. We evaluate our method on independently collected high-quality presence-absence data. In addition to tuning, we introduce several concepts that improve the predictive accuracy and running time of Maxent. We introduce "hinge features" that model more complex relationships in the training data; we describe a new logistic output format that gives an estimate of probability of presence; finally we explore "background sampling" strategies that cope with sample selection bias and decrease model-building time. Our evaluation, based on a diverse dataset of 226 species from 6 regions, shows: 1) default settings tuned on presence-only data achieve performance which is almost as good as if they had been tuned on the evaluation data itself; 2) hinge features substantially improve model performance; 3) logistic output improves model calibration, so that large differences in output values correspond better to large differences in suitability; 4) "target-group" background sampling can give much better predictive performance than random background sampling; 5) random background sampling results in a dramatic decrease in running time, with no decrease in model performance.

5,314 citations

Journal ArticleDOI
TL;DR: Species distribution models (SDMs) as mentioned in this paper are numerical tools that combine observations of species occurrence or abundance with environmental estimates, and are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time.
Abstract: Species distribution models (SDMs) are numerical tools that combine observations of species occurrence or abundance with environmental estimates. They are used to gain ecological and evolutionary insights and to predict distributions across landscapes, sometimes requiring extrapolation in space and time. SDMs are now widely used across terrestrial, freshwater, and marine realms. Differences in methods between disciplines reflect both differences in species mobility and in “established use.” Model realism and robustness is influenced by selection of relevant predictors and modeling method, consideration of scale, how the interplay between environmental and geographic factors is handled, and the extent of extrapolation. Current linkages between SDM practice and ecological theory are often weak, hindering progress. Remaining challenges include: improvement of methods for modeling presence-only data and for model selection and evaluation; accounting for biotic interactions; and assessing model uncertainty.

5,076 citations