scispace - formally typeset
Search or ask a question
Author

Elena N. Ieno

Bio: Elena N. Ieno is an academic researcher from University of Aberdeen. The author has contributed to research in topics: Linear regression & Poisson distribution. The author has an hindex of 19, co-authored 37 publications receiving 19589 citations.

Papers
More filters
Book
11 Apr 2011
TL;DR: In this paper, the authors apply additive mixed modelling on phyoplankton time series data and show that the additive model can be used to estimate the age distribution of small cetaceans.
Abstract: Limitations of linear regression applied on ecological data. - Things are not always linear additive modelling. - Dealing with hetergeneity. - Mixed modelling for nested data. - Violation of independence - temporal data. - Violation of independence spatial data. - Generalised linear modelling and generalised additive modelling. - Generalised estimation equations. - GLMM and GAMM. - Estimating trends for Antarctic birds in relation to climate change. - Large-scale impacts of land-use change in a Scottish farming catchment. - Negative binomial GAM and GAMM to analyse amphibian road killings. - Additive mixed modelling applied on deep-sea plagic bioluminescent organisms. - Additive mixed modelling applied on phyoplankton time series data. - Mixed modelling applied on American Fouldbrood affecting honey bees larvae. - Three-way nested data for age determination techniques applied to small cetaceans. - GLMM applied on the spatial distribution of koalas in a fragmented landscape. - GEE and GLMM applied on binomial Badger activity data.

12,477 citations

Journal ArticleDOI
TL;DR: A protocol for data exploration is provided; current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables are discussed; and advice on how to address these problems when they arise is provided.
Abstract: Summary 1. While teaching statistics to ecologists, the lead authors of this paper have noticed common statistical problems. If a random sample of their work (including scientific papers) produced before doing these courses were selected, half would probably contain violations of the underlying assumptions of the statistical techniques employed. 2. Some violations have little impact on the results or ecological conclusions; yet others increase type I or type II errors, potentially resulting in wrong ecological conclusions. Most of these violations can be avoided by applying better data exploration. These problems are especially troublesome in applied ecology, where management and policy decisions are often at stake. 3. Here, we provide a protocol for data exploration; discuss current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables; and provide advice on how to address these problems when they arise. We also address misconceptions about normality, and provide advice on data transformations. 4. Data exploration avoids type I and type II errors, among other problems, thereby reducing the chance of making wrong ecological conclusions and poor recommendations. It is therefore essential for good quality management and policy based on statistical analyses.

5,894 citations

Book
03 May 2007
TL;DR: In this paper, the effects of rice farming on aquatic birds with mixed modelling were investigated using additive and generalised additive modeling and univariate methods to analyse abundance of decapod larvae.
Abstract: Introduction.- Data management and software.- Advice for teachers.- Exploration.- Linear regression.- Generalised linear modelling.- Additive and generalised additive modelling.- Introduction to mixed modelling.- Univariate tree models.- Measures of association.- Ordination--first encounter.- Principal component analysis and redundancy analysis.- Correspondence analysis and canonical correspondence analysis.- Introduction to discriminant analysis.- Principal coordinate analysis and non-metric multidimensional scaling.- Time series analysis--Introduction.- Common trends and sudden changes.- Analysis and modelling lattice data.- Spatially continuous data analysis and modelling.- Univariate methods to analyse abundance of decapod larvae.- Analysing presence and absence data for flatfish distribution in the Tagus estuary, Portugual.- Crop pollination by honeybees in an Argentinean pampas system using additive mixed modelling.- Investigating the effects of rice farming on aquatic birds with mixed modelling.- Classification trees and radar detection of birds for North Sea wind farms.- Fish stock identification through neural network analysis of parasite fauna.- Monitoring for change: using generalised least squares, nonmetric multidimensional scaling, and the Mantel test on western Montana grasslands.- Univariate and multivariate analysis applied on a Dutch sandy beach community.- Multivariate analyses of South-American zoobenthic species--spoilt for choice.- Principal component analysis applied to harbour porpoise fatty acid data.- Multivariate analysis of morphometric turtle data--size and shape.- Redundancy analysis and additive modelling applied on savanna tree data.- Canonical correspondence analysis of lowland pasture vegetation in the humid tropics of Mexico.- Estimating common trends in Portuguese fisheries landings.- Common trends in demersal communities on the Newfoundland-Labrador Shelf.- Sea level change and salt marshes in the Wadden Sea: a time series analysis.- Time series analysis of Hawaiian waterbirds.- Spatial modelling of forest community features in the Volzhsko-Kamsky reserve.

1,788 citations

Journal ArticleDOI
TL;DR: A 10‐step protocol to streamline analysis of data that will enhance understanding of the data, the statistical models and the results, and optimize communication with the reader with respect to both the procedure and the outcomes is offered.
Abstract: Summary Scientific investigation is of value only insofar as relevant results are obtained and communicated, a task that requires organizing, evaluating, analysing and unambiguously communicating the significance of data. In this context, working with ecological data, reflecting the complexities and interactions of the natural world, can be a challenge. Recent innovations for statistical analysis of multifaceted interrelated data make obtaining more accurate and meaningful results possible, but key decisions of the analyses to use, and which components to present in a scientific paper or report, may be overwhelming. We offer a 10-step protocol to streamline analysis of data that will enhance understanding of the data, the statistical models and the results, and optimize communication with the reader with respect to both the procedure and the outcomes. The protocol takes the investigator from study design and organization of data (formulating relevant questions, visualizing data collection, data exploration, identifying dependency), through conducting analysis (presenting, fitting and validating the model) and presenting output (numerically and visually), to extending the model via simulation. Each step includes procedures to clarify aspects of the data that affect statistical analysis, as well as guidelines for written presentation. Steps are illustrated with examples using data from the literature. Following this protocol will reduce the organization, analysis and presentation of what may be an overwhelming information avalanche into sequential and, more to the point, manageable, steps. It provides guidelines for selecting optimal statistical tools to assess data relevance and significance, for choosing aspects of the analysis to include in a published report and for clearly communicating information.

501 citations

01 Jan 2009
TL;DR: Tienda online donde Comprar Mixed Effects Models and Extensions in Ecology with R al precio 66,45 € de Zuur, Alain F.
Abstract: Tienda online donde Comprar Mixed Effects Models and Extensions in Ecology with R al precio 66,45 € de Zuur, Alain F. | Ieno, Elena N. | Walker, Neil J. | Saveliev, Anatoly A. | Smith, Graham M., tienda de Libros de Medicina, Libros de Biologia - Ecologia

376 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: It was found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection and the value of GLM in combination with penalised methods and thresholds when omitted variables are considered in the final interpretation.
Abstract: Collinearity refers to the non independence of predictor variables, usually in a regression-type analysis. It is a common feature of any descriptive ecological data set and can be a problem for parameter estimation because it inflates the variance of regression parameters and hence potentially leads to the wrong identification of relevant predictors in a statistical model. Collinearity is a severe problem when a model is trained on data from one region or time, and predicted to another with a different or unknown structure of collinearity. To demonstrate the reach of the problem of collinearity in ecology, we show how relationships among predictors differ between biomes, change over spatial scales and through time. Across disciplines, different approaches to addressing collinearity problems have been developed, ranging from clustering of predictors, threshold-based pre-selection, through latent variable methods, to shrinkage and regularisation. Using simulated data with five predictor-response relationships of increasing complexity and eight levels of collinearity we compared ways to address collinearity with standard multiple regression and machine-learning approaches. We assessed the performance of each approach by testing its impact on prediction to new data. In the extreme, we tested whether the methods were able to identify the true underlying relationship in a training dataset with strong collinearity by evaluating its performance on a test dataset without any collinearity. We found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold-based pre-selection. Our results highlight the value of GLM in combination with penalised methods (particularly ridge) and threshold-based pre-selection when omitted variables are considered in the final interpretation. However, all approaches tested yielded degraded predictions under change in collinearity structure and the ‘folk lore’-thresholds of correlation coefficients between predictor variables of |r| >0.7 was an appropriate indicator for when collinearity begins to severely distort model estimation and subsequent prediction. The use of ecological understanding of the system in pre-analysis variable selection and the choice of the least sensitive statistical approaches reduce the problems of collinearity, but cannot ultimately solve them.

6,199 citations

Journal ArticleDOI
TL;DR: A protocol for data exploration is provided; current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables are discussed; and advice on how to address these problems when they arise is provided.
Abstract: Summary 1. While teaching statistics to ecologists, the lead authors of this paper have noticed common statistical problems. If a random sample of their work (including scientific papers) produced before doing these courses were selected, half would probably contain violations of the underlying assumptions of the statistical techniques employed. 2. Some violations have little impact on the results or ecological conclusions; yet others increase type I or type II errors, potentially resulting in wrong ecological conclusions. Most of these violations can be avoided by applying better data exploration. These problems are especially troublesome in applied ecology, where management and policy decisions are often at stake. 3. Here, we provide a protocol for data exploration; discuss current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables; and provide advice on how to address these problems when they arise. We also address misconceptions about normality, and provide advice on data transformations. 4. Data exploration avoids type I and type II errors, among other problems, thereby reducing the chance of making wrong ecological conclusions and poor recommendations. It is therefore essential for good quality management and policy based on statistical analyses.

5,894 citations

Journal ArticleDOI
TL;DR: The glmmTMB package fits many types of GLMMs and extensions, including models with continuously distributed responses, but here the authors focus on count responses and its ability to estimate the Conway-Maxwell-Poisson distribution parameterized by the mean is unique.
Abstract: Count data can be analyzed using generalized linear mixed models when observations are correlated in ways that require random effects However, count data are often zero-inflated, containing more zeros than would be expected from the typical error distributions We present a new package, glmmTMB, and compare it to other R packages that fit zero-inflated mixed models The glmmTMB package fits many types of GLMMs and extensions, including models with continuously distributed responses, but here we focus on count responses glmmTMB is faster than glmmADMB, MCMCglmm, and brms, and more flexible than INLA and mgcv for zero-inflated modeling One unique feature of glmmTMB (among packages that fit zero-inflated mixed models) is its ability to estimate the Conway-Maxwell-Poisson distribution parameterized by the mean Overall, its most appealing features for new users may be the combination of speed, flexibility, and its interface’s similarity to lme4

4,497 citations

Posted Content
TL;DR: In this paper, the authors provide a unified and comprehensive theory of structural time series models, including a detailed treatment of the Kalman filter for modeling economic and social time series, and address the special problems which the treatment of such series poses.
Abstract: In this book, Andrew Harvey sets out to provide a unified and comprehensive theory of structural time series models. Unlike the traditional ARIMA models, structural time series models consist explicitly of unobserved components, such as trends and seasonals, which have a direct interpretation. As a result the model selection methodology associated with structural models is much closer to econometric methodology. The link with econometrics is made even closer by the natural way in which the models can be extended to include explanatory variables and to cope with multivariate time series. From the technical point of view, state space models and the Kalman filter play a key role in the statistical treatment of structural time series models. The book includes a detailed treatment of the Kalman filter. This technique was originally developed in control engineering, but is becoming increasingly important in fields such as economics and operations research. This book is concerned primarily with modelling economic and social time series, and with addressing the special problems which the treatment of such series poses. The properties of the models and the methodological techniques used to select them are illustrated with various applications. These range from the modellling of trends and cycles in US macroeconomic time series to to an evaluation of the effects of seat belt legislation in the UK.

4,252 citations

Journal ArticleDOI
02 Apr 2015-Nature
TL;DR: A terrestrial assemblage database of unprecedented geographic and taxonomic coverage is analysed to quantify local biodiversity responses to land use and related changes and shows that in the worst-affected habitats, pressures reduce within-sample species richness by an average of 76.5%, total abundance by 39.5% and rarefaction-based richness by 40.3%.
Abstract: Human activities, especially conversion and degradation of habitats, are causing global biodiversity declines. How local ecological assemblages are responding is less clear--a concern given their importance for many ecosystem functions and services. We analysed a terrestrial assemblage database of unprecedented geographic and taxonomic coverage to quantify local biodiversity responses to land use and related changes. Here we show that in the worst-affected habitats, these pressures reduce within-sample species richness by an average of 76.5%, total abundance by 39.5% and rarefaction-based richness by 40.3%. We estimate that, globally, these pressures have already slightly reduced average within-sample richness (by 13.6%), total abundance (10.7%) and rarefaction-based richness (8.1%), with changes showing marked spatial variation. Rapid further losses are predicted under a business-as-usual land-use scenario; within-sample richness is projected to fall by a further 3.4% globally by 2100, with losses concentrated in biodiverse but economically poor countries. Strong mitigation can deliver much more positive biodiversity changes (up to a 1.9% average increase) that are less strongly related to countries' socioeconomic status.

2,532 citations