Journal Article

A new statistical approach for assessing similarity of species composition with incidence and abundance data

TLDR
This work provides a probabilistic derivation for the classic, incidence-based forms of the Jaccard and Sorensen indices of compositional similarity and proposes estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence-based or abundance-based sample data.
Abstract
The classic Jaccard and Sorensen indices of compositional similarity (and other indices that depend upon the same variables) are notoriously sensitive to sample size, especially for assemblages with numerous rare species. Further, because these indices are based solely on presence–absence data, accurate estimators for them are unattainable. We provide a probabilistic derivation for the classic, incidence-based forms of these indices and extend this approach to formulate new Jaccard-type or Sorensen-type indices based on species abundance data. We then propose estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence-based or abundance-based sample data. In sampling simulations, these new estimators prove to be considerably less biased than classic indices when a substantial proportion of species are missing from samples. Based on species-rich empirical datasets, we show how incorporating the effect of unseen shared species not only increases accuracy but can also change the interpretation of results.
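As a rough illustration of the quantities the abstract refers to, the sketch below computes the classic incidence-based Jaccard and Sorensen indices from two species lists, along with an abundance-based analogue. The abstract does not state the abundance-based formulas; the forms used here, UV/(U + V - UV) and 2UV/(U + V), with U and V the total relative abundances of shared species in each sample, are an assumption about how the extension is defined. The function names and the example counts are hypothetical, and the correction for unseen shared species is not implemented.

```python
def classic_indices(species1, species2):
    """Classic incidence-based Jaccard and Sorensen indices.

    a = species shared by both samples,
    b = species found only in sample 1,
    c = species found only in sample 2.
    """
    s1, s2 = set(species1), set(species2)
    a = len(s1 & s2)
    b = len(s1 - s2)
    c = len(s2 - s1)
    if a + b + c == 0:
        raise ValueError("both samples are empty")
    jaccard = a / (a + b + c)
    sorensen = 2 * a / (2 * a + b + c)
    return jaccard, sorensen


def abundance_indices(counts1, counts2):
    """Abundance-based Jaccard-type and Sorensen-type indices (uncorrected).

    Assumed form: U and V are the proportions of individuals in samples 1
    and 2 that belong to species observed in both samples. No adjustment
    for unseen shared species is made here.
    """
    n1 = sum(counts1.values())
    n2 = sum(counts2.values())
    if n1 == 0 or n2 == 0:
        raise ValueError("each sample needs at least one individual")
    present1 = {sp for sp, n in counts1.items() if n > 0}
    present2 = {sp for sp, n in counts2.items() if n > 0}
    shared = present1 & present2
    u = sum(counts1[sp] for sp in shared) / n1
    v = sum(counts2[sp] for sp in shared) / n2
    if u == 0 or v == 0:
        return 0.0, 0.0  # no shared species observed
    jaccard_abd = u * v / (u + v - u * v)
    sorensen_abd = 2 * u * v / (u + v)
    return jaccard_abd, sorensen_abd


# Hypothetical abundance counts for two samples.
sample1 = {"A": 6, "B": 3, "C": 1}
sample2 = {"A": 2, "B": 2, "D": 4, "E": 2}

j, s = classic_indices(sample1, sample2)
j_abd, s_abd = abundance_indices(sample1, sample2)
print(f"classic:   Jaccard={j:.3f}  Sorensen={s:.3f}")
print(f"abundance: Jaccard={j_abd:.3f}  Sorensen={s_abd:.3f}")
```

In this toy example the classic indices treat the single individual of species C the same as the six individuals of species A, which is one way to see why rare species and small samples can move the incidence-based values so easily.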
