Author
Anne Chao
Other affiliations: National Taiwan University
Bio: Anne Chao is an academic researcher from National Tsing Hua University. The author has contributed to research in topics: Estimator & Population. The author has an hindex of 54, co-authored 178 publications receiving 24610 citations. Previous affiliations of Anne Chao include National Taiwan University.
Topics: Estimator, Population, Species richness, Species diversity, Biodiversity
Papers published on a yearly basis
Papers
More filters
•
TL;DR: On applique la methode d'Efron (1981, 1982) a la construction d'intervalles de confiance bases sur des distributions du bootstrap as discussed by the authors.
Abstract: On applique la methode d'Efron (1981, 1982) a la construction d'intervalles de confiance bases sur des distributions du bootstrap
3,858 citations
••
TL;DR: In this article, the authors extended previous rarefaction and extrapolation models for species richness (Hill number q D, where q ¼ 0) to measures of taxon diversity incorporating relative abundance (i.e., for any Hill number qD, q. 0) and presented a unified approach for both individual-based (abundance) data and sample-based data.
Abstract: Quantifying and assessing changes in biological diversity are central aspects of many ecological studies, yet accurate methods of estimating biological diversity from sampling data have been elusive. Hill numbers, or the effective number of species, are increasingly used to characterize the taxonomic, phylogenetic, or functional diversity of an assemblage. However, empirical estimates of Hill numbers, including species richness, tend to be an increasing function of sampling effort and, thus, tend to increase with sample completeness. Integrated curves based on sampling theory that smoothly link rarefaction (interpolation) and prediction (extrapolation) standardize samples on the basis of sample size or sample completeness and facilitate the comparison of biodiversity data. Here we extended previous rarefaction and extrapolation models for species richness (Hill number q D, where q ¼ 0) to measures of taxon diversity incorporating relative abundance (i.e., for any Hill number q D, q . 0) and present a unified approach for both individual-based (abundance) data and sample- based (incidence) data. Using this unified sampling framework, we derive both theoretical formulas and analytic estimators for seamless rarefaction and extrapolation based on Hill numbers. Detailed examples are provided for the first three Hill numbers: q ¼ 0 (species richness), q ¼ 1 (the exponential of Shannon's entropy index), and q ¼ 2 (the inverse of Simpson's concentration index). We developed a bootstrap method for constructing confidence intervals around Hill numbers, facilitating the comparison of multiple assemblages of both rarefied and extrapolated samples. The proposed estimators are accurate for both rarefaction and short-range extrapolation. For long-range extrapolation, the performance of the estimators depends on both the value of q and on the extrapolation range. We tested our methods on simulated data generated from species abundance models and on data from large species inventories. We also illustrate the formulas and estimators using empirical data sets from biodiversity surveys of temperate forest spiders and tropical ants.
2,182 citations
••
TL;DR: A point estimator and its associated confidence interval for the size of a closed population are proposed under models that incorporate heterogeneity of capture probability andumerical results show that the proposed confidence interval performs satisfactorily in maintaining the nominal levels.
Abstract: A point estimator and its associated confidence interval for the size of a closed population are proposed under models that incorporate heterogeneity of capture probability. Real data sets provided in Edwards and Eberhardt (1967, Journal of Wildlife Management 31, 87-96) and Carothers (1973, Journal of Animal Ecology 42, 125-146) are used to illustrate this method and to compare it with other estimates. The performance of the proposed procedure is also investigated by means of Monte Carlo experiments. The method is especially useful when most of the captured individuals are caught once or twice in the sample, for which case the jackknife estimator usually does not work well. Numerical results also show that the proposed confidence interval performs satisfactorily in maintaining the nominal levels.
2,173 citations
••
TL;DR: In this article, the authors present an R package iNEXT (iNterpolation/EXTrapolation) which provides simple functions to compute and plot the seamless rarefaction and extrapolation sampling curves for the three most widely used members of the Hill number family.
Abstract: Summary
Hill numbers (or the effective number of species) have been increasingly used to quantify the species/taxonomic diversity of an assemblage. The sample-size- and coverage-based integrations of rarefaction (interpolation) and extrapolation (prediction) of Hill numbers represent a unified standardization method for quantifying and comparing species diversity across multiple assemblages.
We briefly review the conceptual background of Hill numbers along with two approaches to standardization. We present an R package iNEXT (iNterpolation/EXTrapolation) which provides simple functions to compute and plot the seamless rarefaction and extrapolation sampling curves for the three most widely used members of the Hill number family (species richness, Shannon diversity and Simpson diversity). Two types of biodiversity data are allowed: individual-based abundance data and sampling-unit-based incidence data.
Several applications of the iNEXT packages are reviewed: (i) Non-asymptotic analysis: comparison of diversity estimates for equally large or equally complete samples. (ii) Asymptotic analysis: comparison of estimated asymptotic or true diversities. (iii) Assessment of sample completeness (sample coverage) across multiple samples. (iv) Comparison of estimated point diversities for a specified sample size or a specified level of sample coverage.
Two examples are demonstrated, using the data (one for abundance data and the other for incidence data) included in the package, to illustrate all R functions and graphical displays.
2,170 citations
••
TL;DR: This work provides a probabilistic derivation for the classic, incidence-based forms of Jaccard and Sorensen indices of compositional similarity and proposes estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence- or abundancebased sample data.
Abstract: The classic Jaccard and Sorensen indices of compositional similarity (and other indices that depend upon the same variables) are notoriously sensitive to sample size, especially for assemblages with numerous rare species. Further, because these indices are based solely on presence–absence data, accurate estimators for them are unattainable. We provide a probabilistic derivation for the classic, incidence-based forms of these indices and extend this approach to formulate new Jaccard-type or Sorensen-type indices based on species abundance data. We then propose estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence- or abundancebased sample data. In sampling simulations, these new estimators prove to be considerably less biased than classic indices when a substantial proportion of species are missing from samples. Based on species-rich empirical datasets, we show how incorporating the effect of unseen shared species not only increases accuracy but also can change the interpretation of results.
1,672 citations
Cited by
More filters
••
TL;DR: Preface to the Princeton Landmarks in Biology Edition vii Preface xi Symbols used xiii 1.
Abstract: Preface to the Princeton Landmarks in Biology Edition vii Preface xi Symbols Used xiii 1. The Importance of Islands 3 2. Area and Number of Speicies 8 3. Further Explanations of the Area-Diversity Pattern 19 4. The Strategy of Colonization 68 5. Invasibility and the Variable Niche 94 6. Stepping Stones and Biotic Exchange 123 7. Evolutionary Changes Following Colonization 145 8. Prospect 181 Glossary 185 References 193 Index 201
14,171 citations
•
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.
11,521 citations
••
TL;DR: The importance of using 'reference' sites to assess the true richness and composition of species assemblages, to measure ecologically significant ratios between unrelated taxa, toMeasure taxon/sub-taxon (hierarchical) ratios, and to 'calibrate' standardized sampling methods is discussed.
Abstract: Both the magnitude and the urgency of the task of assessing global biodiversity require that we make the most of what we know through the use of estimation and extrapolation. Likewise, future biodiversity inventories need to be designed around the use of effective sampling and estimation procedures, especially for 'hyperdiverse' groups of terrestrial organisms, such as arthropods, nematodes, fungi, and microorganisms. The challenge of estimating patterns of species richness from samples can be separated into (i) the problem of estimating local species richness, and (ii) the problem of estimating the distinctness, or complementarity, of species assemblages. These concepts apply on a wide range of spatial, temporal, and functional scales. Local richness can be estimated by extrapolating species accumulation curves, fitting parametric distributions of relative abundance, or using non-parametric techniques based on the distribution of individuals among species or of species among samples. We present several of these methods and examine their effectiveness for an example data set. We present a simple measure of complementarity, with some biogeographic examples, and outline the difficult problem of estimating complementarity from samples. Finally, we discuss the importance of using 'reference' sites (or sub-sites) to assess the true richness and composition of species assemblages, to measure ecologically significant ratios between unrelated taxa, to measure taxon/sub-taxon (hierarchical) ratios, and to 'calibrate' standardized sampling methods. This information can then be applied to the rapid, approximate assessment of species richness and faunal or floral composition at 'comparative' sites.
4,245 citations
••
TL;DR: In this paper, the authors compared the fecal microbiota of European children (EU) and that of children from a rural African village of Burkina Faso (BF), where the diet, high in fiber content, is similar to that of early human settlements at the time of the birth of agriculture.
Abstract: Gut microbial composition depends on different dietary habits just as health depends on microbial metabolism, but the association of microbiota with different diets in human populations has not yet been shown. In this work, we compared the fecal microbiota of European children (EU) and that of children from a rural African village of Burkina Faso (BF), where the diet, high in fiber content, is similar to that of early human settlements at the time of the birth of agriculture. By using high-throughput 16S rDNA sequencing and biochemical analyses, we found significant differences in gut microbiota between the two groups. BF children showed a significant enrichment in Bacteroidetes and depletion in Firmicutes (P < 0.001), with a unique abundance of bacteria from the genus Prevotella and Xylanibacter, known to contain a set of bacterial genes for cellulose and xylan hydrolysis, completely lacking in the EU children. In addition, we found significantly more short-chain fatty acids (P < 0.001) in BF than in EU children. Also, Enterobacteriaceae (Shigella and Escherichia) were significantly underrepresented in BF than in EU children (P < 0.05). We hypothesize that gut microbiota coevolved with the polysaccharide-rich diet of BF individuals, allowing them to maximize energy intake from fibers while also protecting them from inflammations and noninfectious colonic diseases. This study investigates and compares human intestinal microbiota from children characterized by a modern western diet and a rural diet, indicating the importance of preserving this treasure of microbial diversity from ancient rural communities worldwide.
4,233 citations
••
TL;DR: Over 1.2 million previously unknown genes represented in these samples, including more than 782 new rhodopsin-like photoreceptors are identified, suggesting substantial oceanic microbial diversity.
Abstract: We have applied “whole-genome shotgun sequencing” to microbial populations collected en masse on tangential flow and impact filters from seawater samples collected from the Sargasso Sea near Bermuda. A total of 1.045 billion base pairs of nonredundant sequence was generated, annotated, and analyzed to elucidate the gene content, diversity, and relative abundance of the organisms within these environmental samples. These data are estimated to derive from at least 1800 genomic species based on sequence relatedness, including 148 previously unknown bacterial phylotypes. We have identified over 1.2 million previously unknown genes represented in these samples, including more than 782 new rhodopsin-like photoreceptors. Variation in species present and stoichiometry suggests substantial oceanic microbial diversity. Microorganisms are responsible for most of the biogeochemical cycles that shape the environment of Earth and its oceans. Yet, these organisms are the least well understood on Earth, as the ability to study and understand the metabolic potential of microorganisms has been hampered by the inability to generate pure cultures. Recent studies have begun to explore environ
4,210 citations