scispace - formally typeset
Journal ArticleDOI

Accelerated species inventory on Madagascar using coalescent-based models of species delineation.

Reads0
Chats0
TLDR
A modified GMYC model is developed that allows for a variable transition from coalescent to speciation among lineages and provides a method of species discovery and biodiversity assessment using single-locus data from mixed or environmental samples while building a globally available taxonomic database for future identifications.
Abstract
High-throughput DNA sequencing has the potential to accelerate species discovery if it is able to recognize evolutionary entities from sequence data that are comparable to species. The general mixed Yule-coalescent (GMYC) model estimates the species boundary from DNA surveys by identifying independently evolving lineages as a transition from coalescent to speciation branching patterns on a phylogenetic tree. Applied here to 12 families from 4 orders of insects in Madagascar, we used the model to delineate 370 putative species from mitochondrial DNA sequence variation among 1614 individuals. These were compared with data from the nuclear genome and morphological identification and found to be highly congruent (98% and 94%). We developed a modified GMYC that allows for a variable transition from coalescent to speciation among lineages. This revised model increased the congruence with morphology (97%), suggesting that a variable threshold better reflects the clustering of sequence data into biological species. Local endemism was pronounced in all 5 insect groups. Most species (60-91%) and haplotypes (88-99%) were found at only 1 of the 5 study sites (40-1000 km apart). This pronounced endemism resulted in a 37% increase in species numbers using diagnostic nucleotides in a population aggregation analysis. Sample sizes between 7 and 10 individuals represented a threshold above which there was minimal increase in genetic diversity, broadly agreeing with coalescent theory and other empirical studies. Our results from >1.4 Mb of empirical data suggest that the GMYC model captures species boundaries comparable to those from traditional methods without the need for prior hypotheses of population coherence. This provides a method of species discovery and biodiversity assessment using single-locus data from mixed or environmental samples while building a globally available taxonomic database for future identifications. (Biodiversity; coalescent; DNA barcoding; DNA taxonomy; endemism; GMYC; Madagascar; turnover.)

read more

Citations
More filters
Journal Article

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.
Journal ArticleDOI

ABGD, Automatic Barcode Gap Discovery for primary species delimitation

TL;DR: Automatic Barcode Gap Discovery is fast, simple method to split a sequence alignment data set into candidate species that should be complemented with other evidence in an integrative taxonomic approach.
Journal ArticleDOI

A general species delimitation method with applications to phylogenetic placements

TL;DR: The Poisson tree processes (PTP) model is introduced to infer putative species boundaries on a given phylogenetic input tree and yields more accurate results than de novo species delimitation methods.
Journal ArticleDOI

A DNA-Based Registry for All Animal Species: The Barcode Index Number (BIN) System

TL;DR: A persistent, species-level taxonomic registry for the animal kingdom is developed based on the analysis of patterns of nucleotide variation in the barcode region of the cytochrome c oxidase I (COI) gene.
Journal ArticleDOI

The integrative future of taxonomy.

TL;DR: It is concluded that taxonomy needs to be pluralistic to improve species discovery and description, and to develop novel protocols to produce the much-needed inventory of life in a reasonable time.
References
More filters
Journal ArticleDOI

BEAST: Bayesian evolutionary analysis by sampling trees

TL;DR: BEAST is a fast, flexible software architecture for Bayesian analysis of molecular sequences related by an evolutionary tree that provides models for DNA and protein sequence evolution, highly parametric coalescent analysis, relaxed clock phylogenetics, non-contemporaneous sequence data, statistical alignment and a wide range of options for prior distributions.
Journal Article

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.
Journal ArticleDOI

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

TL;DR: The relationship between the two estimates of genetic variation at the DNA level, namely the number of segregating sites and the average number of nucleotide differences estimated from pairwise comparison, is investigated in this article.
Journal ArticleDOI

APE: Analyses of Phylogenetics and Evolution in R language

TL;DR: UNLABELLED Analysis of Phylogenetics and Evolution (APE) is a package written in the R language for use in molecular evolution and phylogenetics that provides both utility functions for reading and writing data and manipulating phylogenetic trees.
Journal ArticleDOI

DnaSP, DNA polymorphism analyses by the coalescent and other methods.

TL;DR: The present version of DnaSP introduces several new modules and features which, among other options, allow handling big data sets and conducting a large number of coalescent-based tests by Monte Carlo computer simulations.
Related Papers (5)