scispace - formally typeset
Search or ask a question
Author

Aron J. Fazekas

Bio: Aron J. Fazekas is an academic researcher from University of Guelph. The author has contributed to research in topics: DNA barcoding & Barcode. The author has an hindex of 17, co-authored 19 publications receiving 4208 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: The 2-locus combination of rbcL+matK will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.
Abstract: DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF–atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK–psbI spacer, and trnH–psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.

2,255 citations

Journal ArticleDOI
30 Jul 2008-PLOS ONE
TL;DR: Resolution to the contentious debate on plant barcoding should involve increased attention to practical issues related to the ease of sequence recovery, global alignability, and marker redundancy in multilocus plant DNA barc coding systems.
Abstract: A universal barcode system for land plants would be a valuable resource, with potential utility in fields as diverse as ecology, floristics, law enforcement and industry. However, the application of plant barcoding has been constrained by a lack of consensus regarding the most variable and technically practical DNA region(s). We compared eight candidate plant barcoding regions from the plastome and one from the mitochondrial genome for how well they discriminated the monophyly of 92 species in 32 diverse genera of land plants (N = 251 samples). The plastid markers comprise portions of five coding (rpoB, rpoC1, rbcL, matK and 23S rDNA) and three non-coding (trnH-psbA, atpF-atpH, and psbK-psbI) loci. Our survey included several taxonomically complex groups, and in all cases we examined multiple populations and species. The regions differed in their ability to discriminate species, and in ease of retrieval, in terms of amplification and sequencing success. Single locus resolution ranged from 7% (23S rDNA) to 59% (trnH-psbA) of species with well-supported monophyly. Sequence recovery rates were related primarily to amplification success (85-100% for plastid loci), with matK requiring the greatest effort to achieve reasonable recovery (88% using 10 primer pairs). Several loci (matK, psbK-psbI, trnH-psbA) were problematic for generating fully bidirectional sequences. Setting aside technical issues related to amplification and sequencing, combining the more variable plastid markers provided clear benefits for resolving species, although with diminishing returns, as all combinations assessed using four to seven regions had only marginally different success rates (69-71%; values that were approached by several two- and three-region combinations). This performance plateau may indicate fundamental upper limits on the precision of species discrimination that is possible with DNA barcoding systems that include moderate numbers of plastid markers. Resolution to the contentious debate on plant barcoding should therefore involve increased attention to practical issues related to the ease of sequence recovery, global alignability, and marker redundancy in multilocus plant DNA barcoding systems.

591 citations

Journal ArticleDOI
13 Apr 2006-Botany
TL;DR: Analysis of over 10 000 rbcL sequences from GenBank demonstrate that this locus could serve well as the core region, with sufficient variation to discriminate among species in approximate ranges, and a tiered approach wherein highly variable loci are nested under a core barcoding gene is proposed.
Abstract: DNA barcoding based on the mitochondrial cytochrome c oxidase 1 (cox1) sequence is being employed for diverse groups of animals with demonstrated success in species identification and new species discovery. Applying barcoding systems to land plants will be a more challenging task as plant genome substitution rates are considerably lower than those observed in animal mitochondria, suggesting that a much greater amount of sequence data from multiple loci will be required to barcode plants. In the absence of an obvious well-characterized plant locus that meets all the necessary criteria, a key first step will be identifying candidate regions with the most potential. To meet the challenges with land plants, we are proposing the adoption of a tiered approach wherein highly variable loci are nested under a core barcoding gene. Analysis of over 10 000 rbcL sequences from GenBank demonstrate that this locus could serve well as the core region, with sufficient variation to discriminate among species in approximate...

336 citations

Journal ArticleDOI
TL;DR: In this article, the relative utility of six coding (Universal Plastid Amplicon, rpoB, rpoc1, accD, rbcL, matK) and one non-coding (trnH-psbA) chloroplast loci for barcoding in the genus Compsoneura using both single region and multiregion approaches.
Abstract: The concept and practice of DNA barcoding have been designed as a system to facilitate species identification and recognition. The primary challenge for barcoding plants has been to identify a suitable region on which to focus the effort. The slow relative nucleotide substitution rates of plant mitochondria and the technical issues with the use of nuclear regions have focused attention on several proposed regions in the plastid genome. One of the challenges for barcoding is to discriminate closely related or recently evolved species. The Myristicaceae, or nutmeg family, is an older group within the angiosperms that contains some recently evolved species providing a challenging test for barcoding plants. The goal of this study is to determine the relative utility of six coding (Universal Plastid Amplicon — UPA, rpoB, rpoc1, accD, rbcL, matK) and one noncoding (trnH-psbA) chloroplast loci for barcoding in the genus Compsoneura using both single region and multiregion approaches. Five of the regions we tested were predominantly invariant across species (UPA, rpoB, rpoC1, accD, rbcL). Two of the regions (matK and trnH-psbA) had significant variation and show promise for barcoding in nutmegs. We demonstrate that a two-gene approach utilizing a moderately variable region (matK) and a more variable region (trnH-psbA) provides resolution among all the Compsonuera species we sampled including the recently evolved C. sprucei and C. mexicana. Our classification analyses based on nonmetric multidimensional scaling ordination, suggest that the use of two regions results in a decreased range of intraspecific variation relative to the distribution of interspecific divergence with 95% of the samples correctly identified in a sequence identification analysis.

264 citations

Journal ArticleDOI
TL;DR: The results using common metrics demonstrate that the number of species supported as monophyletic using barcoding markers is higher in animals than plants, even after controlling for the amount of parsimony‐informative information per species.
Abstract: The ability to discriminate between species using barcoding loci has proved more difficult in plants than animals, raising the possibility that plant species boundaries are less well defined. Here, we review a selection of published barcoding data sets to compare species discrimination in plants vs. animals. Although the use of different genetic markers, analytical methods and depths of taxon sampling may complicate comparisons, our results using common metrics demonstrate that the number of species supported as monophyletic using barcoding markers is higher in animals (> 90%) than plants (~70%), even after controlling for the amount of parsimony-informative information per species. This suggests that more than a simple lack of variability limits species discrimination in plants. Both animal and plant species pairs have variable size gaps between intra- and interspecific genetic distances, but animal species tend to have larger gaps than plants, even in relatively densely sampled genera. An analysis of 12 plant genera suggests that hybridization contributes significantly to variation in genetic discontinuity in plants. Barcoding success may be improved in some plant groups by careful choice of markers and appropriate sampling; however, overall fine-scale species discrimination in plants relative to animals may be inherently more difficult because of greater levels of gene-tree paraphyly.

243 citations


Cited by
More filters
Journal Article
Fumio Tajima1
30 Oct 1989-Genomics
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations

Journal ArticleDOI
TL;DR: Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation.
Abstract: Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it is difficult to amplify in fungi, often includes large introns, and can be insufficiently variable. Three subunits from the nuclear ribosomal RNA cistron were compared together with regions of three representative protein-coding genes (largest subunit of RNA polymerase II, second largest subunit of RNA polymerase II, and minichromosome maintenance protein). Although the protein-coding gene regions often had a higher percent of correct identification compared with ribosomal markers, low PCR amplification and sequencing success eliminated them as candidates for a universal fungal barcode. Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation. The nuclear ribosomal large subunit, a popular phylogenetic marker in certain groups, had superior species resolution in some taxonomic groups, such as the early diverging lineages and the ascomycete yeasts, but was otherwise slightly inferior to the ITS. The nuclear ribosomal small subunit has poor species-level resolution in fungi. ITS will be formally proposed for adoption as the primary fungal barcode marker to the Consortium for the Barcode of Life, with the possibility that supplementary barcodes may be developed for particular narrowly circumscribed taxonomic groups.

4,116 citations

Journal ArticleDOI
TL;DR: Automatic Barcode Gap Discovery is fast, simple method to split a sequence alignment data set into candidate species that should be complemented with other evidence in an integrative taxonomic approach.
Abstract: Within uncharacterized groups, DNA barcodes, short DNA sequences that are present in a wide range of species, can be used to assign organisms into species. We propose an automatic procedure that sorts the sequences into hypothetical species based on the barcode gap, which can be observed whenever the divergence among organisms belonging to the same species is smaller than divergence among organisms from different species. We use a range of prior intraspecific divergence to infer from the data a model-based one-sided confidence limit for intraspecific divergence. The method, called Automatic Barcode Gap Discovery (ABGD), then detects the barcode gap as the first significant gap beyond this limit and uses it to partition the data. Inference of the limit and gap detection are then recursively applied to previously obtained groups to get finer partitions until there is no further partitioning. Using six published data sets of metazoans, we show that ABGD is computationally efficient and performs well for standard prior maximum intraspecific divergences (a few per cent of divergence for the five data sets), except for one data set where less than three sequences per species were sampled. We further explore the theoretical limitations of ABGD through simulation of explicit speciation and population genetics scenarios. Our results emphasize in particular the sensitivity of the method to the presence of recent speciation events, via (unrealistically) high rates of speciation or large numbers of species. In conclusion, ABGD is fast, simple method to split a sequence alignment data set into candidate species that should be complemented with other evidence in an integrative taxonomic approach.

2,336 citations

Journal ArticleDOI
TL;DR: Recent developments in rhizosphere research are discussed in relation to assessing the contribution of the micro- and macroflora to sustainable agriculture, nature conservation, the development of bio-energy crops and the mitigation of climate change.
Abstract: The rhizosphere is the interface between plant roots and soil where interactions among a myriad of microorganisms and invertebrates affect biogeochemical cycling, plant growth and tolerance to biotic and abiotic stress. The rhizosphere is intriguingly complex and dynamic, and understanding its ecology and evolution is key to enhancing plant productivity and ecosystem functioning. Novel insights into key factors and evolutionary processes shaping the rhizosphere microbiome will greatly benefit from integrating reductionist and systems-based approaches in both agricultural and natural ecosystems. Here, we discuss recent developments in rhizosphere research in relation to assessing the contribution of the micro- and macroflora to sustainable agriculture, nature conservation, the development of bio-energy crops and the mitigation of climate change.

2,332 citations

Journal ArticleDOI
TL;DR: The 2-locus combination of rbcL+matK will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.
Abstract: DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF–atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK–psbI spacer, and trnH–psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.

2,255 citations