Author
Conrad L. Schoch
Other affiliations: Cornell University, Oregon State University, University of Oregon ...read more
Bio: Conrad L. Schoch is an academic researcher from National Institutes of Health. The author has contributed to research in topic(s): Dothideomycetes & Phylogenetic tree. The author has an hindex of 52, co-authored 93 publication(s) receiving 22018 citation(s). Previous affiliations of Conrad L. Schoch include Cornell University & Oregon State University.
Papers published on a yearly basis
Papers
More filters
Conrad L. Schoch1, Keith A. Seifert, Sabine M. Huhndorf2, Vincent Robert3 +157 more•Institutions (59)
TL;DR: Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation.
Abstract: Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it is difficult to amplify in fungi, often includes large introns, and can be insufficiently variable. Three subunits from the nuclear ribosomal RNA cistron were compared together with regions of three representative protein-coding genes (largest subunit of RNA polymerase II, second largest subunit of RNA polymerase II, and minichromosome maintenance protein). Although the protein-coding gene regions often had a higher percent of correct identification compared with ribosomal markers, low PCR amplification and sequencing success eliminated them as candidates for a universal fungal barcode. Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation. The nuclear ribosomal large subunit, a popular phylogenetic marker in certain groups, had superior species resolution in some taxonomic groups, such as the early diverging lineages and the ascomycete yeasts, but was otherwise slightly inferior to the ITS. The nuclear ribosomal small subunit has poor species-level resolution in fungi. ITS will be formally proposed for adoption as the primary fungal barcode marker to the Consortium for the Barcode of Life, with the possibility that supplementary barcodes may be developed for particular narrowly circumscribed taxonomic groups.
3,444 citations
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.
2,862 citations
Clark University1, National Institutes of Health2, Louisiana State University3, CABI4, Umeå University5, Field Museum of Natural History6, Duke University7, University of Minnesota8, University of Alabama9, Oregon State University10, Centraalbureau voor Schimmelcultures11, United States Department of Agriculture12, University of Tübingen13, Max Planck Society14, University of Florida15, Pennsylvania State University16, Aberystwyth University17, Complutense University of Madrid18, University of Oslo19, University of Hong Kong20, University of Tartu21, University of Gothenburg22, University of Kansas23, University of Maine24, University of Illinois at Urbana–Champaign25, Royal Ontario Museum26, Georgia State University27, Estonian University of Life Sciences28, Washington State University29, Nova Southeastern University30, Ludwig Maximilian University of Munich31, University of Western Ontario32, Uppsala University33, Brandon University34, Royal Botanic Garden Edinburgh35, State University of New York at Purchase36, Boise State University37, Cornell University38
TL;DR: A comprehensive phylogenetic classification of the kingdom Fungi is proposed, with reference to recent molecular phylogenetic analyses, and with input from diverse members of the fungal taxonomic community.
Abstract: A comprehensive phylogenetic classification of the kingdom Fungi is proposed, with reference to recent molecular phylogenetic analyses, and with input from diverse members of the fungal taxonomic community. The classification includes 195 taxa, down to the level of order, of which 16 are described or validated here: Dikarya subkingdom nov.; Chytridiomycota, Neocallimastigomycota phyla nov.; Monoblepharidomycetes, Neocallimastigomycetes class. nov.; Eurotiomycetidae, Lecanoromycetidae, Mycocaliciomycetidae subclass. nov.; Acarosporales, Corticiales, Baeomycetales, Candelariales, Gloeophyllales, Melanosporales, Trechisporales, Umbilicariales ords. nov. The clade containing Ascomycota and Basidiomycota is classified as subkingdom Dikarya, reflecting the putative synapomorphy of dikaryotic hyphae. The most dramatic shifts in the classification relative to previous works concern the groups that have traditionally been included in the Chytridiomycota and Zygomycota. The Chytridiomycota is retained in a restricted sense, with Blastocladiomycota and Neocallimastigomycota representing segregate phyla of flagellated Fungi. Taxa traditionally placed in Zygomycota are distributed among Glomeromycota and several subphyla incertae sedis, including Mucoromycotina, Entomophthoromycotina, Kickxellomycotina, and Zoopagomycotina. Microsporidia are included in the Fungi, but no further subdivision of the group is proposed. Several genera of 'basal' Fungi of uncertain position are not placed in any higher taxa, including Basidiobolus, Caulochytrium, Olpidium, and Rozella.
1,928 citations
Duke University1, Oregon State University2, Clark University3, Natural History Museum4, University of Minnesota5, Field Museum of Natural History6, Kaiserslautern University of Technology7, University of Arizona8, New York Botanical Garden9, University of Iowa10, Technische Universität Darmstadt11, University of Maine12, United States Department of Agriculture13, University of Georgia14, University of Alabama15, University of California, Berkeley16, University of Kansas17, Aberystwyth University18, West Virginia University19, Washington State University20, Harvard University21, University of North Carolina at Chapel Hill22, Centraalbureau voor Schimmelcultures23, University of Tennessee24, Okayama University25, University of Kassel26, Brandon University27, Pennsylvania State University28, Leibniz Association29, University of Hamburg30, Royal Botanic Garden Edinburgh31
TL;DR: It is indicated that there may have been at least four independent losses of the flagellum in the kingdom Fungi, and the enigmatic microsporidia seem to be derived from an endoparasitic chytrid ancestor similar to Rozella allomycis, on the earliest diverging branch of the fungal phylogenetic tree.
Abstract: The ancestors of fungi are believed to be simple aquatic forms with flagellated spores, similar to members of the extant phylum Chytridiomycota (chytrids). Current classifications assume that chytrids form an early-diverging clade within the kingdom Fungi and imply a single loss of the spore flagellum, leading to the diversification of terrestrial fungi. Here we develop phylogenetic hypotheses for Fungi using data from six gene regions and nearly 200 species. Our results indicate that there may have been at least four independent losses of the flagellum in the kingdom Fungi. These losses of swimming spores coincided with the evolution of new mechanisms of spore dispersal, such as aerial dispersal in mycelial groups and polar tube eversion in the microsporidia (unicellular forms that lack mitochondria). The enigmatic microsporidia seem to be derived from an endoparasitic chytrid ancestor similar to Rozella allomycis, on the earliest diverging branch of the fungal phylogenetic tree.
1,575 citations
University of Saskatchewan1, Dalhousie University2, University of Rhode Island3, Sewanee: The University of the South4, Natural History Museum5, New York State Department of Health6, University of British Columbia7, Kaiserslautern University of Technology8, Charles University in Prague9, University of Guelph10, Le Moyne College11, Georgia College & State University12, University of Colorado Boulder13, University of Geneva14, Edinburgh Napier University15, University of Arkansas16, Saint Petersburg State University17
TL;DR: This revision of the classification of eukaryotes retains an emphasis on the protists and incorporates changes since 2005 that have resolved nodes and branches in phylogenetic trees.
Abstract: This revision of the classification of eukaryotes, which updates that of Adl et al. [J. Eukaryot. Microbiol. 52 (2005) 399], retains an emphasis on the protists and incorporates changes since 2005 that have resolved nodes and branches in phylogenetic trees. Whereas the previous revision was successful in re-introducing name stability to the classification, this revision provides a classification for lineages that were then still unresolved. The supergroups have withstood phylogenetic hypothesis testing with some modifications, but despite some progress, problematic nodes at the base of the eukaryotic tree still remain to be statistically resolved. Looking forward, subsequent transformations to our understanding of the diversity of life will be from the discovery of novel lineages in previously under-sampled areas and from environmental genomic information.
1,298 citations
Cited by
More filters
TL;DR: Increases in the abundance and activity of Bilophila wadsworthia on the animal-based diet support a link between dietary fat, bile acids and the outgrowth of microorganisms capable of triggering inflammatory bowel disease.
Abstract: Long-term dietary intake influences the structure and activity of the trillions of microorganisms residing in the human gut, but it remains unclear how rapidly and reproducibly the human gut microbiome responds to short-term macronutrient change. Here we show that the short-term consumption of diets composed entirely of animal or plant products alters microbial community structure and overwhelms inter-individual differences in microbial gene expression. The animal-based diet increased the abundance of bile-tolerant microorganisms (Alistipes, Bilophila and Bacteroides) and decreased the levels of Firmicutes that metabolize dietary plant polysaccharides (Roseburia, Eubacterium rectale and Ruminococcus bromii). Microbial activity mirrored differences between herbivorous and carnivorous mammals, reflecting trade-offs between carbohydrate and protein fermentation. Foodborne microbes from both diets transiently colonized the gut, including bacteria, fungi and even viruses. Finally, increases in the abundance and activity of Bilophila wadsworthia on the animal-based diet support a link between dietary fat, bile acids and the outgrowth of microorganisms capable of triggering inflammatory bowel disease. In concert, these results demonstrate that the gut microbiome can rapidly respond to altered diet, potentially facilitating the diversity of human dietary lifestyles.
5,438 citations
TL;DR: The content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases, and the newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined.
Abstract: KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.
4,094 citations
TL;DR: The changes that have occurred in CAZy during the past 5 years are outlined and a novel effort to display the resolution and the carbohydrate ligands in crystallographic complexes of CAZymes is presented.
Abstract: The Carbohydrate-Active Enzymes database (CAZy; http://www.cazy.org) provides online and continuously updated access to a sequence-based family classification linking the sequence to the specificity and 3D structure of the enzymes that assemble, modify and breakdown oligo- and polysaccharides. Functional and 3D structural information is added and curated on a regular basis based on the available literature. In addition to the use of the database by enzymologists seeking curated information on CAZymes, the dissemination of a stable nomenclature for these enzymes is probably a major contribution of CAZy. The past few years have seen the expansion of the CAZy classification scheme to new families, the development of subfamilies in several families and the power of CAZy for the analysis of genomes and metagenomes. This article outlines the changes that have occurred in CAZy during the past 5 years and presents our novel effort to display the resolution and the carbohydrate ligands in crystallographic complexes of CAZymes.
4,078 citations
Conrad L. Schoch1, Keith A. Seifert, Sabine M. Huhndorf2, Vincent Robert3 +157 more•Institutions (59)
TL;DR: Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation.
Abstract: Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it is difficult to amplify in fungi, often includes large introns, and can be insufficiently variable. Three subunits from the nuclear ribosomal RNA cistron were compared together with regions of three representative protein-coding genes (largest subunit of RNA polymerase II, second largest subunit of RNA polymerase II, and minichromosome maintenance protein). Although the protein-coding gene regions often had a higher percent of correct identification compared with ribosomal markers, low PCR amplification and sequencing success eliminated them as candidates for a universal fungal barcode. Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation. The nuclear ribosomal large subunit, a popular phylogenetic marker in certain groups, had superior species resolution in some taxonomic groups, such as the early diverging lineages and the ascomycete yeasts, but was otherwise slightly inferior to the ITS. The nuclear ribosomal small subunit has poor species-level resolution in fungi. ITS will be formally proposed for adoption as the primary fungal barcode marker to the Consortium for the Barcode of Life, with the possibility that supplementary barcodes may be developed for particular narrowly circumscribed taxonomic groups.
3,444 citations
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.
2,862 citations