MycoCosm portal: gearing up for 1000 fungal genomes

doi:10.1093/NAR/GKT1183

Home
/
Papers
/
MycoCosm portal: gearing up for 1000 fungal genomes

Journal Article•DOI•

MycoCosm portal: gearing up for 1000 fungal genomes

Igor V. Grigoriev¹, Roman Nikitin¹, Sajeet Haridas¹, Alan Kuo¹, Robin A. Ohm¹, Robert Otillar¹, Robert Riley¹, Asaf Salamov¹, Xueling Zhao¹, Frank Korzeniewski¹, Tatyana Smirnova¹, Henrik P. Nordberg¹, Inna Dubchak¹, Igor Shabalov¹ - Show less +10 more•Institutions (1)

United States Department of Energy¹

01 Jan 2014-Nucleic Acids Research (Oxford University Press)-Vol. 42, pp 699-704

TL;DR: MycoCosm is a fungal genomics portal developed by the US Department of Energy Joint Genome Institute to support integration, analysis and dissemination of fungal genome sequences and other 'omics' data by providing interactive web-based tools.

read less

Abstract: MycoCosm is a fungal genomics portal (http://jgi.doe.gov/fungi), developed by the US Department of Energy Joint Genome Institute to support integration, analysis and dissemination of fungal genome sequences and other 'omics' data by providing interactive web-based tools. MycoCosm also promotes and facilitates user community participation through the nomination of new species of fungi for sequencing, and the annotation and analysis of resulting data. By efficiently filling gaps in the Fungal Tree of Life, MycoCosm will help address important problems associated with energy and the environment, taking advantage of growing fungal genomics resources.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Mycorrhizal ecology and evolution: the past, the present, and the future

[...]

Marcel G. A. van der Heijden¹, Marcel G. A. van der Heijden², Francis Martin³, Marc-André Selosse, Ian R. Sanders⁴ - Show less +1 more•Institutions (4)

University of Zurich¹, Utrecht University², University of Lorraine³, University of Lausanne⁴

01 Mar 2015-New Phytologist

TL;DR: Large-scale molecular surveys have provided novel insights into the diversity, spatial and temporal dynamics of mycorrhizal fungal communities, and network theory makes it possible to analyze interactions between plant-fungal partners as complex underground multi-species networks.

...read moreread less

Abstract: Almost all land plants form symbiotic associations with mycorrhizal fungi. These below-ground fungi play a key role in terrestrial ecosystems as they regulate nutrient and carbon cycles, and influence soil structure and ecosystem multifunctionality. Up to 80% of plant N and P is provided by mycorrhizal fungi and many plant species depend on these symbionts for growth and survival. Estimates suggest that there are c. 50 000 fungal species that form mycorrhizal associations with c. 250 000 plant species. The development of high-throughput molecular tools has helped us to better understand the biology, evolution, and biodiversity of mycorrhizal associations. Nuclear genome assemblies and gene annotations of 33 mycorrhizal fungal species are now available providing fascinating opportunities to deepen our understanding of the mycorrhizal lifestyle, the metabolic capabilities of these plant symbionts, the molecular dialogue between symbionts, and evolutionary adaptations across a range of mycorrhizal associations. Large-scale molecular surveys have provided novel insights into the diversity, spatial and temporal dynamics of mycorrhizal fungal communities. At the ecological level, network theory makes it possible to analyze interactions between plant-fungal partners as complex underground multi-species networks. Our analysis suggests that nestedness, modularity and specificity of mycorrhizal networks vary and depend on mycorrhizal type. Mechanistic models explaining partner choice, resource exchange, and coevolution in mycorrhizal associations have been developed and are being tested. This review ends with major frontiers for further research.

...read moreread less

1,223 citations

Cites background from "MycoCosm portal: gearing up for 100..."

...Genome sequences are now available for several mycorrhizal fungi and are valuable for resolving long-standing issues about their Genome sequences and annotations can be assessed through the JGI MycoCosm portal (http://genome.jgi-psf.org/programs/fungi/index.jsf; Grigoriev et al., 2014)....
[...]

Journal Article•DOI•

Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists.

[...]

Annegret Kohler¹, Annegret Kohler², Alan Kuo³, László Nagy⁴, László Nagy⁵, Emmanuelle Morin², Emmanuelle Morin¹, Kerrie Barry³, François Buscot⁶, Björn Canbäck⁷, Cindy Choi³, Nicolas Cichocki¹, Nicolas Cichocki², Alicia Clum³, Jan V. Colpaert⁸, Alex Copeland³, Maurício Dutra Costa⁹, Jeanne Doré¹⁰, Dimitrios Floudas⁴, Mariangela Girlanda¹¹, Bernard Henrissat¹², Bernard Henrissat¹³, Sylvie Herrmann⁶, Jaqueline Hess¹⁴, Nils Högberg¹⁵, Tomas Johansson⁷, Hassine-Radhouane Khouja¹¹, Kurt LaButti³, Urs Lahrmann¹⁶, Anthony Levasseur¹², Erika Lindquist³, Anna Lipzen³, Roland Marmeisse¹⁰, Elena Martino¹¹, Elena Martino¹, Claude Murat², Claude Murat¹, Chew Yee Ngan³, Uwe Nehls¹⁷, Jonathan M. Plett², Jonathan M. Plett¹, Anne Pringle¹⁸, Robin A. Ohm³, Silvia Perotto¹¹, Martina Peter¹⁹, Robert Riley³, Francois Rineau²⁰, Francois Rineau⁸, Joske Ruytinx⁸, Asaf Salamov³, Firoz Shah⁷, Hui Sun³, Mika T. Tarkka⁶, Andrew Tritt³, Claire Veneault-Fourrey¹, Claire Veneault-Fourrey², Alga Zuccaro¹⁶, Alga Zuccaro²¹, Anders Tunlid⁷, Igor V. Grigoriev³, David S. Hibbett⁴, Francis Martin², Francis Martin¹ - Show less +59 more•Institutions (21)

Institut national de la recherche agronomique¹, University of Lorraine², United States Department of Energy³, Clark University⁴, Hungarian Academy of Sciences⁵, Helmholtz Centre for Environmental Research - UFZ⁶, Lund University⁷, University of Hasselt⁸, Universidade Federal de Viçosa⁹, University of Lyon¹⁰, University of Turin¹¹, Aix-Marseille University¹², King Abdulaziz University¹³, University of Oslo¹⁴, Swedish University of Agricultural Sciences¹⁵, Max Planck Society¹⁶, University of Bremen¹⁷, Harvard University¹⁸, Swiss Federal Institute for Forest, Snow and Landscape Research¹⁹, Pierre-and-Marie-Curie University²⁰, University of Cologne²¹

01 Apr 2015-Nature Genetics

TL;DR: Convergent evolution of the mycorrhizal habit in fungi occurred via the repeated evolution of a 'symbiosis toolkit', with reduced numbers of PCWDEs and lineage-specific suites of myCorrhiza-induced genes.

...read moreread less

Abstract: To elucidate the genetic bases of mycorrhizal lifestyle evolution, we sequenced new fungal genomes, including 13 ectomycorrhizal (ECM), orchid (ORM) and ericoid (ERM) species, and five saprotrophs, which we analyzed along with other fungal genomes. Ectomycorrhizal fungi have a reduced complement of genes encoding plant cell wall-degrading enzymes (PCWDEs), as compared to their ancestral wood decayers. Nevertheless, they have retained a unique array of PCWDEs, thus suggesting that they possess diverse abilities to decompose lignocellulose. Similar functional categories of nonorthologous genes are induced in symbiosis. Of induced genes, 7-38% are orphan genes, including genes that encode secreted effector-like proteins. Convergent evolution of the mycorrhizal habit in fungi occurred via the repeated evolution of a 'symbiosis toolkit', with reduced numbers of PCWDEs and lineage-specific suites of mycorrhiza-induced genes.

...read moreread less

799 citations

Journal Article•DOI•

Extensive sampling of basidiomycete genomes demonstrates inadequacy of the white-rot/brown-rot paradigm for wood decay fungi

[...]

Robert Riley¹, Asaf Salamov¹, Daren W. Brown², László Nagy³, Dimitrios Floudas³, Benjamin W. Held⁴, Anthony Levasseur⁵, Vincent Lombard⁵, Emmanuelle Morin⁶, Robert Otillar¹, Erika Lindquist¹, Hui Sun¹, Kurt LaButti¹, Jeremy Schmutz, Dina Jabbour⁷, Hong Luo⁷, Scott E. Baker⁸, Antonio G. Pisabarro⁹, Jonathan D. Walton¹⁰, Robert A. Blanchette⁴, Bernard Henrissat⁵, Francis Martin⁶, Daniel Cullen², David S. Hibbett³, Igor V. Grigoriev¹ - Show less +21 more•Institutions (10)

United States Department of Energy¹, United States Department of Agriculture², Clark University³, University of Minnesota⁴, Aix-Marseille University⁵, Institut national de la recherche agronomique⁶, Michigan State University⁷, Pacific Northwest National Laboratory⁸, Universidad Pública de Navarra⁹, Great Lakes Bioenergy Research Center¹⁰

08 Jul 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The results indicate that the prevailing paradigm of white rot vs. brown rot does not capture the diversity of fungal wood decay mechanisms, and suggest a continuum rather than a dichotomy between the white-rot and brown-rot modes of wood decay.

...read moreread less

Abstract: Basidiomycota (basidiomycetes) make up 32% of the described fungi and include most wood-decaying species, as well as pathogens and mutualistic symbionts. Wood-decaying basidiomycetes have typically been classified as either white rot or brown rot, based on the ability (in white rot only) to degrade lignin along with cellulose and hemicellulose. Prior genomic comparisons suggested that the two decay modes can be distinguished based on the presence or absence of ligninolytic class II peroxidases (PODs), as well as the abundance of enzymes acting directly on crystalline cellulose (reduced in brown rot). To assess the generality of the white-rot/brown-rot classification paradigm, we compared the genomes of 33 basidiomycetes, including four newly sequenced wood decayers, and performed phylogenetically informed principal-components analysis (PCA) of a broad range of gene families encoding plant biomass-degrading enzymes. The newly sequenced Botryobasidium botryosum and Jaapia argillacea genomes lack PODs but possess diverse enzymes acting on crystalline cellulose, and they group close to the model white-rot species Phanerochaete chrysosporium in the PCA. Furthermore, laboratory assays showed that both B. botryosum and J. argillacea can degrade all polymeric components of woody plant cell walls, a characteristic of white rot. We also found expansions in reducing polyketide synthase genes specific to the brown-rot fungi. Our results suggest a continuum rather than a dichotomy between the white-rot and brown-rot modes of wood decay. A more nuanced categorization of rot types is needed, based on an improved understanding of the genomics and biochemistry of wood decay.

...read moreread less

588 citations

Cites background from "MycoCosm portal: gearing up for 100..."

...Genome assemblies and annotations for the organisms used in this study are available via the JGI Genome Portal MycoCosm (http://jgi.doe. gov/fungi; see also Table S1)....
[...]
...into MycoCosm (78), a Web-based fungal resource for comparative analysis....
[...]
...Grigoriev IV, et al. (2014) MycoCosm portal: Gearing up for 1000 fungal genomes....
[...]
...All genomes were annotated using the JGI Annotation Pipeline (77), which combines several gene prediction and annotation methods with transcriptomics data, and integrates the annotated genomes into MycoCosm (78), a Web-based fungal resource for comparative analysis....
[...]

Journal Article•DOI•

The gut mycobiome of the Human Microbiome Project healthy cohort

[...]

Andrea K. Nash¹, Thomas A. Auchtung¹, Matthew C. Wong¹, Daniel P. Smith¹, Jonathan R. Gesell¹, Matthew C. Ross¹, Christopher J. Stewart¹, Ginger A. Metcalf¹, Donna M. Muzny¹, Richard A. Gibbs¹, Nadim J. Ajami¹, Joseph F. Petrosino¹ - Show less +8 more•Institutions (1)

Baylor College of Medicine¹

25 Nov 2017-Microbiome

TL;DR: The gut mycobiome of the Human Microbiome Project (HMP) cohort was investigated by sequencing the Internal Transcribed Spacer 2 (ITS2) region as well as the 18S rRNA gene, suggesting that it is a more sensitive method for studying the mycoboome of stool samples.

...read moreread less

Abstract: Most studies describing the human gut microbiome in healthy and diseased states have emphasized the bacterial component, but the fungal microbiome (i.e., the mycobiome) is beginning to gain recognition as a fundamental part of our microbiome. To date, human gut mycobiome studies have primarily been disease centric or in small cohorts of healthy individuals. To contribute to existing knowledge of the human mycobiome, we investigated the gut mycobiome of the Human Microbiome Project (HMP) cohort by sequencing the Internal Transcribed Spacer 2 (ITS2) region as well as the 18S rRNA gene. Three hundred seventeen HMP stool samples were analyzed by ITS2 sequencing. Fecal fungal diversity was significantly lower in comparison to bacterial diversity. Yeast dominated the samples, comprising eight of the top 15 most abundant genera. Specifically, fungal communities were characterized by a high prevalence of Saccharomyces, Malassezia, and Candida, with S. cerevisiae, M. restricta, and C. albicans operational taxonomic units (OTUs) present in 96.8, 88.3, and 80.8% of samples, respectively. There was a high degree of inter- and intra-volunteer variability in fungal communities. However, S. cerevisiae, M. restricta, and C. albicans OTUs were found in 92.2, 78.3, and 63.6% of volunteers, respectively, in all samples donated over an approximately 1-year period. Metagenomic and 18S rRNA gene sequencing data agreed with ITS2 results; however, ITS2 sequencing provided greater resolution of the relatively low abundance mycobiome constituents. Compared to bacterial communities, the human gut mycobiome is low in diversity and dominated by yeast including Saccharomyces, Malassezia, and Candida. Both inter- and intra-volunteer variability in the HMP cohort were high, revealing that unlike bacterial communities, an individual’s mycobiome is no more similar to itself over time than to another person’s. Nonetheless, several fungal species persisted across a majority of samples, evidence that a core gut mycobiome may exist. ITS2 sequencing data provided greater resolution of the mycobiome membership compared to metagenomic and 18S rRNA gene sequencing data, suggesting that it is a more sensitive method for studying the mycobiome of stool samples.

...read moreread less

558 citations

Cites background from "MycoCosm portal: gearing up for 100..."

...Finally, availability of fungal genomes is also lacking compared to bacteria, though there are efforts underway to change this [49]....
[...]

Journal Article•DOI•

Uniclust databases of clustered and deeply annotated protein sequences and alignments

[...]

Milot Mirdita¹, Lars von den Driesch², Lars von den Driesch¹, Clovis Galiez¹, Maria Jesus Martin², Johannes Söding¹, Martin Steinegger³, Martin Steinegger⁴, Martin Steinegger¹ - Show less +5 more•Institutions (4)

Max Planck Society¹, European Bioinformatics Institute², Technische Universität München³, Seoul National University⁴

04 Jan 2017-Nucleic Acids Research

TL;DR: Uniclust90 and Uniclust50 clusters showed better consistency of functional annotation than those of UniRef90 and UniRef50, owing to an optimised clustering pipeline that runs with the MMseqs2 software for fast and sensitive protein sequence searching and clustering.

...read moreread less

Abstract: We present three clustered protein sequence databases, Uniclust90, Uniclust50, Uniclust30 and three databases of multiple sequence alignments (MSAs), Uniboost10, Uniboost20 and Uniboost30, as a resource for protein sequence analysis, function prediction and sequence searches. The Uniclust databases cluster UniProtKB sequences at the level of 90%, 50% and 30% pairwise sequence identity. Uniclust90 and Uniclust50 clusters showed better consistency of functional annotation than those of UniRef90 and UniRef50, owing to an optimised clustering pipeline that runs with our MMseqs2 software for fast and sensitive protein sequence searching and clustering. Uniclust sequences are annotated with matches to Pfam, SCOP domains, and proteins in the PDB, using our HHblits homology detection tool. Due to its high sensitivity, Uniclust contains 17% more Pfam domain annotations than UniProt. Uniboost MSAs of three diversities are built by enriching the Uniclust30 MSAs with local sequence matches from MMseqs2 profile searches through Uniclust30. All databases can be downloaded from the Uniclust server at uniclust.mmseqs.com. Users can search clusters by keywords and explore their MSAs, taxonomic representation, and annotations. Uniclust is updated every two months with the new UniProt release.

...read moreread less

469 citations

Cites methods from "MycoCosm portal: gearing up for 100..."

...gz: archive containing three files with Pfam, SCOP, and PDB annotations, each formatted as tab-separated lists with nine columns: (1,2) identifiers for query and target, (3-5, 6-8) domain start and end-position and total sequence length for both UniProt and database sequence, (9) HHblits E-value....
[...]

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

The Pfam protein families database

[...]

Marco Punta¹, Penny Coggill¹, Ruth Y. Eberhardt¹, Jaina Mistry¹, John Tate¹, Chris Boursnell¹, Ningze Pang¹, Kristoffer Forslund¹, Goran Ceric¹, Jody Clements¹, Andreas Heger¹, Liisa Holm¹, Erik L. L. Sonnhammer¹, Sean R. Eddy¹, Alex Bateman¹, Robert D. Finn¹ - Show less +12 more•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jan 2000-Nucleic Acids Research

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.

...read moreread less

Abstract: Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is approximately 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11,912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).

...read moreread less

14,075 citations

Journal Article•DOI•

Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes

[...]

Anders Krogh¹, B. Larsson¹, G. von Heijne², Erik L. L. Sonnhammer³•Institutions (3)

Technical University of Denmark¹, Stockholm University², Karolinska Institutet³

19 Jan 2001-Journal of Molecular Biology

TL;DR: A new membrane protein topology prediction method, TMHMM, based on a hidden Markov model is described and validated, and it is discovered that proteins with N(in)-C(in) topologies are strongly preferred in all examined organisms, except Caenorhabditis elegans, where the large number of 7TM receptors increases the counts for N(out)-C-in topologies.

...read moreread less

11,453 citations

"MycoCosm portal: gearing up for 100..." refers methods in this paper

...SignalP (14) is used to detect the sequence motifs responsible for protein localization, TMHMM (15) identifies possible transmembrane domains and InterProScan (16) predicts functional domains from Pfam (17) and other databases....
[...]

Journal Article•DOI•

Pfam: the protein families database.

[...]

Robert D. Finn¹, Alex Bateman², Jody Clements¹, Penelope Coggill², Ruth Y. Eberhardt², Sean R. Eddy¹, Andreas Heger, Kirstie Hetherington³, Liisa Holm, Jaina Mistry², Erik L. L. Sonnhammer⁴, John Tate², Marco Punta² - Show less +9 more•Institutions (4)

Howard Hughes Medical Institute¹, European Bioinformatics Institute², Wellcome Trust Sanger Institute³, Stockholm University⁴

01 Jan 2014-Nucleic Acids Research

TL;DR: Pfam as discussed by the authors is a widely used database of protein families, containing 14 831 manually curated entries in the current version, version 27.0, and has been updated several times since 2012.

...read moreread less

Abstract: Pfam, available via servers in the UK (http://pfam.sanger.ac.uk/) and the USA (http://pfam.janelia.org/), is a widely used database of protein families, containing 14 831 manually curated entries in the current release, version 27.0. Since the last update article 2 years ago, we have generated 1182 new families and maintained sequence coverage of the UniProt Knowledgebase (UniProtKB) at nearly 80%, despite a 50% increase in the size of the underlying sequence database. Since our 2012 article describing Pfam, we have also undertaken a comprehensive review of the features that are provided by Pfam over and above the basic family data. For each feature, we determined the relevance, computational burden, usage statistics and the functionality of the feature in a website context. As a consequence of this review, we have removed some features, enhanced others and developed new ones to meet the changing demands of computational biology. Here, we describe the changes to Pfam content. Notably, we now provide family alignments based on four different representative proteome sequence data sets and a new interactive DNA search interface. We also discuss the mapping between Pfam and known 3D structures.

...read moreread less

9,415 citations

"MycoCosm portal: gearing up for 100..." refers methods in this paper

...SignalP (14) is used to detect the sequence motifs responsible for protein localization, TMHMM (15) identifies possible transmembrane domains and InterProScan (16) predicts functional domains from Pfam (17) and other databases....
[...]

Journal Article•DOI•

SignalP 4.0: discriminating signal peptides from transmembrane regions

[...]

Thomas Nordahl Petersen¹, Søren Brunak¹, Søren Brunak², Gunnar von Heijne³, Gunnar von Heijne⁴, Henrik Nielsen¹ - Show less +2 more•Institutions (4)

Technical University of Denmark¹, University of Copenhagen², Stockholm University³, Science for Life Laboratory⁴

01 Oct 2011-Nature Methods

TL;DR: SignalP 4.0 was the best signal-peptide predictor for all three organism types but was not in all cases as good as SignalP 3.0 according to cleavage-site sensitivity or signal- peptide correlation when there are no transmembrane proteins present.

...read moreread less

Abstract: We benchmarked SignalP 4.0 against SignalP 3.0 and ten other signal peptide prediction algorithms (Fig. 1). We compared prediction performance using the Matthews correlation coefficient16, for which each sequence was counted as a true or false positive or negative. To test SignalP 4.0 performance, we did not use data that had been used in training the networks or selecting the optimal architecture, and the test data did not contain homologs to the training and optimization data (Supplementary Methods). The test set for SignalP 3.0 was also independent of the training set because we removed sequences used to construct SignalP 3.0 and their homologs from the benchmark data. For other algorithms more recent than SignalP 3.0, the benchmark data may include data used to train the methods, possibly leading to slight overestimations of their performance. Our results show that SignalP 4.0 was the best signal-peptide predictor for all three organism types (Fig. 1). This comes at a price, however, because SignalP 4.0 was not in all cases as good as SignalP 3.0 according to cleavage-site sensitivity or signal-peptide correlation when there are no transmembrane proteins present (Supplementary Results). An ideal method would have the best SignalP 4.0: discriminating signal peptides from transmembrane regions

...read moreread less

8,370 citations

"MycoCosm portal: gearing up for 100..." refers methods in this paper

...SignalP (14) is used to detect the sequence motifs responsible for protein localization, TMHMM (15) identifies possible transmembrane domains and InterProScan (16) predicts functional domains from Pfam (17) and other databases....
[...]

Journal Article•DOI•

KEGG for integration and interpretation of large-scale molecular data sets

[...]

Minoru Kanehisa¹, Susumu Goto², Yoko Sato², Miho Furumichi², Mao Tanabe² - Show less +1 more•Institutions (2)

Kyoto University¹, University of Tokyo²

01 Jan 2012-Nucleic Acids Research

TL;DR: KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping, enabling integration and interpretation of large-scale data sets and recent enhancements to the K EGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.

...read moreread less

Abstract: Kyoto Encyclopedia of Genes and Genomes (KEGG, http://www.genome.jp/kegg/ or http://www.kegg.jp/) is a database resource that integrates genomic, chemical and systemic functional information. In particular, gene catalogs from completely sequenced genomes are linked to higher-level systemic functions of the cell, the organism and the ecosystem. Major efforts have been undertaken to manually create a knowledge base for such systemic functions by capturing and organizing experimental knowledge in computable forms; namely, in the forms of KEGG pathway maps, BRITE functional hierarchies and KEGG modules. Continuous efforts have also been made to develop and improve the cross-species annotation procedure for linking genomes to the molecular networks through the KEGG Orthology system. Here we report KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping, enabling integration and interpretation of large-scale data sets. We also report a variant of the KEGG mapping procedure to extend the knowledge base, where different types of data and knowledge, such as disease genes and drug targets, are integrated as part of the KEGG molecular networks. Finally, we describe recent enhancements to the KEGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.

...read moreread less

4,259 citations

"MycoCosm portal: gearing up for 100..." refers background in this paper

...Interpro, KEGG and Swiss-Prot hits are used to map gene ontology (GO) terms (21)....
[...]
...gov), Swiss-Prot (18), KEGG (19) and KOG (20) databases additionally facilitate functional interpretation....
[...]
...Protein alignments to the NCBI’s nonredundant (http://www.ncbi.nlm.nih.gov), Swiss-Prot (18), KEGG (19) and KOG (20) databases additionally facilitate functional interpretation....
[...]
...Additionally, summaries listing numbers of genes by category in the GO, KEGG and KOG classifications are accessible from the portal menu, and can be compared with other selected genomes to explore gene family expansions and contractions across genomes....
[...]
...Mycocosm therefore includes tools that integrate single genomes into a comparative context, such as the ability to visualize variation in gene counts in different GO, KEGG and KOG categories across a user-selected assortment of genomes....
[...]