Author
Marcus C. Chibucos
Other affiliations: Virginia Tech, Virginia Bioinformatics Institute
Bio: Marcus C. Chibucos is an academic researcher from University of Maryland, Baltimore. The author has contributed to research in topics: Ontology (information science) & Annotation. The author has an hindex of 27, co-authored 45 publications receiving 6660 citations. Previous affiliations of Marcus C. Chibucos include Virginia Tech & Virginia Bioinformatics Institute.
Papers
More filters
••
TL;DR: GO-CAM, a new framework for representing gene function that is more expressive than standard GO annotations, has been released, and users can now explore the growing repository of these models.
Abstract: The Gene Ontology resource (GO; http://geneontology.org) provides structured, computable knowledge regarding the functions of genes and gene products. Founded in 1998, GO has become widely adopted in the life sciences, and its contents are under continual improvement, both in quantity and in quality. Here, we report the major developments of the GO resource during the past two years. Each monthly release of the GO resource is now packaged and given a unique identifier (DOI), enabling GO-based analyses on a specific release to be reproduced in the future. The molecular function ontology has been refactored to better represent the overall activities of gene products, with a focus on transcription regulator activities. Quality assurance efforts have been ramped up to address potentially out-of-date or inaccurate annotations. New evidence codes for high-throughput experiments now enable users to filter out annotations obtained from these sources. GO-CAM, a new framework for representing gene function that is more expressive than standard GO annotations, has been released, and users can now explore the growing repository of these models. We also provide the ‘GO ribbon’ widget for visualizing GO annotations to a gene; the widget can be easily embedded in any web page.
2,138 citations
••
TL;DR: A historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations is made available to maintain consistency with other ontologies.
Abstract: The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.
1,988 citations
••
Broad Institute1, Sainsbury Laboratory2, Ohio Agricultural Research and Development Center3, Uppsala University4, Wageningen University and Research Centre5, Virginia Bioinformatics Institute6, University of California, Riverside7, University of Aberdeen8, Scottish Crop Research Institute9, University of Warwick10, Agricultural Research Service11, Royal Institute of Technology12, Cornell University13, Oregon State University14, Lafayette College15, University of Glasgow16, Harvard University17, Delaware Biotechnology Institute18, North Carolina State University19, University of Delaware20, University of Tennessee21, University of Maryland, Baltimore22, Vanderbilt University23, College of Wooster24, Bowling Green State University25, Edinburgh Cancer Research Centre26, J. Craig Venter Institute27, Tel Aviv University28, University of Wisconsin-Madison29, University of Hohenheim30, University of Dundee31
TL;DR: The sequence of the P. infestans genome is reported, which at ∼240 megabases (Mb) is by far the largest and most complex genome sequenced so far in the chromalveolates and probably plays a crucial part in the rapid adaptability of the pathogen to host plants and underpins its evolutionary potential.
Abstract: Phytophthora infestans is the most destructive pathogen of potato and a model organism for the oomycetes, a distinct lineage of fungus-like eukaryotes that are related to organisms such as brown algae and diatoms. As the agent of the Irish potato famine in the mid-nineteenth century, P. infestans has had a tremendous effect on human history, resulting in famine and population displacement(1). To this day, it affects world agriculture by causing the most destructive disease of potato, the fourth largest food crop and a critical alternative to the major cereal crops for feeding the world's population(1). Current annual worldwide potato crop losses due to late blight are conservatively estimated at $6.7 billion(2). Management of this devastating pathogen is challenged by its remarkable speed of adaptation to control strategies such as genetically resistant cultivars(3,4). Here we report the sequence of the P. infestans genome, which at similar to 240 megabases (Mb) is by far the largest and most complex genome sequenced so far in the chromalveolates. Its expansion results from a proliferation of repetitive DNA accounting for similar to 74% of the genome. Comparison with two other Phytophthora genomes showed rapid turnover and extensive expansion of specific families of secreted disease effector proteins, including many genes that are induced during infection or are predicted to have activities that alter host physiology. These fast-evolving effector genes are localized to highly dynamic and expanded regions of the P. infestans genome. This probably plays a crucial part in the rapid adaptability of the pathogen to host plants and underpins its evolutionary potential.
1,341 citations
••
Mississippi State University1, Lawrence Berkeley National Laboratory2, Northwestern University3, Texas A&M University4, University of Cambridge5, Swiss Institute of Bioinformatics6, University College London7, University of Maryland, Baltimore8, European Bioinformatics Institute9, Medical College of Wisconsin10, New York University11, Stanford University12, Carnegie Institution for Science13, University of Southern California14, California Institute of Technology15, University of Oregon16
TL;DR: The Gene Ontology (GO) Consortium is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies and has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology.
Abstract: The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources.
492 citations
••
University of Warwick1, Virginia Bioinformatics Institute2, University of East Anglia3, Utrecht University4, John Innes Centre5, Goethe University Frankfurt6, University of California, Riverside7, Virginia Tech8, University of California, Berkeley9, Lawrence Berkeley National Laboratory10, Washington University in St. Louis11, Agriculture and Agri-Food Canada12, Nanjing Agricultural University13, University of Toulouse14, Centre national de la recherche scientifique15, Wageningen University and Research Centre16, Wellcome Trust17, Broad Institute18, Bowling Green State University19
TL;DR: The genome sequence of the oomycete Hyaloperonospora arabidopsidis is reported, an obligate biotroph and natural pathogen of Arabidopsis thaliana, which exhibits dramatic reductions in genes encoding RXLR effectors, proteins associated with zoospore formation and motility, and enzymes for assimilation of inorganic nitrogen and sulfur.
Abstract: Many oomycete and fungal plant pathogens are obligate biotrophs, which extract nutrients only from living plant tissue and cannot grow apart from their hosts. Although these pathogens cause substantial crop losses, little is known about the molecular basis or evolution of obligate biotrophy. Here, we report the genome sequence of the oomycete Hyaloperonospora arabidopsidis (Hpa), an obligate biotroph and natural pathogen of Arabidopsis thaliana. In comparison with genomes of related, hemibiotrophic Phytophthora species, the Hpa genome exhibits dramatic reductions in genes encoding (i) RXLR effectors and other secreted pathogenicity proteins, (ii) enzymes for assimilation of inorganic nitrogen and sulfur, and (iii) proteins associated with zoospore formation and motility. These attributes comprise a genomic signature of evolution toward obligate biotrophy.
424 citations
Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。
18,940 citations
••
TL;DR: The Reactome Knowledgebase provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations—an extended version of a classic metabolic map, in a single consistent data model.
Abstract: The Reactome Knowledgebase (www.reactome.org) provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations-an extended version of a classic metabolic map, in a single consistent data model. Reactome functions both as an archive of biological processes and as a tool for discovering unexpected functional relationships in data such as gene expression pattern surveys or somatic mutation catalogues from tumour cells. Over the last two years we redeveloped major components of the Reactome web interface to improve usability, responsiveness and data visualization. A new pathway diagram viewer provides a faster, clearer interface and smooth zooming from the entire reaction network to the details of individual reactions. Tool performance for analysis of user datasets has been substantially improved, now generating detailed results for genome-wide expression datasets within seconds. The analysis module can now be accessed through a RESTFul interface, facilitating its inclusion in third party applications. A new overview module allows the visualization of analysis results on a genome-wide Reactome pathway hierarchy using a single screen page. The search interface now provides auto-completion as well as a faceted search to narrow result lists efficiently.
5,065 citations
••
TL;DR: The UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal and a credit-based publication submission interface was developed.
Abstract: Abstract The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.
4,001 citations
••
TL;DR: Changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks are described.
Abstract: Cellular life depends on a complex web of functional associations between biomolecules. Among these associations, protein-protein interactions are particularly important due to their versatility, specificity and adaptability. The STRING database aims to integrate all known and predicted associations between proteins, including both physical interactions as well as functional associations. To achieve this, STRING collects and scores evidence from a number of sources: (i) automated text mining of the scientific literature, (ii) databases of interaction experiments and annotated complexes/pathways, (iii) computational interaction predictions from co-expression and from conserved genomic context and (iv) systematic transfers of interaction evidence from one organism to another. STRING aims for wide coverage; the upcoming version 11.5 of the resource will contain more than 14 000 organisms. In this update paper, we describe changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks. In addition, we describe how to query STRING with genome-wide, experimental data, including the automated detection of enriched functionalities and potential biases in the user's query data. The STRING resource is available online, at https://string-db.org/.
3,253 citations
••
TL;DR: The recent convergence of molecular studies of plant immunity and pathogen infection strategies is revealing an integrated picture of the plant–pathogen interaction from the perspective of both organisms, suggesting novel biotechnological approaches to crop protection.
Abstract: Plants are engaged in a continuous co-evolutionary struggle for dominance with their pathogens. The outcomes of these interactions are of particular importance to human activities, as they can have dramatic effects on agricultural systems. The recent convergence of molecular studies of plant immunity and pathogen infection strategies is revealing an integrated picture of the plant-pathogen interaction from the perspective of both organisms. Plants have an amazing capacity to recognize pathogens through strategies involving both conserved and variable pathogen elicitors, and pathogens manipulate the defence response through secretion of virulence effector molecules. These insights suggest novel biotechnological approaches to crop protection.
2,666 citations