scispace - formally typeset
Topic

Annotation

About: Annotation is a(n) research topic. Over the lifetime, 6719 publication(s) have been published within this topic receiving 203463 citation(s). The topic is also known as: note & markup.


Papers
More filters
Journal ArticleDOI
TL;DR: Blast2GO (B2G), a research tool designed with the main purpose of enabling Gene Ontology (GO) based data mining on sequence data for which no GO annotation is yet available, is presented.
Abstract: Summary: We present here Blast2GO (B2G), a research tool designed with the main purpose of enabling Gene Ontology (GO) based data mining on sequence data for which no GO annotation is yet available. B2G joints in one application GO annotation based on similarity searches with statistical analysis and highlighted visualization on directed acyclic graphs. This tool offers a suitable platform for functional genomics research in non-model species. B2G is an intuitive and interactive desktop application that allows monitoring and comprehension of the whole annotation and analysis process. Availability: Blast2GO is freely available via Java Web Start at http://www.blast2go.de Supplementary material:http://www.blast2go.de -> Evaluation Contact:[email protected]; [email protected]

9,021 citations

Journal ArticleDOI
TL;DR: A mature web tool for rapid and reliable display of any requested portion of the genome at any scale, together with several dozen aligned annotation tracks, is provided at http://genome.ucsc.edu.
Abstract: As vertebrate genome sequences near completion and research refocuses to their analysis, the issue of effective genome annotation display becomes critical. A mature web tool for rapid and reliable display of any requested portion of the genome at any scale, together with several dozen aligned annotation tracks, is provided at http://genome.ucsc.edu. This browser displays assembly contigs and gaps, mRNA and expressed sequence tag alignments, multiple gene predictions, cross-species homologies, single nucleotide polymorphisms, sequence-tagged sites, radiation hybrid data, transposon repeats, and more as a stack of coregistered tracks. Text and sequence-based searches provide quick and precise access to any region of specific interest. Secondary links from individual features lead to sequence details and supplementary off-site databases. One-half of the annotation tracks are computed at the University of California, Santa Cruz from publicly available sequence data; collaborators worldwide provide the rest. Users can stably add their own custom tracks to the browser for educational or research purposes. The conceptual and technical framework of the browser, its underlying MYSQL database, and overall use are described. The web site currently serves over 50,000 pages per day to over 3000 different users.

8,355 citations

Journal ArticleDOI
TL;DR: DAMID is a web-accessible program that integrates functional genomic annotations with intuitive graphical summaries that assists in the interpretation of genome-scale datasets by facilitating the transition from data collection to biological meaning.
Abstract: The distributed nature of biological knowledge poses a major challenge to the interpretation of genome-scale datasets, including those derived from microarray and proteomic studies. This report describes DAVID, a web-accessible program that integrates functional genomic annotations with intuitive graphical summaries. Lists of gene or protein identifiers are rapidly annotated and summarized according to shared categorical data for Gene Ontology, protein domain, and biochemical pathway membership. DAVID assists in the interpretation of genome-scale datasets by facilitating the transition from data collection to biological meaning.

8,062 citations

Journal ArticleDOI
Midori A. Harris, Jennifer I. Clark1, Ireland A1, Jane Lomax1, Michael Ashburner1, Michael Ashburner2, R. Foulger2, R. Foulger1, Karen Eilbeck1, Karen Eilbeck3, Suzanna E. Lewis1, Suzanna E. Lewis3, B. Marshall1, B. Marshall3, Christopher J. Mungall1, Christopher J. Mungall3, J. Richter3, J. Richter1, Gerald M. Rubin1, Gerald M. Rubin3, Judith A. Blake1, Carol J. Bult1, Dolan M1, Drabkin H1, Janan T. Eppig1, Hill Dp1, L. Ni1, Ringwald M1, Rama Balakrishnan1, Rama Balakrishnan4, J. M. Cherry4, J. M. Cherry1, Karen R. Christie1, Karen R. Christie4, Maria C. Costanzo4, Maria C. Costanzo1, Selina S. Dwight1, Selina S. Dwight4, Stacia R. Engel4, Stacia R. Engel1, Dianna G. Fisk1, Dianna G. Fisk4, Jodi E. Hirschman4, Jodi E. Hirschman1, Eurie L. Hong1, Eurie L. Hong4, Robert S. Nash1, Robert S. Nash4, Anand Sethuraman4, Anand Sethuraman1, Chandra L. Theesfeld1, Chandra L. Theesfeld4, David Botstein1, David Botstein5, Kara Dolinski1, Kara Dolinski5, Becket Feierbach5, Becket Feierbach1, Tanya Z. Berardini1, Tanya Z. Berardini6, S. Mundodi6, S. Mundodi1, Seung Y. Rhee1, Seung Y. Rhee6, Rolf Apweiler1, Daniel Barrell1, Camon E1, E. Dimmer1, Lee1, Rex L. Chisholm, Pascale Gaudet1, Pascale Gaudet7, Warren A. Kibbe1, Warren A. Kibbe7, Ranjana Kishore1, Ranjana Kishore8, Erich M. Schwarz1, Erich M. Schwarz8, Paul W. Sternberg1, Paul W. Sternberg8, M. Gwinn1, Hannick L1, Wortman J1, Matthew Berriman9, Matthew Berriman1, Wood1, Wood9, de la Cruz N10, de la Cruz N1, Peter J. Tonellato10, Peter J. Tonellato1, Pankaj Jaiswal1, Pankaj Jaiswal11, Seigfried T12, Seigfried T1, White R1, White R13 
TL;DR: The Gene Ontology (GO) project as discussed by the authors provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.
Abstract: The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

3,169 citations

Journal ArticleDOI
TL;DR: The SWISS-PROT protein knowledgebase connects amino acid sequences with the current knowledge in the Life Sciences by providing an interdisciplinary overview of relevant information by bringing together experimental results, computed features and sometimes even contradictory conclusions.
Abstract: The SWISS-PROT protein knowledgebase (http://www.expasy.org/sprot/ and http://www.ebi.ac.uk/swissprot/) connects amino acid sequences with the current knowledge in the Life Sciences. Each protein entry provides an interdisciplinary overview of relevant information by bringing together experimental results, computed features and sometimes even contradictory conclusions. Detailed expertise that goes beyond the scope of SWISS-PROT is made available via direct links to specialised databases. SWISS-PROT provides annotated entries for all species, but concentrates on the annotation of entries from human (the HPI project) and other model organisms to ensure the presence of high quality annotation for representative members of all protein families. Part of the annotation can be transferred to other family members, as is already done for microbes by the High-quality Automated and Manual Annotation of microbial Proteomes (HAMAP) project. Protein families and groups of proteins are regularly reviewed to keep up with current scientific findings. Complementarily, TrEMBL strives to comprise all protein sequences that are not yet represented in SWISS-PROT, by incorporating a perpetually increasing level of mostly automated annotation. Researchers are welcome to contribute their knowledge to the scientific community by submitting relevant findings to SWISS-PROT at swiss-prot@expasy.org.

3,129 citations

Network Information
Related Topics (5)
Inference
36.8K papers, 1.3M citations
81% related
Deep learning
79.8K papers, 2.1M citations
80% related
Graph (abstract data type)
69.9K papers, 1.2M citations
80% related
Unsupervised learning
22.7K papers, 1M citations
79% related
Cluster analysis
146.5K papers, 2.9M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20224
2021230
2020344
2019367
2018367
2017314