scispace - formally typeset
Search or ask a question
Topic

Annotation

About: Annotation is a research topic. Over the lifetime, 6719 publications have been published within this topic receiving 203463 citations. The topic is also known as: note & markup.


Papers
More filters
Journal ArticleDOI
TL;DR: The Gene Ontology Annotation resource now provides annotations to five times the number of proteins it did 4 years ago, thanks to the use of quality control checks that ensures that the GOA resource supplies high-quality functional information to proteins from a wide range of species.
Abstract: The Gene Ontology Annotation (GOA) resource (http://www.ebi.ac.uk/GOA) provides evidence-based Gene Ontology (GO) annotations to proteins in the UniProt Knowledgebase (UniProtKB). Manual annotations provided by UniProt curators are supplemented by manual and automatic annotations from model organism databases and specialist annotation groups. GOA currently supplies 368 million GO annotations to almost 54 million proteins in more than 480,000 taxonomic groups. The resource now provides annotations to five times the number of proteins it did 4 years ago. As a member of the GO Consortium, we adhere to the most up-to-date Consortium-agreed annotation guidelines via the use of quality control checks that ensures that the GOA resource supplies high-quality functional information to proteins from a wide range of species. Annotations from GOA are freely available and are accessible through a powerful web browser as well as a variety of annotation file formats.

498 citations

Journal ArticleDOI
Judith A. Blake, Mary E. Dolan, H. Drabkin, David P. Hill, Li N, D. Sitnikov, Susan M. Bridges1, Shane C. Burgess1, Teresia Buza1, Fiona M. McCarthy1, Divyaswetha Peddinti1, Lakshmi Pillai1, Seth Carbon2, Heiko Dietze2, Amelia Ireland2, Suzanna E. Lewis2, Christopher J. Mungall2, Pascale Gaudet3, Chrisholm Rl3, Petra Fey3, Warren A. Kibbe3, S. Basu3, Deborah A. Siegele4, B. K. McIntosh4, Daniel P. Renfro4, Adrienne E. Zweifel4, James C. Hu4, Nicholas H. Brown5, Susan Tweedie5, Yasmin Alam-Faruque6, Rolf Apweiler6, A. Auchinchloss6, Kristian B. Axelsen6, Benoit Bely6, M. C. Blatter6, Bonilla C6, Bouguerleret L6, Emmanuel Boutet6, Lionel Breuza6, Alan Bridge6, W. M. Chan6, Gayatri Chavali6, Elisabeth Coudert6, E. Dimmer6, Anne Estreicher6, L Famiglietti6, Marc Feuermann6, Arnaud Gos6, Nadine Gruaz-Gumowski6, Hieta R6, Hinz C6, Chantal Hulo6, Rachael P. Huntley6, J. James6, Florence Jungo6, Guillaume Keller6, Kati Laiho6, Duncan Legge6, P. Lemercier6, Damien Lieberherr6, Michele Magrane6, Maria Jesus Martin6, Patrick Masson6, Mutowo-Muellenet P6, Claire O'Donovan6, Ivo Pedruzzi6, Klemens Pichler6, Diego Poggioli6, Porras Millán P6, Sylvain Poux6, Catherine Rivoire6, Bernd Roechert6, Tony Sawford6, Michel Schneider6, Andre Stutz6, Shyamala Sundaram6, Michael Tognolli6, Ioannis Xenarios6, Foulgar R, Jane Lomax, Paola Roncaglia, Varsha K. Khodiyar7, Ruth C. Lovering7, Philippa J. Talmud7, Marcus C. Chibucos8, Giglio Mg9, Hsin-Yu Chang9, Sarah Hunter9, Craig McAnulla9, Alex L. Mitchell9, Sangrador A9, Stephan R, Midori A. Harris5, Stephen G. Oliver5, Kim Rutherford5, Wood7, Jürg Bähler7, Antonia Lock7, Paul J. Kersey9, McDowall Dm9, Daniel M. Staines9, Melinda R. Dwinell10, Mary Shimoyama10, Stan Laulederkind10, Tom Hayman10, Shur-Jen Wang10, Timothy F. Lowry10, P D'Eustachio11, Lisa Matthews11, Rama Balakrishnan12, Gail Binkley12, J. M. Cherry12, Maria C. Costanzo12, Selina S. Dwight12, Engel12, Dianna G. Fisk12, Benjamin C. Hitz12, Eurie L. Hong12, Kalpana Karra12, Miyasato12, Robert S. Nash12, Julie Park12, Marek S. Skrzypek12, Shuai Weng12, Edith D. Wong12, Tanya Z. Berardini13, Eva Huala13, Huaiyu Mi14, Paul Thomas14, Juancarlos Chan15, Ranjana Kishore15, Paul W. Sternberg15, Van Auken K15, Doug Howe16, Monte Westerfield16 
TL;DR: The Gene Ontology (GO) Consortium is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies and has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology.
Abstract: The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources.

492 citations

Proceedings ArticleDOI
20 Jun 2009
TL;DR: A new probabilistic model for jointly modeling the image, its class label, and its annotations is developed, which derives an approximate inference and estimation algorithms based on variational methods, as well as efficient approximations for classifying and annotating new images.
Abstract: Image classification and annotation are important problems in computer vision, but rarely considered together. Intuitively, annotations provide evidence for the class label, and the class label provides evidence for annotations. For example, an image of class highway is more likely annotated with words “road,” “car,” and “traffic” than words “fish,” “boat,” and “scuba.” In this paper, we develop a new probabilistic model for jointly modeling the image, its class label, and its annotations. Our model treats the class label as a global description of the image, and treats annotation terms as local descriptions of parts of the image. Its underlying probabilistic assumptions naturally integrate these two sources of information. We derive an approximate inference and estimation algorithms based on variational methods, as well as efficient approximations for classifying and annotating new images. We examine the performance of our model on two real-world image data sets, illustrating that a single model provides competitive annotation performance, and superior classification performance.

490 citations

Journal ArticleDOI
01 Oct 2010
TL;DR: This work proposes a strongly performing method that scales to image annotation datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations.
Abstract: Image annotation datasets are becoming larger and larger, with tens of millions of images and tens of thousands of possible annotations. We propose a strongly performing method that scales to such datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations. Our method both outperforms several baseline methods and, in comparison to them, is faster and consumes less memory. We also demonstrate how our method learns an interpretable model, where annotations with alternate spellings or even languages are close in the embedding space. Hence, even when our model does not predict the exact annotation given by a human labeler, it often predicts similar annotations, a fact that we try to quantify by measuring the newly introduced "sibling" precision metric, where our method also obtains excellent results.

488 citations

Book ChapterDOI
12 Oct 2008
TL;DR: This work introduces a new baseline technique for image annotation that treats annotation as a retrieval problem and outperforms the current state-of-the-art methods on two standard and one large Web dataset.
Abstract: Automatically assigning keywords to images is of great interest as it allows one to index, retrieve, and understand large collections of image data. Many techniques have been proposed for image annotation in the last decade that give reasonable performance on standard datasets. However, most of these works fail to compare their methods with simple baseline techniques to justify the need for complex models and subsequent training. In this work, we introduce a new baseline technique for image annotation that treats annotation as a retrieval problem. The proposed technique utilizes low-level image features and a simple combination of basic distances to find nearest neighbors of a given image. The keywords are then assigned using a greedy label transfer mechanism. The proposed baseline outperforms the current state-of-the-art methods on two standard and one large Web dataset. We believe that such a baseline measure will provide a strong platform to compare and better understand future annotation techniques.

483 citations


Network Information
Related Topics (5)
Inference
36.8K papers, 1.3M citations
81% related
Deep learning
79.8K papers, 2.1M citations
80% related
Graph (abstract data type)
69.9K papers, 1.2M citations
80% related
Unsupervised learning
22.7K papers, 1M citations
79% related
Cluster analysis
146.5K papers, 2.9M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20231,461
20223,073
2021305
2020401
2019383
2018373