Home
/
Topics
/
Annotation

Topic

Annotation

About: Annotation is a research topic. Over the lifetime, 6719 publications have been published within this topic receiving 203463 citations. The topic is also known as: note & markup.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1987
1986
1981
1978
1968
1966
1965
1956
1949
1946
1938
1932
1900

Papers

PDF

Open Access

More filters

Posted Content•DOI•

Tximeta: reference sequence checksums for provenance identification in RNA-seq

[...]

Michael I. Love¹, Charlotte Soneson², Charlotte Soneson³, Peter Hickey⁴, Peter Hickey⁵, Lisa K. Johnson⁶, N. T. Pierce⁶, Lori Shepherd⁷, Martin Morgan⁷, Rob Patro⁸ - Show less +6 more•Institutions (8)

University of North Carolina at Chapel Hill¹, Friedrich Miescher Institute for Biomedical Research², Swiss Institute of Bioinformatics³, University of Melbourne⁴, Walter and Eliza Hall Institute of Medical Research⁵, University of California, Berkeley⁶, Roswell Park Cancer Institute⁷, University of Maryland, College Park⁸

25 Sep 2019-bioRxiv

TL;DR: This work provides a solution in the form of an R/Bioconductor package tximeta that performs numerous annotation and metadata gathering tasks automatically on behalf of users during the import of transcript quantification files.

...read moreread less

Abstract: Correct annotation metadata is critical for reproducible and accurate RNA-seq analysis. When files are shared publicly or among collaborators with incorrect or missing annotation metadata, it becomes difficult or impossible to reproduce bioinformatic analyses from raw data. It also makes it more difficult to locate the transcriptomic features, such as transcripts or genes, in their proper genomic context, which is necessary for overlapping expression data with other datasets. We provide a solution in the form of an R/Bioconductor package tximeta that performs numerous annotation and metadata gathering tasks automatically on behalf of users during the import of transcript quantification files. The correct reference transcriptome is identified via a hashed checksum stored in the quantification output, and key transcript databases are downloaded and cached locally. The computational paradigm of automatically adding annotation metadata based on reference sequence checksums can greatly facilitate genomic workflows, by helping to reduce overhead during bioinformatic analyses, preventing costly bioinformatic mistakes, and promoting computational reproducibility. The tximeta package is available at https://bioconductor.org/packages/tximeta.

...read moreread less

81 citations

Journal Article•DOI•

Reevaluating Human Gene Annotation: A Second-Generation Analysis of Chromosome 22

[...]

John E. Collins¹, Melanie E. Goward, Charlotte G. Cole, Luc J. Smink, Elizabeth J. Huckle, Sarah C. L. Knowles, Jacqueline M. Bye, David Beare, Ian Dunham - Show less +5 more•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jan 2003-Genome Research

TL;DR: A second-generation gene annotation of human chromosome 22 is reported, using expressed sequence databases, comparative sequence analysis, and experimental verification to suggest that the revised annotation criteria provide a paradigm for future annotation of the human genome.

...read moreread less

Abstract: We report a second-generation gene annotation of human chromosome 22. Using expressed sequence databases, comparative sequence analysis, and experimental verification, we have extended genes, fused previously fragmented structures, and identified new genes. The total length in exons of annotation was increased by 74% over our previously published annotation and includes 546 protein-coding genes and 234 pseudogenes. Thirty-two potential protein-coding annotations are partial copies of other genes, and may represent duplications on an evolutionary path to change or loss of function. We also identified 31 non-protein-coding transcripts, including 16 possible antisense RNAs. By extrapolation, we estimate the human genome contains 29,000-36,000 protein-coding genes, 21,300 pseudogenes, and 1500 antisense RNAs. We suggest that our revised annotation criteria provide a paradigm for future annotation of the human genome.

...read moreread less

81 citations

Proceedings Article•

EDUTELLA: searching and annotating resources within an RDF-based P2P network

[...]

Wolfgang Nejdl¹, Boris Wolf¹, Steffen Staab, Julien Tane•Institutions (1)

Leibniz University of Hanover¹

07 May 2002

TL;DR: This contribution describes the open source project Edutella which builds upon metadata standards defined for the WWW and aims to provide an RDF-based metadata infrastructure for P2P applications, building on the recently announced JXTA Framework.

...read moreread less

Abstract: P2P applications for searching and exchanging information over the Web have become increasingly popular. This has lead to a number of (usually thematically) focused communities, which allow efficient searching within such communities, and which use specific metadata sets to specify the resources stored within the P2P network. By concentrating on domain and application specific formats for metadata and query languages, however, current P2P networks appear to be fragmenting into non-interoperable niche markets. This contribution describes the open source project Edutella which builds upon metadata standards defined for the WWW and aims to provide an RDF-based metadata infrastructure for P2P applications, building on the recently announced JXTA Framework. We describe one basic service (query) and an Edutella application (annotation) within this network, both being built on a common query language exchange format, and specify the main architecture and APIs of the Edutella P2P network.

...read moreread less

81 citations

Proceedings Article•

Automatic annotation of data extracted from large Web sites.

[...]

Luigi Arlotta, Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo

01 Jan 2003

81 citations

Patent•

Interactive machine learning system for automated annotation of information in text

[...]

David Johnson, Sylvie Levesque, Tong Zhang

31 Jul 2003

TL;DR: In this article, an interactive machine learning-based system that incrementally learns how to annotate new text data is presented, where the user is selectively presented for review and appropriate action, with a convenient and efficient interface so that context of use can be verified if necessary in order to evaluate the annotations and correct them.

...read moreread less

Abstract: An interactive machine learning based system that incrementally learns, on the basis of text data, how to annotate new text data. The system and method starts with partially annotated training data or alternatively unannotated training data and a set of examples of what is to be learned. Through iterative interactive training sessions with a user the system trains annotators, and these are in turn used to discover more annotations in the text data. Once all of the text data or a sufficient amount of the text data is annotated, at the user's discretion, the system learns a final annotator or annotators, which are exported and available to annotate new textual data. As the iterative training process occurs the user is selectively presented for review and appropriate action, system-determined representations of the annotation instances and provided a convenient and efficient interface so that context of use can be verified if necessary in order to evaluate the annotations and correct them, where required. At the user's discretion, annotations that receive a high confidence level can be automatically accepted and those with low confidence levels can be automatically rejected.

...read moreread less

81 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
…
90
91
92
93
94
95
96
…
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

11,409

Papers

238,885

Citations

No. of papers in the topic in previous years
Year	Papers
2023	1,461
2022	3,073
2021	305
2020	401
2019	383
2018	373

Annotation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics