Home
/
Topics
/
Annotation

Topic

Annotation

About: Annotation is a research topic. Over the lifetime, 6719 publications have been published within this topic receiving 203463 citations. The topic is also known as: note & markup.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1987
1986
1981
1978
1968
1966
1965
1956
1949
1946
1938
1932
1900

Papers

PDF

Open Access

More filters

Syntax Annotation for the GENIA Corpus

[...]

Yuka Tateisi¹, Akane Yakushiji¹, Tomoko Ohta¹, Jun'ichi Tsujii¹•Institutions (1)

University of Tokyo¹

01 Jan 2005

TL;DR: Inter-annotator agreement test indicated that the writing style rather than the contents of the research abstracts is the source of the difficulty in tree annotation, and that annotation can be stably done by linguists without much knowledge of biology with appropriate guidelines regarding to linguistic phenomena particular to scientific texts.

...read moreread less

Abstract: Linguistically annotated corpus based on texts in biomedical domain has been constructed to tune natural language processing (NLP) tools for biotextmining. As the focus of information extraction is shifting from "nominal" information such as named entity to "verbal" information such as function and interaction of substances, application of parsers has become one of the key technologies and thus the corpus annotated for syntactic structure of sentences is in demand. A subset of the GENIA corpus consisting of 500 MEDLINE abstracts has been annotated for syntactic structure in an XMLbased format based on Penn Treebank II (PTB) scheme. Inter-annotator agreement test indicated that the writing style rather than the contents of the research abstracts is the source of the difficulty in tree annotation, and that annotation can be stably done by linguists without much knowledge of biology with appropriate guidelines regarding to linguistic phenomena particular to scientific texts.

...read moreread less

147 citations

Proceedings Article•DOI•

Content-Based Image Annotation Refinement

[...]

Changhu Wang¹, Feng Jing², Lei Zhang², Hong-Jiang Zhang³•Institutions (3)

University of Science and Technology of China¹, Microsoft², Advanced Technology Center³

17 Jun 2007

TL;DR: A content-based image annotation refinement (CIAR) algorithm is proposed to re-rank the candidate annotations of images and leverages both corpus information and the content feature of a query image.

...read moreread less

Abstract: Automatic image annotation has been an active research topic due to its great importance in image retrieval and management. However, results of the state-of-the-art image annotation methods are often unsatisfactory. Despite continuous efforts in inventing new annotation algorithms, it would be advantageous to develop a dedicated approach that could refine imprecise annotations. In this paper, a novel approach to automatically refining the original annotations of images is proposed. For a query image, an existing image annotation method is first employed to obtain a set of candidate annotations. Then, the candidate annotations are re-ranked and only the top ones are reserved as the final annotations. By formulating the annotation refinement process as a Markov process and defining the candidate annotations as the states of a Markov chain, a content-based image annotation refinement (CIAR) algorithm is proposed to re-rank the candidate annotations. It leverages both corpus information and the content feature of a query image. Experimental results on a typical Corel dataset show not only the validity of the refinement, but also the superiority of the proposed algorithm over existing ones.

...read moreread less

144 citations

Journal Article•DOI•

The EU-ADR corpus

[...]

Erik M. van Mulligen¹, Annie Fourrier-Réglat², David Gurwitz³, Mariam Molokhia⁴, Ainhoa Nieto⁵, Gianluca Trifirò⁶, Jan A. Kors¹, Laura I. Furlong⁷ - Show less +4 more•Institutions (7)

Erasmus University Medical Center¹, University of Bordeaux², Tel Aviv University³, King's College London⁴, University of Santiago de Compostela⁵, University of Messina⁶, Pompeu Fabra University⁷

01 Oct 2012-Journal of Biomedical Informatics

TL;DR: This paper describes an approach where a named-entity recognition system produces a first annotation and annotators revise this annotation using a web-based interface, showing that the inter-annotator agreement is much better than the agreement with the system provided annotations.

...read moreread less

144 citations

Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets

[...]

Ching-Yung Lin¹, Belle L. Tseng, John R. Smith•Institutions (1)

IBM¹

01 Jan 2003

TL;DR: A new version of The VideoAnnEx is developed, a.k.a. IBM MPEG-7 Annotation Tool, for collaborative multimedia annotation task in a distributed environment, and a forum to collaboratively annotate semantic labels to the NIST TRECVID 2003 development set is proposed.

...read moreread less

Abstract: We developed a new version of The VideoAnnEx, a.k.a. IBM MPEG-7 Annotation Tool, for collaborative multimedia annotation task in a distributed environment. The VideoAnnEx assists authors in the task of annotating video sequences with MPEG-7 metadata. Each shot in the video sequence can be annotated with static scene descriptions, key object descriptions, event descriptions, and other lexicon sets. The annotated descriptions are associated with each video shot or regions in the keyframes, and are stored as MPEG-7 XML file. We proposed a forum to collaboratively annotate semantic labels to the NIST TRECVID 2003 development set. From April to July 2003, 111 researchers from 23 institutes worked together to associate 198K of ground-truth labels (433K after hierarchy propagation) to 62.2 hours of videos. This large set of valuable ground-truth data is publicly available to the research community, especially for multimedia indexing and retrieval, semantic understanding, and supervised machine learning fields.

...read moreread less

143 citations

Proceedings Article•DOI•

Kodak's consumer video benchmark data set: concept definition and annotation

[...]

Alexander C. Loui¹, Jiebo Luo¹, Shih-Fu Chang², Daniel P. W. Ellis², Wei Jiang², Lyndon Kennedy², Keansub Lee², Akira Yanagawa² - Show less +4 more•Institutions (2)

Eastman Kodak Company¹, Columbia University²

24 Sep 2007

TL;DR: This work developed Kodak's consumer video benchmark data set, which includes a significant number of videos from actual users, a rich lexicon that accommodates consumers, and the annotation of a subset of concepts over the entire video data set.

...read moreread less

Abstract: Semantic indexing of images and videos in the consumer domain has become a very important issue for both research and actual application. In this work we developed Kodak's consumer video benchmark data set, which includes (1) a significant number of videos from actual users, (2) a rich lexicon that accommodates consumers. needs, and (3) the annotation of a subset of concepts over the entire video data set. To the best of our knowledge, this is the first systematic work in the consumer domain aimed at the definition of a large lexicon, construction of a large benchmark data set, and annotation of videos in a rigorous fashion. Such effort will have significant impact by providing a sound foundation for developing and evaluating large-scale learning-based semantic indexing/annotation techniques in the consumer domain.

...read moreread less

141 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
…
46
47
48
49
50
51
52
…
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

11,409

Papers

238,885

Citations

No. of papers in the topic in previous years
Year	Papers
2023	1,461
2022	3,073
2021	305
2020	401
2019	383
2018	373

Annotation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics