Topic

Annotation

About: Annotation is a research topic. Over the lifetime, 6719 publications have been published within this topic receiving 203463 citations. The topic is also known as: note & markup.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Callisto: A Configurable Annotation Workbench.

[...]

David Day¹, Chad Mchenry¹, Robyn Kozierok¹, Laurel D. Riek¹•Institutions (1)

Mitre Corporation¹

01 May 2004

TL;DR: Callisto provides a facility for the independent development, compilation and installation of task module plug-ins (in the form of Java Archive jar files) and provides a set of annotation services to which all separate GUI components can subscribe, enabling a common framework through which annotation updates are propagated to all components.

...read moreread less

Abstract: In order to support a range of textual annotation tasks, we have developed a new annotation tool called Callisto. To promote taskspecific specialization of the interface and associated constraint checking, Callisto provides a facility for the independent development, compilation and installation of task module plug-ins (in the form of Java Archive jar files). The common Callisto backend provides a set of annotation services to which all separate GUI components can subscribe, enabling a common framework through which annotation updates are propagated to all components. A number of annotation task models have already been defined, and those that are of very general applicability have been made easily re-configurable for small changes in task definition. Callisto is implemented in Java to make use of Java s considerable support for Unicode-encoded multilingual data. Callisto is freely available for downloading

...read moreread less

46 citations

Journal Article•DOI•

Transductive Multi-Instance Multi-Label learning algorithm with application to automatic image annotation

[...]

Songhe Feng¹, De Xu¹•Institutions (1)

Beijing Jiaotong University¹

01 Jan 2010-Expert Systems With Applications

TL;DR: An improved Transductive Multi-Instance Multi-Label (TMIML) learning framework is proposed, which aims at taking full advantage of both labeled and unlabeled data to address the annotation problem.

...read moreread less

Abstract: Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping and the scarcity of training examples, the annotation task has become a challenge to systematically develop robust annotation models with better performance. From the perspective of machine learning, the annotation task fits both multi-instance and multi-label learning framework due to the fact that an image is usually described by multiple semantic labels (keywords) and these labels are often highly related to respective regions rather than the entire image. In this paper, we propose an improved Transductive Multi-Instance Multi-Label (TMIML) learning framework, which aims at taking full advantage of both labeled and unlabeled data to address the annotation problem. The experiments over the well known Corel 5000 data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.

...read moreread less

46 citations

Posted Content•

Functional Map of the World

[...]

Gordon Christie¹, Neil Fendley¹, James Wilson, Ryan Mukherjee¹•Institutions (1)

Johns Hopkins University Applied Physics Laboratory¹

21 Nov 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: A new dataset, Functional Map of the World (fMoW), which aims to inspire the development of machine learning models capable of predicting the functional purpose of buildings and land use from temporal sequences of satellite images and a rich set of metadata features.

...read moreread less

Abstract: We present a new dataset, Functional Map of the World (fMoW), which aims to inspire the development of machine learning models capable of predicting the functional purpose of buildings and land use from temporal sequences of satellite images and a rich set of metadata features. The metadata provided with each image enables reasoning about location, time, sun angles, physical sizes, and other features when making predictions about objects in the image. Our dataset consists of over 1 million images from over 200 countries. For each image, we provide at least one bounding box annotation containing one of 63 categories, including a "false detection" category. We present an analysis of the dataset along with baseline approaches that reason about metadata and temporal views. Our data, code, and pretrained models have been made publicly available.

...read moreread less

46 citations

Book Chapter•DOI•

MuLVAT: A Video Annotation Tool Based on XML-Dictionaries and Shot Clustering

[...]

Zenonas Theodosiou¹, Anastasis Kounoudes, Nicolas Tsapatsoulis¹, Marios Milis•Institutions (1)

Cyprus University of Technology¹

02 Oct 2009

TL;DR: A video annotation tool, which uses structured knowledge, in the form of XML dictionaries, combined with a hierarchical classification scheme to attach semantic labels to video segments at various level of granularity is presented.

...read moreread less

Abstract: Recent advances in digital video technology have resulted in an explosion of digital video data which are available through the Web or in private repositories. Efficient searching in these repositories created the need of semantic labeling of video data at various levels of granularity, i.e., movie, scene, shot, keyframe, video object, etc. Through multilevel labeling video content is appropriately indexed, allowing access from various modalities and for a variety of applications. However, despite the huge efforts for automatic video annotation human intervention is the only way for reliable semantic video annotation. Manual video annotation is an extremely laborious process and efficient tools developed for this purpose can make, in many cases, the true difference. In this paper we present a video annotation tool, which uses structured knowledge, in the form of XML dictionaries, combined with a hierarchical classification scheme to attach semantic labels to video segments at various level of granularity. Video segmentation is supported through the use of an efficient shot detection algorithm; while shots are combined into scenes through clustering with the aid of a Genetic Algorithm scheme. Finally, XML dictionary creation and editing tools are available during annotation allowing the user to always use the semantic label she/he wishes instead of the automatically created ones.

...read moreread less

46 citations

Proceedings Article•DOI•

Online multi-label active annotation: towards large-scale content-based video search

[...]

Xian-Sheng Hua¹, Guo-Jun Qi²•Institutions (2)

Microsoft¹, University of Science and Technology of China²

26 Oct 2008

TL;DR: This paper proposes a scalable framework for annotation-based video search, as well as a novel approach to enable large-scale semantic concept annotation, that is, online multi-label active learning, scalable to both the video sample dimension and concept label dimension.

...read moreread less

Abstract: Existing video search engines have not taken the advantages of video content analysis and semantic understanding. Video search in academia uses semantic annotation to approach content-based indexing. We argue this is a promising direction to enable real content-based video search. However, due to the complexity of both video data and semantic concepts, existing techniques on automatic video annotation are still not able to handle large-scale video set and large-scale concept set, in terms of both annotation accuracy and computation cost. To address this problem, in this paper, we propose a scalable framework for annotation-based video search, as well as a novel approach to enable large-scale semantic concept annotation, that is, online multi-label active learning. This framework is scalable to both the video sample dimension and concept label dimension. Large-scale unlabeled video samples are assumed to arrive consecutively in batches with an initial pre-labeled training set, based on which a preliminary multi-label classifier is built. For each arrived batch, a multi-label active learning engine is applied, which automatically selects and manually annotates a set of unlabeled sample-label pairs. And then an online learner updates the original classifier by taking the newly labeled sample-label pairs into consideration. This process repeats until all data are arrived. During the process, new labels, even without any pre-labeled training samples, can be incorporated into the process anytime. Experiments on TRECVID dataset demonstrate the effectiveness and efficiency of the proposed framework.

...read moreread less

46 citations

Collapse

Network Information

Performance

Metrics

11,409

Papers

238,885

Citations

No. of papers in the topic in previous years
Year	Papers
2023	1,461
2022	3,073
2021	305
2020	401
2019	383
2018	373

Annotation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics