scispace - formally typeset
Proceedings ArticleDOI

Managing document images in a digital library: an ontology guided approach

TLDR
Heritage+ deals with document images as distinct media type and implements tools and techniques for browsing and querying document images along with other media elements like video sequences and images and proposes a new scheme for encoding and use of ontology for accessing multimedia collection.
Abstract
We present Heritage+, an integrated platform for interactive access of different types of media elements through an unified interface. A unique aspect of Heritage+ is that it deals with document images as distinct media type and implements tools and techniques for browsing and querying document images along with other media elements like video sequences and images. Further, Heritage+ proposes a new scheme for encoding and use of ontology for accessing multimedia collection. In the context of document images, the ontology specifies the document class-specific semantics of the logical components that help in an automated semantically meaningful linking of documents and their components with heterogeneous media-type resources. Further, Heritage+ supports conceptual query of document images along with other media elements. This multifunctional access interface to the document images is provided in Heritage+ using a novel model guided document image segmentation scheme and word-image based indexing scheme.

read more

Citations
More filters
Journal ArticleDOI

A survey of keyword spotting techniques for printed document images

TL;DR: A survey of the past researches on character based as keyword based approaches used for retrieving information from document images to provide insights into the strengths and weaknesses of current techniques and the guidance in choosing the area that future work on document image retrieval could address.
Book ChapterDOI

Document Analysis Systems for Digital Libraries: Challenges and Opportunities

TL;DR: The essential features of document analysis systems that can assist in the creation of digital libraries, automatic indexing and retrieval of doc-images within DL’s, and the presentation ofdoc-images to DL users are specified.
Proceedings Article

Transcoding of Document Images for Mobile Devices.

TL;DR: A scheme for transcoding document images for presentation on handheld devices like PDA’s, e-books etc and use of the knowledge of the document model represented through standard ontology language for generation of document summary is presented.
Proceedings ArticleDOI

MetaOn - Ontology Driven Metadata Construction and Management for Intelligent Search in Text and Image Collections

TL;DR: The proposed MetaOn framework involves ontology-based information extraction and data mining, semi-automatic construction of domain specific ontologies, content-based image indexing and retrieval, and metadata management.
References
More filters
Book

Relevance weighting of search terms

TL;DR: This paper examines statistical techniques for exploiting relevance information to weight search terms using information about the distribution of index terms in documents in general and shows that specific weighted search methods are implied by a general probabilistic theory of retrieval.
Journal ArticleDOI

Relevance weighting of search terms

TL;DR: In this article, a series of relevance weighting functions is derived and is justified by theoretical considerations, in particular, it is shown that specific weighted search methods are implied by a general probabilistic theory of retrieval.
Proceedings ArticleDOI

A trainable document summarizer

TL;DR: The trends in the results are in agreement with those of Edmundson who used a subjectively weighted combination of features as opposed to training the feature weights using a corpus, which suggests that even shorter extracts may be useful indicative summmies.

Original articles Content-based image indexing and searching using Daubechies' wavelets

TL;DR: WBIIS as mentioned in this paper applies a Daubechies' wavelet transform for each of the three opponent color components, and the wavelet coefficients in the lowest few frequency bands, and their variances, are stored as feature vectors.
Journal ArticleDOI

Probabilistic models in information retrieval

Norbert Fuhr
- 01 Jun 1992 - 
TL;DR: An introduction and survey over probabilistic information retrieval (IR) is given: the probability-ranking principle shows that optimum retrieval quality can be achieved under certain assumptions; a conceptual model for IR along with the corresponding event space clarify the interpretation of the Probabilistic parameters involved.
Related Papers (5)