Proceedings Article
Managing document images in a digital library: an ontology guided approach
Gaurav Harit, Santanu Chaudhury, Hiranmay Ghosh +2 more
- pp 64-92
TLDR
Heritage+ treats document images as a distinct media type and implements tools and techniques for browsing and querying document images along with other media elements such as video sequences and images. It also proposes a new scheme for encoding and using an ontology to access a multimedia collection.
Abstract:
We present Heritage+, an integrated platform for interactive access to different types of media elements through a unified interface. A unique aspect of Heritage+ is that it treats document images as a distinct media type and implements tools and techniques for browsing and querying document images along with other media elements such as video sequences and images. Further, Heritage+ proposes a new scheme for encoding and using an ontology to access a multimedia collection. In the context of document images, the ontology specifies the document class-specific semantics of the logical components, which help in automated, semantically meaningful linking of documents and their components with heterogeneous media-type resources. Heritage+ also supports conceptual querying of document images along with other media elements. This multifunctional access interface to document images is provided using a novel model-guided document image segmentation scheme and a word-image based indexing scheme.
Citations
Journal Article
A survey of keyword spotting techniques for printed document images
TL;DR: A survey of past research on character-based as well as keyword-based approaches used for retrieving information from document images, providing insights into the strengths and weaknesses of current techniques and guidance in choosing the areas that future work on document image retrieval could address.
Book Chapter
Document Analysis Systems for Digital Libraries: Challenges and Opportunities
TL;DR: The essential features of document analysis systems that can assist in the creation of digital libraries, automatic indexing and retrieval of doc-images within DLs, and the presentation of doc-images to DL users are specified.
Proceedings Article
Transcoding of Document Images for Mobile Devices.
TL;DR: A scheme is presented for transcoding document images for presentation on handheld devices such as PDAs and e-books, along with the use of knowledge of the document model, represented in a standard ontology language, for generating a document summary.
Proceedings Article
MetaOn - Ontology Driven Metadata Construction and Management for Intelligent Search in Text and Image Collections
Haralampos Karanikas, Nikos Pelekis, Dimitris K. Iakovidis, I. Kopanakis, Thomas Mavroudakis, Yannis Theodoridis +5 more
TL;DR: The proposed MetaOn framework involves ontology-based information extraction and data mining, semi-automatic construction of domain specific ontologies, content-based image indexing and retrieval, and metadata management.