Proceedings Article
An Integrated Scheme for Compression and Interactive Access to Document Images
Gaurav Harit, Ritu Garg, Santanu Chaudhury +2 more
- pp 506-511
TLDR
An integrated scheme for document image compression is presented which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area, and derives an SVG representation of the complete document image.
Abstract:
We present an integrated scheme for document image compression which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area. We encode the layout structure of the document images in an XML representation. The textual components and picture components are compressed separately into different representations. We derive an SVG (scalable vector graphics) representation of the complete document image. Compression is achieved since the word-images are encoded using specifications for geometric primitives that compose a word. A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices. We demonstrate the effectiveness of the proposed scheme for document access...
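The idea of encoding word-images as geometric primitives and adapting their placement to the screen area can be illustrated with a minimal sketch. It is not the authors' implementation: the word dictionary format (`width`, `strokes`), the helper names `reflow` and `to_svg`, the use of polylines as the primitive, and the fixed `gap` and `line_height` values are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"


def reflow(words, screen_width, gap=4, line_height=20):
    """Assign an (x, y) origin to each word, wrapping at screen_width.

    Each word is a hypothetical dict with a pixel "width" and a list of
    "strokes" (polylines given as lists of (x, y) points).
    """
    positions = []
    x, y = 0, 0
    for word in words:
        # Start a new text line when the word would overflow the screen.
        if x > 0 and x + word["width"] > screen_width:
            x = 0
            y += line_height
        positions.append((x, y))
        x += word["width"] + gap
    return positions


def to_svg(words, screen_width, line_height=20):
    """Serialize the words as an SVG string, one <g> of polylines per word."""
    positions = reflow(words, screen_width, line_height=line_height)
    height = positions[-1][1] + line_height if positions else 0
    svg = ET.Element(
        "svg",
        {"xmlns": SVG_NS, "width": str(screen_width), "height": str(height)},
    )
    for word, (x, y) in zip(words, positions):
        # The word keeps its own local coordinates; only the group moves.
        group = ET.SubElement(svg, "g", {"transform": f"translate({x},{y})"})
        for stroke in word["strokes"]:
            points = " ".join(f"{px},{py}" for px, py in stroke)
            ET.SubElement(
                group,
                "polyline",
                {"points": points, "fill": "none", "stroke": "black"},
            )
    return ET.tostring(svg, encoding="unicode")


if __name__ == "__main__":
    sample = [{"width": 30, "strokes": [[(0, 0), (30, 10)]]} for _ in range(3)]
    print(to_svg(sample, screen_width=70))
```

Because each word stores only its stroke coordinates and is positioned through a group transform, the same word data can be laid out again for a different screen width without touching the primitives themselves.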
Citations
Journal Article
A survey of keyword spotting techniques for printed document images
TL;DR: A survey of past research on character-based and keyword-based approaches for retrieving information from document images, providing insights into the strengths and weaknesses of current techniques and guidance on the areas that future work on document image retrieval could address.
Journal Article
Feature string-based intelligent information retrieval from Tamil document images
S. Abirami, D. Manjula +1 more
TL;DR: A simple and effective method to extract the text and perform intelligent IR from Tamil Document Images without Optical Character Recognition (OCR) that could be easily adopted in large digital libraries for IR.
Journal Article
Online Information Search from Tamil Document Images in World Wide Web
S. Abirami, S. Murugappan +1 more
TL;DR: This paper proposes a simple and effective method to separate the document images from the available web image sources and to retrieve the information present in those web document images.
Book Chapter
3D reconstruction and isometric representation of engineering drawings
TL;DR: This chapter extends previous work on the 3D reconstruction of engineering drawings in DXF, and a new aspect is added to represent isometric views of these drawings in SVG format, proving the former suitable, especially for the World Wide Web.
Proceedings Article
Uniform Representation Model and Approximate Generation Algorithm of Mobile SVG
TL;DR: This paper puts forward an approximate generation algorithm based on cubic Bezier curves, figures out the approximate element K for curves, and proves that the approximation degree of the algorithm is very high.
References
Journal Article
Document retrieval from compressed images
Yue Lu, Chew Lim Tan +1 more
TL;DR: Preliminary experimental results with the document images captured from students’ theses show that the proposed approach to retrieve the documents from CCITT Group 4 compressed document images has achieved a promising performance.
Proceedings Article
Trainable script identification strategies for Indian languages
Santanu Chaudhury, R. Sheth +1 more
TL;DR: This paper has proposed a novel Gabor filter-based feature extraction scheme for the connected components of Indian scripts, and found that frequency distribution of the width-to-height ratio of the connected components can also be used for script recognition.
Proceedings Article
A model guided document image analysis scheme
TL;DR: A new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions) and makes use of this tool for identifying the logical components of a document image.
Proceedings Article
Transcoding of Document Images for Mobile Devices.
TL;DR: A scheme is presented for transcoding document images for presentation on handheld devices such as PDAs and e-books, along with the use of knowledge of the document model, represented through a standard ontology language, for generating a document summary.