scispace - formally typeset
Proceedings ArticleDOI

An Integrated Scheme for Compression and Interactive Access to Document Images

Reads0
Chats0
TLDR
An integrated scheme for document image compression is presented which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area, and derives an SVG representation of the complete document image.
Abstract
We present an integrated scheme for document image compression which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area. We encode the layout structure of the document images in an XML representation. The textual components and picture components are compressed separately into different representations. We derive an SVG (scalable vector graphics) representation of the complete document image. Compression is achieved since the word-images are encoded using specifications for geometric primitives that compose a word. A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices. We demonstrate the effectiveness of the proposed scheme for document access

read more

Citations
More filters
Journal ArticleDOI

A survey of keyword spotting techniques for printed document images

TL;DR: A survey of the past researches on character based as keyword based approaches used for retrieving information from document images to provide insights into the strengths and weaknesses of current techniques and the guidance in choosing the area that future work on document image retrieval could address.
Journal ArticleDOI

Feature string-based intelligent information retrieval from Tamil document images

TL;DR: A simple and effective method to extract the text and perform intelligent IR from Tamil Document Images without Optical Character Recognition (OCR) that could be easily adopted in large digital libraries for IR.
Journal ArticleDOI

Online Information Search from Tamil Document Images in World Wide Web

TL;DR: This paper proposes a simple and effective method to separate the document images from the available web image sources and to retrieve the information present in those web document images.
Book ChapterDOI

3D reconstruction and isometric representation of engineering drawings

TL;DR: This chapter extends previous work for the 3D reconstruction of engineering drawings in DXF and a new aspect is added to represent isometric views of these drawings in SVG format, proving the former suitable, specially for World Wide Web.
Proceedings ArticleDOI

Uniform Representation Model and Approximate Generation Algorithm of Mobile SVG

TL;DR: This paper puts forward an approximate generation algorithm based on cubic Bezier curve, and figure out the approximate element K for curves, and proves that the approximate degree of the algorithm is very high.
References
More filters
Journal ArticleDOI

Document retrieval from compressed images

TL;DR: Preliminary experimental results with the document images captured from students’ theses show that the proposed approach to retrieve the documents from CCITT Group 4 compressed document images has achieved a promising performance.
Proceedings ArticleDOI

Trainable script identification strategies for Indian languages

TL;DR: This paper has proposed a novel Gabor filter-based feature extraction scheme for the connected components of Indian scripts, and found that frequency distribution of the width-to-height ratio of theconnected components can also be used for script recognition.
Proceedings ArticleDOI

A model guided document image analysis scheme

TL;DR: A new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions) and makes use of this tool for identifying the logical components of a document image.
Proceedings Article

Transcoding of Document Images for Mobile Devices.

TL;DR: A scheme for transcoding document images for presentation on handheld devices like PDA’s, e-books etc and use of the knowledge of the document model represented through standard ontology language for generation of document summary is presented.