An Integrated Scheme for Compression and Interactive Access to Document Images

doi:10.1109/ICCTA.2007.29

Proceedings ArticleDOI

An Integrated Scheme for Compression and Interactive Access to Document Images

Gaurav Harit, +2 more

- pp 506-511

Chats0

TLDR

An integrated scheme for document image compression is presented which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area, and derives an SVG representation of the complete document image.

Abstract:

We present an integrated scheme for document image compression which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area. We encode the layout structure of the document images in an XML representation. The textual components and picture components are compressed separately into different representations. We derive an SVG (scalable vector graphics) representation of the complete document image. Compression is achieved since the word-images are encoded using specifications for geometric primitives that compose a word. A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices. We demonstrate the effectiveness of the proposed scheme for document access

Citations

PDF

Open Access

More filters

Journal ArticleDOI

A survey of keyword spotting techniques for printed document images

Abirami Murugappan, +2 more

- 01 Feb 2011 -

Artificial Intelligence Review

TL;DR: A survey of the past researches on character based as keyword based approaches used for retrieving information from document images to provide insights into the strengths and weaknesses of current techniques and the guidance in choosing the area that future work on document image retrieval could address.

...read moreread less

Journal ArticleDOI

Feature string-based intelligent information retrieval from Tamil document images

S. Abirami, +1 more

- 01 Jun 2009 -

Journal of Computer Applications in Tech...

TL;DR: A simple and effective method to extract the text and perform intelligent IR from Tamil Document Images without Optical Character Recognition (OCR) that could be easily adopted in large digital libraries for IR.

...read moreread less

Journal ArticleDOI

Online Information Search from Tamil Document Images in World Wide Web

S. Abirami, +1 more

- 30 Aug 2012 -

International Journal of Computer Applic...

TL;DR: This paper proposes a simple and effective method to separate the document images from the available web image sources and to retrieve the information present in those web document images.

...read moreread less

Book ChapterDOI

3D reconstruction and isometric representation of engineering drawings

Muhammad Abuzar Fahiem, +2 more

TL;DR: This chapter extends previous work for the 3D reconstruction of engineering drawings in DXF and a new aspect is added to represent isometric views of these drawings in SVG format, proving the former suitable, specially for World Wide Web.

...read moreread less

Proceedings ArticleDOI

Uniform Representation Model and Approximate Generation Algorithm of Mobile SVG

Wan-Lin, +3 more

TL;DR: This paper puts forward an approximate generation algorithm based on cubic Bezier curve, and figure out the approximate element K for curves, and proves that the approximate degree of the algorithm is very high.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

Document retrieval from compressed images

Yue Lu, +1 more

- 01 Apr 2003 -

Pattern Recognition

TL;DR: Preliminary experimental results with the document images captured from students’ theses show that the proposed approach to retrieve the documents from CCITT Group 4 compressed document images has achieved a promising performance.

...read moreread less

Proceedings ArticleDOI

Trainable script identification strategies for Indian languages

Santanu Chaudhury, +1 more

TL;DR: This paper has proposed a novel Gabor filter-based feature extraction scheme for the connected components of Indian scripts, and found that frequency distribution of the width-to-height ratio of theconnected components can also be used for script recognition.

...read moreread less

A general segmentation scheme for DjVu document compression

Patrick Haffner, +4 more

Proceedings ArticleDOI

A model guided document image analysis scheme

Gaurav Harit, +4 more

TL;DR: A new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions) and makes use of this tool for identifying the logical components of a document image.

...read moreread less

Proceedings Article

Transcoding of Document Images for Mobile Devices.

Tabassum Yasmin, +2 more

TL;DR: A scheme for transcoding document images for presentation on handheld devices like PDA’s, e-books etc and use of the knowledge of the document model represented through standard ontology language for generation of document summary is presented.

...read moreread less

An Integrated Scheme for Compression and Interactive Access to Document Images

Citations

A survey of keyword spotting techniques for printed document images

Feature string-based intelligent information retrieval from Tamil document images

Online Information Search from Tamil Document Images in World Wide Web

3D reconstruction and isometric representation of engineering drawings

Uniform Representation Model and Approximate Generation Algorithm of Mobile SVG

References

Document retrieval from compressed images

Trainable script identification strategies for Indian languages

A general segmentation scheme for DjVu document compression

A model guided document image analysis scheme

Transcoding of Document Images for Mobile Devices.

Related Papers (5)

A structure editor for abstract document objects

Carrying graphical data embedded in a program stream

Digital media environment for intuitive modifications of digital graphics

Scalable vector graphics editing systems and methods

Analyzing a document that includes a text-based visual representation