Proceedings Article

An Integrated Scheme for Compression and Interactive Access to Document Images

TL;DR: An integrated scheme for document image compression is presented which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area, and derives an SVG representation of the complete document image.
Abstract: We present an integrated scheme for document image compression which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area. We encode the layout structure of the document images in an XML representation. The textual components and picture components are compressed separately into different representations. We derive an SVG (scalable vector graphics) representation of the complete document image. Compression is achieved since the word-images are encoded using specifications for geometric primitives that compose a word. A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices. We demonstrate the effectiveness of the proposed scheme for document access
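As a rough illustration of encoding a word image through geometric primitives in SVG, the sketch below builds an SVG fragment from a list of line segments. The choice of straight line segments as primitives, the helper name `word_to_svg`, and the coordinates are all assumptions made for illustration; the abstract does not specify the primitives or the encoding the authors use.

```python
import xml.etree.ElementTree as ET


def word_to_svg(segments, width, height, stroke_width=2):
    """Build an SVG string for one word from (x1, y1, x2, y2) line segments.

    Hypothetical helper: the primitive set and attributes are illustrative
    only and are not taken from the paper.
    """
    svg = ET.Element("svg", {
        "xmlns": "http://www.w3.org/2000/svg",
        # viewBox lets the renderer scale the word to any display size
        "viewBox": f"0 0 {width} {height}",
    })
    group = ET.SubElement(svg, "g", {
        "stroke": "black",
        "stroke-width": str(stroke_width),
    })
    for x1, y1, x2, y2 in segments:
        ET.SubElement(group, "line", {
            "x1": str(x1), "y1": str(y1),
            "x2": str(x2), "y2": str(y2),
        })
    return ET.tostring(svg, encoding="unicode")


# Made-up example: one horizontal stroke with two vertical strokes below it
example_segments = [(0, 5, 40, 5), (10, 5, 10, 30), (30, 5, 30, 30)]
example_svg = word_to_svg(example_segments, width=40, height=35)
```

Storing a handful of segment coordinates per word, rather than a pixel raster, is one way such a representation could be compact while still rescaling in a browser.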
Citations
Journal Article
TL;DR: A survey of past research on character-based as well as keyword-based approaches used for retrieving information from document images, providing insights into the strengths and weaknesses of current techniques and guidance in choosing the area that future work on document image retrieval could address.
Abstract: This paper attempts to provide a survey of past research on character-based as well as keyword-based approaches used for retrieving information from document images. This survey also provides insights into the strengths and weaknesses of current techniques, how the techniques relate to one another, and guidance in choosing the area that future work on document image retrieval could address.

39 citations


Cites methods from "An Integrated Scheme for Compressio..."

  • ...Subsequently, computational speed up to the above GFG extraction algorithm was achieved (Harit et al. 2007) through a modification of the primitive identification technique....


Journal Article
S. Abirami1, D. Manjula1
TL;DR: A simple and effective method to extract the text and perform intelligent IR from Tamil Document Images without Optical Character Recognition (OCR) that could be easily adopted in large digital libraries for IR.
Abstract: Information Retrieval (IR) in document images has become a growing and challenging problem due to its rising popularity. This paper proposes a simple and effective method to extract the text and perform intelligent IR from Tamil Document Images without Optical Character Recognition (OCR). The methodology generates a feature string for every word image by extracting its features. It relies on the basic characteristics or shapes of letters rather than recognising the letters as OCR does. The strength of this technique lies in extracting the text based on basic features, such as lines and black and white disposition rates in characters, which are almost the same for the characters across various font sizes and font faces. As an offline process, document images are preprocessed, and the text extraction process extracts features from the word images based on their shapes and stores them in temporary files. During online retrieval, a textual keyword is obtained from the user and its primitive string is framed. Based on the primitive string, IR is performed and the resultant images are provided to the user. This technique could be easily adopted in large digital libraries for IR.

10 citations


Cites background from "An Integrated Scheme for Compressio..."

  • ...Recently, in Hindi document images, Harit et al. (2007) present a GFG-based technique for document access....


Journal Article
TL;DR: This paper proposes a simple and effective method to separate the document images from the available web image sources and to retrieve the information present in those web document images.
Abstract: Information Retrieval (IR) from Tamil document images on the World Wide Web (WWW) has become a challenging problem due to its rising popularity. Although web images are among the most valuable Web assets, categorizing them and retrieving information from them is quite difficult. This paper proposes a simple and effective method to separate the document images from the available web image sources and to retrieve the information present in those web document images. The system works in two phases. In the first phase, it concentrates on an automatic image categorization process over web images, employing a filtering technique to discriminate document images from other images available on the WWW. The filtering technique captures image information through intensity and frequency histograms to discriminate the web document images. For information retrieval in the second phase, a feature string generation technique generates a feature string for every word image by extracting its shape features based on statistical properties, such as lines, black and white disposition rates and outline features of characters, instead of recognizing the letters and assigning their ASCII values as OCR does. This kind of information retrieval has been run over a list of web sites and the experimental results are recorded.

1 citation


Cites methods from "An Integrated Scheme for Compressio..."

  • ...Therefore Geometric Feature GFG technique was introduced [9][10][11] to represent the features at word level in Hindi images....


Book Chapter
01 Jan 2009
TL;DR: This chapter extends previous work on the 3D reconstruction of engineering drawings in DXF and adds a new aspect, representing isometric views of these drawings in SVG format, showing the latter to be suitable, especially for the World Wide Web.
Abstract: Engineering drawings are mostly represented in drawing exchange file (DXF) format for information interchange. The DXF format is recognized by CAD tools only and results in a large file size, requiring heavy loading time. In this chapter, we have extended our previous work on the 3D reconstruction of engineering drawings in DXF, and a new aspect is added to represent isometric views of these drawings in SVG format. The discussion concludes with a comparison of SVG with DXF, showing SVG to be suitable, especially for the World Wide Web.

Additional excerpts

  • ...Vector format SVG with its small file size plays a very important role in this regard; however, a suitable compression technique [15] can also be used....


Proceedings Article
08 Oct 2007
TL;DR: This paper puts forward an approximate generation algorithm based on the cubic Bezier curve, determines the approximation element K for curves, and shows that the algorithm's degree of approximation is very high.
Abstract: Mobile SVG has been adopted as the imaging standard of MMS by 3GPP (the 3rd Generation Partnership Project). When generating mobile SVG on resource-limited mobile devices, memory space and computation capability become serious constraints that deserve more attention. By analyzing the characteristics of mobile SVG basic shapes, this paper presents a uniform representation model based on paths. It then puts forward an approximate generation algorithm based on the cubic Bezier curve and determines the approximation element K for curves. The experiments show that the algorithm's degree of approximation is very high. The method is valuable for mobile SVG representation.

Cites background from "An Integrated Scheme for Compressio..."

  • ...Various researches and potential applications are carried out with the use of its extraordinary characteristics; some examples are Information visualization [3][4], document representation [5], map service especially Mobile GIS [6][7][8][9] etc....


References
Journal Article
TL;DR: Preliminary experimental results with the document images captured from students’ theses show that the proposed approach to retrieve the documents from CCITT Group 4 compressed document images has achieved a promising performance.
Abstract: With the emergence of digital libraries, more and more documents are stored and transmitted through the Internet in the format of compressed images. It is therefore important to develop a system capable of retrieving documents from these compressed document images. Targeting the popular compression standard CCITT Group 4, which is widely used for compressing document images, we present an approach to retrieve documents from CCITT Group 4 compressed document images. The black and white changing elements are extracted directly from the compressed document images to act as the feature pixels, and the connected components are detected simultaneously. The word boxes are then bounded by merging the connected components. A weighted Hausdorff distance is proposed to assign all of the word objects, from both the query document and the database document, to corresponding classes by an unsupervised classifier, while possible stop words are excluded. Document vectors are built from the occurrence frequency of the word object classes, and the pair-wise similarity of two document images is represented by the scalar product of the document vectors. Nine groups of articles pertaining to different domains are used to test the validity of the presented approach. Preliminary experimental results with document images captured from students’ theses show that the proposed approach has achieved a promising performance.
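The last step of that abstract, document vectors of class frequencies compared by a scalar product, can be sketched as follows. This is a minimal sketch under stated assumptions: word object classes are taken to be integer labels already produced by some classifier, and no normalization is applied, since the abstract does not say whether the vectors are normalized. The names `document_vector` and `similarity` are hypothetical.

```python
from collections import Counter


def document_vector(word_classes, num_classes):
    """Count how often each word-object class occurs in one document.

    Assumes classes are integer labels in range(num_classes).
    """
    counts = Counter(word_classes)
    return [counts.get(c, 0) for c in range(num_classes)]


def similarity(vec_a, vec_b):
    """Scalar (dot) product of two document vectors."""
    return sum(a * b for a, b in zip(vec_a, vec_b))


# Made-up class labels for the words of a query and a database document
query = document_vector([0, 2, 2, 3], num_classes=4)
candidate = document_vector([2, 3, 3, 1], num_classes=4)
score = similarity(query, candidate)
```

A higher score indicates that the two documents share more frequently occurring word classes; ranking database documents by this score against the query is one plausible way to use it.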

85 citations


"An Integrated Scheme for Compressio..." refers methods in this paper

  • ...A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices, unlike other document image representation techniques [9]....


Proceedings Article
20 Sep 1999
TL;DR: This paper has proposed a novel Gabor filter-based feature extraction scheme for the connected components of Indian scripts, and found that frequency distribution of the width-to-height ratio of the connected components can also be used for script recognition.
Abstract: Identification of the script in an image of a document page is of primary importance for a system processing multi-lingual documents. In this paper three trainable classification schemes have been proposed for identification of Indian scripts. The first scheme is based upon a frequency domain representation of the horizontal profile of the textual blocks. The other two schemes use connected components extracted from the textual region. We have proposed a novel Gabor filter-based feature extraction scheme for the connected components. We have also found that frequency distribution of the width-to-height ratio of the connected components can also be used for script recognition. It has been experimentally found that the Gabor filter-based scheme provides the most reliable performance. However, the other two techniques are computationally more efficient.

52 citations


Additional excerpts

  • ...The text blocks are further processed to identify the script [4]....


Proceedings Article
10 Sep 2001
TL;DR: A new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions) and makes use of this tool for identifying the logical components of a document image.
Abstract: This paper presents a new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions). Given a document image, the algorithm has the ability to select the appropriate model. A new wavelet-based tool has been designed for distinguishing text from non-text regions and characterization of font sizes. Our model-based analysis scheme makes use of this tool for identifying the logical components of a document image.

16 citations


"An Integrated Scheme for Compressio..." refers methods in this paper

  • ...The original document image is segmented using the XML-DTD model guided segmentation [7], which divides the page into rectangular blocks of labelled logical components....


Proceedings Article
01 Jan 2004
TL;DR: A scheme for transcoding document images for presentation on handheld devices like PDAs, e-books, etc., and the use of knowledge of the document model, represented through a standard ontology language, for generation of a document summary are presented.
Abstract: In this paper we have presented a scheme for transcoding document images for presentation on handheld devices like PDAs, e-books, etc. We have proposed techniques suitable, in particular, for images of documents of Indian languages having Devanagari based scripts (viz. Hindi, Marathi, Bengali, Assamese, etc). An appropriate compression scheme for the textual component of document images, exploiting script-specific characteristics, has been suggested. We have also explored the use of knowledge of the document model, represented through a standard ontology language, for generation of a document summary. An experimental system has been developed for validation of these schemes.

7 citations