scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Patent
Kathrin Berkner1
30 Jun 2005
TL;DR: In this article, a method, article of manufacture, and apparatus for content-adaptive scaling of document images is described, which comprises identifying spatial relationships between document objects of a document image, determining space separating pairs of neighboring document objects, and determining at least one scaling factor based on the space separating the document objects in the document image and based on display device characteristics.
Abstract: A method, article of manufacture, and apparatus for content-adaptive scaling of document images is described. In one embodiment, the method comprises identifying spatial relationships between document objects of a document image, determining space separating pairs of neighboring document objects, and determining at least one scaling factor based on the space separating the document objects in the document image and based on display device characteristics.

61 citations

Patent
15 Nov 2007
TL;DR: In this article, the authors present a system for storing, organizing, and accessing image-based documents, which includes OCR conversion process to produce an equivalent document in text format, identifying the keywords of the equivalent document, linking the keywords with the image based document and storing the imagebased document, the corresponding equivalent document and the keywords in a relational database.
Abstract: Methods and systems are provided for storing, organizing, and accessing image-based documents The method includes receiving an image-based document, conducting an OCR conversion process to produce an equivalent document in text format, identifying keywords of the equivalent document in text format, linking the keywords with the image-based document and the corresponding equivalent document in text format, and storing the image-based document, the corresponding equivalent document in text format, and the keywords in a relational database

61 citations

Journal ArticleDOI
TL;DR: The usefulness of the features derived from interval coding in a hidden Markov model based page layout classification system that is trainable and extendible are demonstrated.
Abstract: This paper describes features and methods for document image comparison and classification at the spatial layout level. The methods are useful for visual similarity based document retrieval as well as fast algorithms for initial document type classification without OCR. A novel feature set called interval encoding is introduced to capture elements of spatial layout. This feature set encodes region layout information in fixed-length vectors by capturing structural characteristics of the image. These fixed-length vectors are then compared to each other through a Manhattan distance computation for fast page layout comparison. The paper describes experiments and results to rank-order a set of document pages in terms of their layout similarity to a test document. We also demonstrate the usefulness of the features derived from interval coding in a hidden Markov model based page layout classification system that is trainable and extendible. The methods described in the paper can be used in various document retrieval tasks including visual similarity based retrieval, categorization and information extraction.

59 citations

Patent
26 Aug 1994
TL;DR: In this article, a method and an apparatus for document formatting, capable of reflecting the preference of the operator and overall balance, such that the desired formatting can be obtained efficiently without tedious post-processing operations.
Abstract: A method and an apparatus for document formatting, capable of reflecting the preference of the operator and overall balance, such that the desired formatting can be obtained efficiently without tedious post-processing operations. In the apparatus, document data representing the document, including figure data representing figure elements of the document, and region data indicating layout regions to which the document is to be laid out, are inputted, candidate layouts for each figure element to be laid out are generated, one of the generated candidate layouts is selected, and the document is formatted in the layout region, according to the selected one of the candidate layouts.

59 citations

Proceedings ArticleDOI
09 Nov 2001
TL;DR: A document analysis system which is expected to extract regions of interest in greyscale document images using geometric and texture features and some entropic heuristic is presented.
Abstract: In this paper, we present a document analysis system which is expected to extract regions of interest in greyscale document images. Collected areas are then clustered in text zones and non-text areas using geometric and texture features. The system works in two steps. Regions of interest are retrieved via cumulative gradient considerations. In classification module, we introduced some entropic heuristic. Experiments are done on the MediaTeam Document Database to show the relevance of this criteria.

59 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189