scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Proceedings ArticleDOI
23 Apr 2006
TL;DR: An algorithm to separate out tables and math-zones from document images with low computation cost is proposed and it has been observed that tables have distinct columns which imply that gaps between the fields are substantially larger than the gap between the words in text lines.
Abstract: We propose an algorithm to separate out tables and math-zones from document images The algorithm relies on the spatial characteristics of tables and math-zones in a document It has been observed that tables have distinct columns which imply that gaps between the fields are substantially larger than the gaps between the words in text lines and in math-zones the characters and symbols are less dense in comparison to normal text lines These deceptively simple observations have led us to design a simple but powerful table and math-zone detection system with low computation cost

6 citations

Patent
Ajay Jain1
31 Oct 2014
TL;DR: In this article, a modified document is scanned into a digital form using an optical scanning device and the content of the modified digital document including one or more annotations is then grouped into several components, including text, images, form fields and text boxes, and marked shapes, based on corresponding component specifications.
Abstract: Techniques are disclosed for restoring a modified document to an original state. The modified document is scanned into a digital form using an optical scanning device. The content of the modified digital document including one or more annotations is then grouped into several components, including text, images, form fields and text boxes, and marked shapes, based on corresponding component specifications. Each component is then categorized as being structured or unstructured. Structured components that correspond with representative entries in a component repository, such as text in a standard font size, weight and style, are identified as core document content. Unstructured components are identified as annotated document content or highlighted document content, depending on certain characteristics of the components. The categorized and identified components can then be presented separately or in various combinations.

6 citations

Proceedings ArticleDOI
10 Sep 2013
TL;DR: The Conditional Random Field (CRF) model is used to perform OFR and it is shown that the effectiveness of this approach on a set of 616 fonts is demonstrated.
Abstract: Automated publishing systems require large databases containing document page layout templates. Most of these layout templates are created manually. A lower cost alternative is to extract document page layouts from existing documents. In order to extract the layout from a scanned document image, it is necessary to perform Optical Font Recognition (OFR) since the font is an important element in layout design. In this paper, we use the Conditional Random Field (CRF) model to perform OFR. First, we extract typographical features of the text. Then, we train the probabilistic model using a log-linear parameterization of CRF. The advantage of using CRF is that it does not assume that the typographical features are independent of each other. We demonstrate the effectiveness of this approach on a set of 616 fonts.

6 citations

Proceedings ArticleDOI
18 Sep 2012
TL;DR: The method was evaluated on the benchmarking dataset of the International Document Image Binarization Contest (DIBCO 2011) and show promising results.
Abstract: Document image binarization is an initial though critical stage towards the recognition of the text components of a document. This paper describes an efficient method based on mathematical morphology for extracting text regions from degraded handwritten document images. The basic stages of our approach are: a) top-hat-by-reconstruction to produce a filtered image with reasonable even background, b) region growing starting from a set of seed points and attaching to each seed similar intensity neighboring pixels and c) conditional extension of the initially detected text regions based on the values of the second derivative of the filtered image. The method was evaluated on the benchmarking dataset of the International Document Image Binarization Contest (DIBCO 2011) and show promising results.

6 citations

Proceedings ArticleDOI
26 Feb 1997
TL;DR: A system was developed that integrates intelligent document analysis with multiple character/numeral recognition engines in order to achieve high accuracy automated financial document processing and performs well on a test set of machine printed business checks.
Abstract: A system was developed that integrates intelligent document analysis with multiple character/numeral recognition engines in order to achieve high accuracy automated financial document processing. In this system, images are accepted in both their grayscale and binary formats. A document analysis module starts by extracting essential features from the document to help identify its type (e.g. personal check, business check, etc.). These features are also utilized to conduct a full analysis of the image to determine the location of interesting zones such as the courtesy amount and the legal amount. These fields are then made available to several recognition knowledge sources such as courtesy amount recognition engines and legal amount recognition engines through a blackboard architecture. This architecture allows all the available knowledge sources to contribute incrementally and opportunistically to the solution of the given recognition query. Performance results on a test set of machine printed business checks using the integrated system are also reported.© (1997) COPYRIGHT SPIE--The International Society for Optical Engineering. Downloading of the abstract is permitted for personal use only.

6 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189