Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Learning nongenerative grammatical models for document analysis

[...]

M. Shilman¹, Percy Liang¹, Paul A. Viola¹•Institutions (1)

Microsoft¹

17 Oct 2005

TL;DR: This approach models document layout as a grammar and performs a global search for the optimal parse based on a grammatical cost function and applies this technique to two document image analysis tasks: page layout structure extraction and mathematical expression interpretation.

...read moreread less

Abstract: We present a general approach for the hierarchical segmentation and labeling of document layout structures. This approach models document layout as a grammar and performs a global search for the optimal parse based on a grammatical cost function. Our contribution is to utilize machine learning to discriminatively select features and set all parameters in the parsing process. Therefore, and unlike many other approaches for layout analysis, ours can easily adapt itself to a variety of document analysis problems. One need only specify the page grammar and provide a set of correctly labeled pages. We apply this technique to two document image analysis tasks: page layout structure extraction and mathematical expression interpretation. Experiments demonstrate that the learned grammars can be used to extract the document structure in 57 files from the UWIII document image database. We also show that the same framework can be used to automatically interpret printed mathematical expressions so as to recreate the original LaTeX

...read moreread less

50 citations

Proceedings Article•DOI•

Skew Detection in Binary Image Documents Based on Image Dilation and Region labeling Approach

[...]

B. V. Dhandra¹, V.S. Malemath¹, H. Mallikarjun¹, Ravindra S. Hegadi¹•Institutions (1)

Gulbarga University¹

20 Aug 2006

TL;DR: A new and efficient method for estimation of skew angle in a binary document image, based on image dilation and region labeling technique, which is robust for machine printed document of any size/font, multi-column layouts and documents containing graphics, pictures, charts, tables etc is proposed.

...read moreread less

Abstract: In this paper we propose a new and efficient method for estimation of skew angle in a binary document image, based on Image dilation and Region labeling technique. The input document is dilated by using structuring element as line whose length is fixed experimentally and the region labeling technique is applied using depth first search. Orientation angle is calculated for all the labeled regions, the average of all orientation angles is considered as the skew angle of the document. The experimental results show that better accuracy of estimation could be achieved using this approach since it is based on orientation angles of all the text lines of the underlying document and is the minimum variance unbiased estimator of the true skew angle. The novelty of the proposed method is that, it is robust for machine printed document of any size/font, multi-column layouts and documents containing graphics, pictures, charts, tables etc.

...read moreread less

50 citations

Patent•

Document retrieving method and apparatus

[...]

Eiichiro Toshima¹•Institutions (1)

Canon Inc.¹

26 Apr 2004

TL;DR: In this paper, text feature data that bases upon text data included in a document and image feature data based upon a document image are stored in a memory, and a document corresponding to the search document is retrieved from plural documents.

...read moreread less

Abstract: In the proposed document retrieving apparatus, text feature data that bases upon text data included in a document and image feature data that bases upon a document image are stored in a memory. Image data of a search document is subjected to character recognition processing, text feature data is acquired based on the obtained text data, and image feature data (layout data) is acquired based on the image data of the search document. Using the text feature data and image feature data acquired with respect to the search document, a memory is searched, and a document corresponding to the search document is retrieved from plural documents.

...read moreread less

50 citations

Proceedings Article•DOI•

Pixel-Accurate Representation and Evaluation of Page Segmentation in Document Images

[...]

Faisal Shafait¹, Daniel Keysers¹, Thomas M. Breuel¹•Institutions (1)

German Research Centre for Artificial Intelligence¹

20 Aug 2006

TL;DR: A new representation and evaluation procedure of page segmentation algorithms and analyzes six widely-used layout analysis algorithms using the procedure, permitting easy interchange of segmentation results and ground truth.

...read moreread less

Abstract: This paper presents a new representation and evaluation procedure of page segmentation algorithms and analyzes six widely-used layout analysis algorithms using the procedure. The method permits a detailed analysis of the behavior of page segmentation algorithms in terms of over- and undersegmentation at different layout levels, as well as determination of the geometric accuracy of the segmentation. The representation of document layouts relies on labeling each pixel according to its function in the overall segmentation, permitting pixel-accurate representation of layout information of arbitrary layouts and allowing background pixels to be classified as "dont care". Our representations can be encoded easily in standard color image formats like PNG, permitting easy interchange of segmentation results and ground truth.

...read moreread less

50 citations

Journal Article•DOI•

Document analysis-from pixels to contents

[...]

Jürgen Schürmann¹, Norbert Bartneck¹, Thomas Bayer¹, Jürgen Franke¹, E. Mandler¹, Matthias Oberländer¹ - Show less +2 more•Institutions (1)

Daimler AG¹

01 Jul 1992

TL;DR: The authors present a conceptual framework for solving the task of document analysis, which, in essence, consists in the conversion of the document's pixel representation into an equivalent knowledge network representation holding the document"s content and layout.

...read moreread less

Abstract: The authors present a conceptual framework for solving the task of document analysis, which, in essence, consists in the conversion of the document's pixel representation into an equivalent knowledge network representation holding the document's content and layout. Starting on the pixel level, the formation of elementary geometric objects on which layout analysis as well as the definition of character objects is based is described. Character recognition accomplishes the mapping from geometric object to character meaning in ASCII representation. On the next level of abstraction words are formed and verified by contextual processing. Modeled knowledge about complete documents and about how their constituents are related to the application forms the highest level of abstraction. The various problems arising at each stage are discussed. The dependencies between the different levels are exemplified and technical solutions put forward. >

...read moreread less

49 citations

Collapse

Network Information

Performance

Metrics

1,488

Papers

35,779

Citations

No. of papers in the topic in previous years
Year	Papers
2023	5
2022	19
2021	34
2020	19
2019	14
2018	9

Document layout analysis

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics