Topic

Document layout analysis

About: Document layout analysis is a research topic. Over its lifetime, 1,462 publications have been published within this topic, receiving 34,021 citations.


Papers

Proceedings Article

Hierarchical Recurrent Neural Network for Handwritten Strokes Classification

Illya Degtyarenko¹, Ivan Deriuga¹, Andrii Grygoriev¹, Serhii Polotskyi¹, Volodymyr Melnyk¹, Dmytro Zakharchuk¹, Olga Radyvonenko¹

Samsung¹

06 Jun 2021

TL;DR: A hierarchical recurrent neural network (RNN) architecture is proposed to address the hierarchical structure inherent in handwritten documents. A novel feature aggregation pooling technique for transferring data between hierarchical levels improves computational efficiency, which makes the approach suitable for on-device mobile computing.

Abstract: The paper presents an original solution to the online handwritten document processing in a free form, which is aimed at separating multi-class handwritten documents into texts, tables, formulas, drawings, etc. Stroke classification is an important step in automatic document layout analysis (DLA) in handwritten document recognition systems. Major DLA challenges arise due to a wide diversity of handwritten content, various writing styles, a lack of contextual knowledge, and the complicated structure of freeform handwritten documents. In this paper, we propose the hierarchical recurrent neural network (RNN) architecture to address the hierarchical structure inherent to the handwritten document. The novelty of feature aggregation pooling technique for transferring data between hierarchical levels allows achieving higher computational efficiency for using the suggested approach in on-device mobile computing. The presented approach gives an access to new state-of-the-art results in the task of multi-class classification with an accuracy of 97.25% on the IAMonDo dataset. This result can serve as the basis for efficient mobile applications for freeform handwriting document recognition.

9 citations

Patent

Inferring Layout Intent

Karim Farouki¹, David Benjamin Lee¹, Marko Rakita¹, Dusan Lukic¹, Milos Raskovic¹, Dragan Slaveski¹, Aljosa Obuljen¹, Milan Sesum¹

Microsoft¹

28 Sep 2015

TL;DR: The layout intent associated with explicitly formatted document elements in a document is inferred, and an intent-based document is then created using the inferred layout intent for some or all of those elements.

Abstract: Technologies are described herein for inferring the layout intent associated with explicitly formatted document elements in a document. The layout type of a document having explicitly formatted document elements is determined. Once the layout type for the document has been determined, the layout intent of explicitly formatted document elements in the document may be determined based, at least in part, on the determined layout type of the document. Heuristic algorithms and/or machine learning classifiers may determine the layout intent of the explicitly formatted document elements in the document. An intent-based document is then created using the inferred layout intent for some or all of the explicitly formatted document elements in the document. The intent-based document may then be provided to an intent-based rendering or authoring application for rendering based upon the inferred layout intent.

9 citations

Proceedings Article

Text and Non-text Segmentation and Classification from Document Images

Zaidah Ibrahim, Dino Isa¹, Rajprasad Kumar Rajkumar¹

University of Nottingham Malaysia Campus¹

12 Dec 2008

TL;DR: This research focuses on classifying non-text blocks in technical documents as tables, graphs, or figures, and shows that a support vector machine classifies them better than a backpropagation neural network.

Abstract: Text and non-text segmentation and classification is very important in document layout analysis system before it is presented to an OCR system. Heuristic rules have been used in segmenting and classifying the text and non-text blocks. This research focuses on the classification of non-text block in technical documents into table, graph, and figure. A comparative study is conducted between backpropagation neural network and support vector machine and the result shows that support vector machine classifies better than back propagation neural network.

9 citations

Patent

Document Template Generation

Jose Abad Peiro¹, Sherif Yacoub¹

Hewlett-Packard¹

27 Jul 2005

TL;DR: A method of generating a document template. A document is analysed by extracting layout information (one or more document zones and the position of the zones on a page or pages of the document), determining properties of each zone, and generating a semantic label for each zone according to those properties. A template comprising the layout information and the semantic labels is then generated.

Abstract: A method of generating a document template comprising: analysing a document (16) by: extracting layout information from the document (6), the layout information comprising one or more document zones and the position of the zones (112) on a page or pages of the document; determining properties of each of the one or more zones (112); and generating a semantic label (113) for each zone (112) according to the properties determined for that zone (112); and generating a template, the template comprising the layout information and the semantic labels (112).

9 citations

Proceedings Article

Ground-Truth Estimation in Multispectral Representation Space: Application to Degraded Document Image Binarization

Rachid Hedjam, Mohamed Cheriet

25 Aug 2013

TL;DR: A new method of ground-truth estimation for document image binarization is proposed. It uses a multispectral (MS) imaging representation space and is based on the cooperation of multiple classifiers under some constraints.

Abstract: Human ground-truthing is the manual labelling of samples (pixels for example) to generate reference data without any automatic algorithm help. Although a manual ground-truth is more accurate than a machine ground-truth, it still suffers from mislabeling and/or judgement errors. In this paper we propose a new method of ground-truth estimation using multispectral (MS) imaging representation space for the sake of document image binarization. Starting from the initial manual ground-truth, the proposed classification method aims to select automatically some samples with correct labels (well-labeled pixels) from each class for the training phase, then reassign new labels to the document image pixels. The classification scheme is based on the cooperation of multiple classifiers under some constraints. A real data set of MS historical document images and their ground-truth is created to demonstrate the effectiveness of the proposed method of ground-truth estimation.

9 citations

Performance

Metrics

Papers: 1,488

Citations: 35,779

No. of papers in the topic in previous years
Year	Papers
2023	5
2022	19
2021	34
2020	19
2019	14
2018	9
