scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Proceedings ArticleDOI
01 Sep 2015
TL;DR: A completely automatic and scalable framework to perform query-by-example word-spotting on medieval manuscripts that does not require any human intervention to produce a large amount of annotated training data and provides Computer Vision researchers and Cultural Heritage practitioners with a compact and efficient system for document analysis.
Abstract: We present a completely automatic and scalable framework to perform query-by-example word-spotting on medieval manuscripts. Our system does not require any human intervention to produce a large amount of annotated training data, and it provides Computer Vision researchers and Cultural Heritage practitioners with a compact and efficient system for document analysis. We have executed the pipeline both in a single-manuscript and a cross-manuscript setup, and we have tested it on a heterogeneous set of medieval manuscripts, that includes a variety of writing styles, languages, image resolutions, levels of conservation, noise and amount of illumination and ornamentation. We also present a precision/recall based analysis to quantitatively assess the quality of the proposed algorithm.

7 citations

Proceedings ArticleDOI
10 Sep 2001
TL;DR: This paper presents a computer assisted document synthesis system based on the grammar-based structure analysis designed to accomplish the analysis and synthesis of table form documents cooperatively by a user and computer.
Abstract: Document structure is an important issue not only for document analysis but for document synthesis. This paper presents a computer assisted document synthesis system based on the grammar-based structure analysis. The system is designed to accomplish the analysis and synthesis of table form documents cooperatively by a user and computer; namely, the user interprets the document meaning and gives the entry data to be filled in, while the computer detects the boxes formed by horizontal and vertical rules and determines the logical relations of adjacent boxes. First, the document is decomposed into a set of boxes and they are classified semi-automatically into four types: blank, insertion, indication and explanation. Then the box relations between the indication box and its associated entry one are analyzed based on the semantic and geometric knowledge defined in the document structure grammar. Finally, the system generates LATEX codes of the synthesized documents whose blank and insertion boxes are filled with the text and image data given by user. Experimental results have shown that the system analyzed successfully several kinds of table forms and yielded synthesized documents as expected.

7 citations

Proceedings ArticleDOI
31 Aug 2005
TL;DR: This paper proposes a learning-based method to label logical components on Arabic newspaper documents called LUNET, which is driven by artificial neural nets.
Abstract: Logical structure analysis is an important phase in the process of document image understanding. In this paper we propose a learning-based method to label logical components on Arabic newspaper documents. The labeling is driven by artificial neural nets. Each one is specialized in a document class. The first prototype of LUNET has been tested on a set of Arabic newspapers of three document classes. Some promising experimental results are reported.

7 citations

Patent
19 Aug 1997
TL;DR: In this paper, a matching processing for estimating the preceding version of a new document to be archived is to search the document having the most descriptors similar to the new document, when the most suitable document is the latest version, the document which is newly archived, shows that it is the unique succeeding version of the most appropriate document.
Abstract: PROBLEM TO BE SOLVED: To facilitate the filing of an electronic document and document comparasion by storing the descriptor of the stored document and comparing the descriptor with a descriptor in a data base. SOLUTION: The descriptor data base 304 stores the plural descriptors for identifying the feature of one document. The documents in the document data base 302 are stored for every descriptor. A matching processing for estimating the preceding version of the new document to be archived is to search the document having the most descriptors similar to the new document. When the most suitable document is the latest version, the document, which is newly archived, shows that it is the unique succeeding version of the most suitable document. Namely, a hyper text data base 312 is updated in such a way that the title of the document which has been newly archived becomes just below the title of the most suitable document on HTML pages 314 and 316 provided for respective document version groups.

7 citations

Journal ArticleDOI
TL;DR: A competent DLA system, named as BINYAS, based on the connected component (CC) and pixel analysis based approach is proposed, which performs significantly better than state-of-the-art methods in terms of the evaluation metrics considered by the research community of this domain.
Abstract: Document layout analysis (DLA) is an irreplaceable pre-requisite for the development of a comprehensive document image processing and analysis system. The main purpose of DLA is to segment an input document image into its constituent and coherent regions and identify their classes. In this paper, we propose a competent DLA system, named as BINYAS, based on the connected component (CC) and pixel analysis based approach. Here, we initially identify the regions and then classify these regions as paragraph, separator, graphic, image, table, chart, and inverted text etc. The proposed system is evaluated on four publicly available standard datasets, namely ICDAR 2009, 2015, 2017 and 2019 page segmentation competition datasets, and the performance is compared with many contemporary methods, which also include some well-known software products and deep learning based methods. Experimental results show that our method performs significantly better than state-of-the-art methods in terms of the evaluation metrics considered by the research community of this domain.

7 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189