scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Proceedings Article
01 Aug 2011
TL;DR: A binarization-free layout analysis method for ancient manuscripts is proposed, which identifies and localizes layout entities exploiting their structural similarities on the local level.
Abstract: A binarization-free layout analysis method for ancient manuscripts is proposed, which identifies and localizes layout entities exploiting their structural similarities on the local level. Hence, the textual entities are disassembled into segments, and a part-based detection is done which employs local gradient features known from the field of object recognition, the Scale Invariant Feature Transform (SIFT), to describe these structures. Layout analysis is the first step in the process of document understanding; it identifies regions of interest and, hence, serves as input for other algorithms such as Optical Character Recognition (OCR). Moreover, the document layout allows scholars to establish the spatio-temporal origin, authenticate, or index a document. The layout entities considered in this approach include the body text, embellished initials, plain initials and headings.

8 citations

Patent
17 Feb 2004
TL;DR: In this article, a document coding system that allows a single person, company or organization to tack leaked or copied confidential documents, issued to different departments or associates within an organisation, back to the department or person of the non-approved document copy is described.
Abstract: The invention described here consists of a document coding system that will give a single person, company or organisation the ability to tack leaked or copied confidential documents, issued to different departments or associates within an organisation, back to the department or person of the non approved document copy. It will give each printed copy of a document a unique fingerprint. The invention described herein may achieve this by encoding information in a word processed document using subtle changes in font, spacing and page layout, for example, to reflect the time, user and printer information. This will provide a way of tracking a document to the time and place of creation. The invention includes means for either automatically decoding information in documents by using optical character recognition (OCR) or document image analysis, for example, or to provide a visual means to assist manual document decoding.

8 citations

Patent
24 Aug 2000
TL;DR: In this paper, the problem of taking out contents such as a text, a picture, and a table from an electronic document and making them integrally handleable and reusable is addressed.
Abstract: PROBLEM TO BE SOLVED: To take out contents such as a 'text', a 'picture' and a 'table' from an electronic document and to make them integrally handleable and reusable. SOLUTION: In the document processor processing the electronic document, an electronic document preparation part 103 analyzes the layout of picture data, divides it into the areas of prescribed attributes and prepares the electronic document 104 including the contents for every divided area by designating the attribute so that it can be extracted. A contents detection part 109 detects the contents in the electronic document 104 and a content management part 110 registers and manages the detected contents based on information showing the attribute.

8 citations

Patent
Einat Amitay1, Nadav Har'El1
19 Jan 2005
TL;DR: In this article, a method for finding content-rich text in a document by identifying areas of narrative in the document is presented. But this method requires a large number of annotated documents.
Abstract: A method includes finding content-rich text in a document by identifying areas of narrative in the document. An apparatus includes a detector and a content-rich text indicator. The detector detects linguistic parameters which characterize narrative text in an input document and the content-rich text indicator provides the locations of narrative text in the input document.

8 citations

Book ChapterDOI
04 Nov 1998
TL;DR: The system consists of three main components namely detection of mathematical expressions in a document, recognition of the symbols present in the expression and meaningful arrangement of the recognized symbols.
Abstract: In this paper, we propose an approach for understanding mathematical expressions in printed document. The system consists of three main components namely (i) detection of mathematical expressions in a document, (ii) recognition of the symbols present in the expression and (iii) meaningful arrangement of the recognized symbols. However, detection of mathematical expressions is done through recognition of symbols. Moreover, some structural features of the expressions are also used for this purpose. For recognition of the symbols a hybrid of feature based and template based recognition techniques is used. The bounding-box coordinates and the size information of the symbols help to determine the spatial relationships among the symbols. A set of predefined grammar rules is used to form the meaningful symbol groups to properly arrange the symbols. Experiments conducted using these approaches on a large number of documents show high accuracy.

8 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189