Document layout analysis

About: Document layout analysis is a research topic. Over its lifetime, 1,462 publications have been published within this topic, receiving 34,021 citations.

Papers published on a yearly basis (chart; years covered: 1969 and 1982 to 2023)

Papers

Proceedings Article

Discrimination of machine-printed from handwritten text using simple structural characteristics

Ergina Kavallieratou¹, S. Stamatatos¹

American Hotel & Lodging Educational Institute¹

23 Aug 2004

TL;DR: Experiments on document images taken from the IAM-DB and GRUHD databases show remarkable performance of the proposed approach, which requires minimal training data, in discriminating between machine-printed and handwritten text.

Abstract: In this paper, we present a trainable approach to discriminate between machine-printed and handwritten text. An integrated system able to localize text areas and split them into text lines is used. A set of simple, easy-to-compute structural characteristics that capture the differences between machine-printed and handwritten text lines is introduced. Experiments on document images taken from the IAM-DB and GRUHD databases show remarkable performance of the proposed approach, which requires minimal training data.

38 citations

Patent

Document data output device capable of appropriately outputting document data containing a text and layout information

Hiroaki Zaima, Osamu Tsumori, Shuichiro Ono

11 May 2004

TL;DR: In this patent, the text portion is reversibly compressed and the image portion is reversibly or irreversibly compressed to create compressed document data, which is used for communication, thereby reducing the communication amount.

Abstract: In a document data display system, document data consists of a text portion, an image portion, and layout information. Among them, the text portion is reversibly compressed and the image portion is reversibly or irreversibly compressed so as to create compressed document data, which is used for communication, thereby reducing the communication amount. Moreover, a document data display device (1) decompresses the text portion or the image portion from the received compressed document data in a text decompression section (105) or an image decompression section (106), and arranges the text portion or the image portion in a plot section (108) according to the layout information analysis result from the layout information analysis section (107), for display in a display section (109).

38 citations

Proceedings Article

ANASTASIL: hybrid knowledge-based system for document layout analysis

Andreas Dengel¹, Gerhard Barth¹

German Research Centre for Artificial Intelligence¹

20 Aug 1989

TL;DR: A knowledge-based system for identifying the different regions of a document image. It uses a hybrid, modular knowledge representation whose essential part is a so-called geometric tree, which is used to perform a best-first search in combination with a "hypothesize & test" strategy.

Abstract: This paper describes a knowledge-based system for the identification of the different regions of a document image. It uses a hybrid, modular knowledge representation, a so-called geometric tree being its essential part. This tree is used to perform a best-first search in combination with a "hypothesize & test" strategy. It produces an internal, editable description of the entire document and its constituents. The system has been implemented for the analysis of single-sided business letters in Common Lisp on a SUN 3/60 workstation. It has been run on a large population of different business letters. The results obtained have been very encouraging and have convincingly confirmed the soundness of the approach.

38 citations

Patent

Document retrieval system for displaying document image data with inputted bibliographic items and character string selected from multiple character candidates

Hiromichi Fujisawa¹, Atsushi Hatakeyama¹, Yasuaki Nakano¹, Junichi Higashino¹, Toshihiro Hananoi¹, +1 more

Hitachi¹

30 Dec 1987

TL;DR: A document storage and retrieval system that stores the document body as an image and text information as a character code string for retrieval, executes retrieval against the text information, and displays the related document image on a retrieval terminal according to the retrieval result.

Abstract: A document storage and retrieval system comprising means for storing a document body in the form of an image, means for storing text information in the form of a character code string for retrieval, apparatus for executing a retrieval with reference to the text information, and apparatus for displaying the related document image on a retrieval terminal according to the retrieval result. Such a system can retrieve the full contents of a document and can also display the document body as an image, printed in a format that is easy to read. Users can retrieve documents with arbitrary words and can read, through a terminal, even documents complicated by mathematical expressions and charts in the form of an image, the same as on paper. A system is provided wherein the text information for retrieval is extracted automatically from the document image through character recognition. Because the precision of character recognition has hitherto been unsatisfactory, visual checking and correction have always been carried out by operators. With this system, however, operators no longer need to attend to that task.

38 citations

Book Chapter

Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning

Jian Liang¹, David Doermann¹

University of Maryland, College Park¹

19 Aug 2002

TL;DR: This system is able to learn a model for a document class, use this model to label document images through graph matching, and adaptively improve the model with error feedback.

Abstract: Logical structure analysis of document images is an important problem in document image understanding. In this paper, we propose a graph matching approach to label logical components on a document page. Our system is able to learn a model for a document class, use this model to label document images through graph matching, and adaptively improve the model with error feedback. We tested our method on title pages of journal and proceedings articles. The experimental results show promising accuracy and confirm the ability of adaptive learning.

38 citations

Performance

Metrics

Papers: 1,488
Citations: 35,779

No. of papers in the topic in previous years
Year	Papers
2023	5
2022	19
2021	34
2020	19
2019	14
2018	9
