Document layout analysis

About: Document layout analysis is a research topic. Over its lifetime, 1,462 publications have been published within this topic, receiving 34,021 citations.

Papers published on a yearly basis (chart; years covered: 1969 and 1982 to 2023)

Papers

Patent•

Document image generating apparatus, document image generating method and computer program

Takeshi Kutsumi, Ai Fujiwara, Ichiko Sata

24 Feb 2011

TL;DR: A document image generating apparatus that preserves the layout of the original text in the original text image while improving the readability of the original text.

Abstract: The aim is to provide a document image generating apparatus, a document image generating method and a computer program that preserve the layout of the original text in the original text image while improving the readability of both the original text and the annotation corresponding to it (e.g., a translation). A translation 421 of original text 411 is aligned in the interline space between the original text 411 on the first line and the original text 412 on the second line. When the interline space is narrow, as shown in FIG. 4B, the original text 411 overlays the translation 421. In that case, the color of the original text 411 is changed to a low-visibility color, and the color of the translation 421 is changed to a high-visibility color.

12 citations

Patent•

Document storage system

Anjaneyulu Seetha Rama Kuchibhotla, Guruprasad Chintakunta, Sitaram Ramachandrula, Sriganesh Madhvanath, Deivanayagam Ramakrishnan

21 Jul 2009

TL;DR: A method of storing a document together with one or more related images of alterations made to it: an image of the document is captured and stored in memory, and the differences found in an image of an altered version are extracted and stored as a separate image.

Abstract: Methods for storing and managing hard copy documents and their modified versions are disclosed. Specifically, a method of storing a document and one or more related images of alterations made to the document, comprising capturing an image of the document; storing the image of the document in memory; capturing an image of an altered version of the document; comparing the image of the document to the image of the altered version of the document; extracting the differences between the image of the document and the image of the altered version of the document; creating an image of the extracted differences between the image of the document and the image of the altered version of the document; and storing the image of the extracted differences in memory.

12 citations
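
The compare-and-extract steps listed in the abstract above can be illustrated with a minimal sketch. It is not the patented method: it assumes both captures are already aligned, equally sized grayscale images held as nested lists, and the names `extract_differences`, `threshold`, `background` and `store` are hypothetical.

```python
from typing import Dict, List

Image = List[List[int]]  # grayscale pixels, 0 (black) to 255 (white)


def extract_differences(original: Image, altered: Image,
                        threshold: int = 32, background: int = 255) -> Image:
    """Return an image that keeps only the pixels where `altered` departs from `original`.

    Assumption: both images are already registered (aligned) and have the same size.
    """
    if len(original) != len(altered) or any(
        len(row_o) != len(row_a) for row_o, row_a in zip(original, altered)
    ):
        raise ValueError("images must have the same dimensions")
    return [
        [new if abs(new - old) > threshold else background
         for old, new in zip(row_o, row_a)]
        for row_o, row_a in zip(original, altered)
    ]


# Hypothetical in-memory store: document id -> original image plus difference images.
store: Dict[str, dict] = {}

original = [[255, 255, 255],
            [255,   0, 255],
            [255, 255, 255]]
altered = [[255, 255, 255],
           [255,   0, 255],
           [  0, 255, 255]]   # one new dark mark in the bottom-left corner

diff = extract_differences(original, altered)
store["doc-1"] = {"original": original, "diffs": [diff]}
```

Storing only the difference image next to the original means each altered version costs little extra memory when few pixels change; a real system would also need an image registration step before comparing scans.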

Patent•

Document logical structure generating method

Hiromichi Fujisawa, Masashi Koga, Tatsuya Murakami, Yoshihiro Shima

09 Nov 1990

TL;DR: A method that converts a text file represented as linear character strings into a hierarchical tree structure by analyzing the index character strings that correspond to the chapters, paragraphs, and clauses in the main body of a document and automatically generating the tree-shaped logical structure.

Abstract: PURPOSE: To convert a text file which is represented with linear character strings into a hierarchical tree structure by analyzing index character strings corresponding to the chapters, paragraphs, and clauses in the main body of a document and automatically generating the tree-shaped logical structure. CONSTITUTION: A document read part 101 recognizes the characters of inputted document image data, and the recognized document data are stored, document by document, in a document data storage part 103. An index symbol analytic part 102 extracts index symbols and generates the logical structures of the documents from the meaning of the index symbols, and the generated logical structures are stored in the logical structure data storage part 104. A display control part 105 displays the logical structure of a document on a terminal device 106 with a screen according to the stored logical structure data. Consequently, the document file which is represented with linear character strings can be converted into the hierarchical tree structure. COPYRIGHT: (C)1992,JPO&Japio

12 citations
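
The idea in the abstract above, deriving a tree from index strings, can be sketched as follows. This is an illustration under stated assumptions, not the patent's index symbol analysis: it assumes the index strings are dotted decimal numbers such as `1`, `1.1` or `1.1.1` at the start of a line, and the names `Node`, `INDEX_RE` and `build_logical_structure` are hypothetical.

```python
import re
from typing import List

# Assumed index format: "1 Title", "1. Title", "1.2 Title", "1.2.3 Title", ...
INDEX_RE = re.compile(r"^(\d+(?:\.\d+)*)\.?\s+(.*)$")


class Node:
    """One element of the logical structure (chapter, paragraph, clause, ...)."""

    def __init__(self, index: str, title: str) -> None:
        self.index = index
        self.title = title
        self.body: List[str] = []        # plain text lines under this heading
        self.children: List["Node"] = []  # nested headings


def build_logical_structure(lines: List[str]) -> Node:
    """Turn a linear list of text lines into a tree keyed on index strings."""
    root = Node(index="", title="document")
    stack = [(0, root)]  # (depth, node); the root sits at depth 0
    for raw in lines:
        text = raw.strip()
        if not text:
            continue
        match = INDEX_RE.match(text)
        if match is None:
            stack[-1][1].body.append(text)  # body text belongs to the open section
            continue
        index, title = match.groups()
        depth = index.count(".") + 1       # "1" -> 1, "1.2" -> 2, "1.2.3" -> 3
        while stack[-1][0] >= depth:       # close sections at the same or deeper level
            stack.pop()
        node = Node(index=index, title=title)
        stack[-1][1].children.append(node)
        stack.append((depth, node))
    return root


tree = build_logical_structure([
    "1 Introduction",
    "Some introductory text.",
    "1.1 Scope",
    "2 Method",
])
```

A body line that happens to begin with a number would be mistaken for a heading by this simple pattern, so a practical analyzer needs stricter rules about which strings count as index symbols.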

Journal Article•

Layout analysis and content enrichment of digitized books

Costantino Grana¹, Giuseppe Serra¹, Marco Manfredi¹, Dalia Coppi¹, Rita Cucchiara¹

University of Modena and Reggio Emilia¹

01 Apr 2016 - Multimedia Tools and Applications

TL;DR: A supervised learning approach is proposed for segmenting text and illustrations in digitized old documents, using a texture feature based on local correlation that detects the repeating patterns of text regions and differentiates them from pictorial elements.

Abstract: In this paper we describe a system for automatically analyzing old documents and creating hyperlinks between different epochs, thus opening ancient documents to young people and making them available on the web with old and current content. We propose a supervised learning approach to segment text and illustrations in digitized old documents, using a texture feature based on local correlation aimed at detecting the repeating patterns of text regions and differentiating them from pictorial elements. Moreover, we present a solution to help the user find contemporary content connected to what is automatically extracted from the ancient documents.

12 citations

Patent•

Device, method, and program for document classification

Hiromi Oda¹

Hewlett-Packard¹

07 Oct 2005

TL;DR: A document classifying device that includes a vector creating element, which creates a document feature vector from an input document to be classified based upon the frequencies with which predetermined collocations occur in that document.

Abstract: A document classifying device, including (a) a vector creating element for creating a document feature vector from an input document to be classified, based upon frequencies with which predetermined collocations occur in the input document; and (b) a classifying element for classifying the input document into one of a number of categories using the document feature vector.

12 citations
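
The two elements named in the abstract above, a collocation-frequency feature vector and a classifier that uses it, can be sketched as below. The abstract does not say how the classifying element works, so the nearest-prototype rule with cosine similarity is an assumption made for illustration, as are the whitespace tokenization, the two-word collocations, and the names `feature_vector`, `classify`, `COLLOCATIONS` and `PROTOTYPES`.

```python
import math
from typing import Dict, List, Sequence, Tuple

Collocation = Tuple[str, str]  # assumed here to be an adjacent word pair


def feature_vector(text: str, collocations: Sequence[Collocation]) -> List[float]:
    """Relative frequency of each predetermined collocation in the document."""
    tokens = text.lower().split()            # naive whitespace tokenization
    bigrams = list(zip(tokens, tokens[1:]))
    total = max(len(bigrams), 1)             # avoid division by zero on tiny inputs
    return [bigrams.count(c) / total for c in collocations]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(text: str, collocations: Sequence[Collocation],
             prototypes: Dict[str, List[float]]) -> str:
    """Assign the category whose prototype vector is most similar (assumed rule)."""
    vec = feature_vector(text, collocations)
    return max(prototypes, key=lambda label: cosine(vec, prototypes[label]))


# Hypothetical collocations and category prototypes, for illustration only.
COLLOCATIONS = [("interest", "rate"), ("penalty", "kick")]
PROTOTYPES = {"finance": [1.0, 0.0], "sports": [0.0, 1.0]}

label = classify("The bank raised the interest rate again", COLLOCATIONS, PROTOTYPES)
```

In practice the prototypes would be learned from labeled documents rather than written by hand, and any classifier that accepts a fixed-length vector could take the place of the cosine rule.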

Performance

Metrics: 1,488 papers; 35,779 citations

No. of papers in the topic in previous years
Year	Papers
2023	5
2022	19
2021	34
2020	19
2019	14
2018	9