scispace - formally typeset
Open AccessJournal Article

Document Analysis and Recognition

Takahiro Watanabe
- 25 Mar 1999 - 
- Vol. 82, Iss: 3, pp 601-610
Reads0
Chats0
TLDR
This paper addresses current topics about document image understanding from a technical point of view as a survey and proposes methods/approaches for recognition of various kinds of documents.
Abstract
The subject about document image understanding is to extract and classify individual data meaningfully from paper-based documents. Until today, many methods/approaches have been proposed with regard to recognition of various kinds of documents, various technical problems for extensions of OCR, and requirements for practical usages. Of course, though the technical research issues in the early stage are looked upon as complementary attacks for the traditional OCR which is dependent on character recognition techniques, the application ranges or related issues are widely investigated or should be established progressively. This paper addresses current topics about document image understanding from a technical point of view as a survey. key words: document model, top-down, bottom-up, layout structure, logical structure, document types, layout recognition

read more

Citations
More filters
Book

Supervised Sequence Labelling with Recurrent Neural Networks

Alex Graves
TL;DR: A new type of output layer that allows recurrent networks to be trained directly for sequence labelling tasks where the alignment between the inputs and the labels is unknown, and an extension of the long short-term memory network architecture to multidimensional data, such as images and video sequences.

Methods and Applications

TL;DR: The aim of the research presented in this thesis is to create new methods for design for manufacturing, by using several approaches of KE, and find the beneficial and less beneficial aspects of these methods in comparison to each other and earlier research.
Journal ArticleDOI

Understanding and capturing people's privacy policies in a mobile social networking application

TL;DR: This article reports on the work on PeopleFinder, an application that enables cell phone and laptop users to selectively share their locations with others, and explores technologies that empower users to more effectively and efficiently specify their privacy preferences.
Proceedings ArticleDOI

Enhancing one-class support vector machines for unsupervised anomaly detection

TL;DR: This work applies two modifications in order to make one-class SVMs more suitable for unsupervised anomaly detection: Robust one- Class SVMs and eta one- class SVMs, with the key idea, that outliers should contribute less to the decision boundary as normal instances.
Journal ArticleDOI

A recursive thresholding technique for image segmentation

TL;DR: A general recursive approach for image segmentation by extending Otsu's (1978) method, which segments the brightest homogeneous object from a given image at each recursion, leaving only the darkesthomogeneous object after the last recursion.
References
More filters

Methods and Applications

TL;DR: The aim of the research presented in this thesis is to create new methods for design for manufacturing, by using several approaches of KE, and find the beneficial and less beneficial aspects of these methods in comparison to each other and earlier research.
Journal ArticleDOI

The writer independent online handwriting recognition system frog on hand and cluster generative statistical dynamic time warping

TL;DR: This work concerns the presentation of the classification/training approach, which is called cluster generative statistical dynamic time warping (CSDTW), a general, scalable, HMM-based method for variable-sized, sequential data that holistically combines cluster analysis and statistical sequence modeling.
Proceedings ArticleDOI

Multi-Column Deep Neural Networks for offline handwritten Chinese character classification

TL;DR: Multi-Column Deep Neural Networks achieve state of the art recognition rates on Chinese characters from the ICDAR 2011 and 2013 offline handwriting competitions, approaching human accuracy.
Proceedings ArticleDOI

A rule-based system for document image segmentation

TL;DR: A rule-based system for automatically segmenting a document image into regions of text and nontext is presented and allows easy fine tuning of the algorithmic steps to produce robust rules, to incorporate additional tools (as they become available), and to handle special segmentation needs.
Related Papers (5)