GradientBased Learning Applied to Document Recognition

doi:10.1109/9780470544976.CH9

Book ChapterDOI

GradientBased Learning Applied to Document Recognition

- pp 306-351

TLDR

Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.

Abstract:

Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank check is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provides record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Modeling language and cognition with deep unsupervised learning: a tutorial overview

Marco Zorzi, +2 more

- 20 Aug 2013 -

Frontiers in Psychology

TL;DR: It is argued that the focus on deep architectures and generative (rather than discriminative) learning represents a crucial step forward for the connectionist modeling enterprise, because it offers a more plausible model of cortical learning as well as a way to bridge the gap between emergentist connectionist models and structured Bayesian models of cognition.

...read moreread less

Posted Content

Multi-source Domain Adaptation in the Deep Learning Era: A Systematic Survey.

Sicheng Zhao, +4 more

- 26 Feb 2020 -

arXiv: Learning

TL;DR: This survey defines various MDA strategies and summarize available datasets for evaluation, and compares modern MDA methods in the deep learning era, including latent space transformation and intermediate domain generation.

...read moreread less

Proceedings ArticleDOI

Broad learning system: A new learning paradigm and system without going deep

C. L. Philip Chen, +1 more

TL;DR: A Broad Learning System is introduced that gives a new paradigm and learning system without the need of deep architecture and can be updated dynamically and incrementally without going through a retraining process.

...read moreread less

Journal ArticleDOI

ESFNet: Efficient Network for Building Extraction From High-Resolution Aerial Images

Jingbo Lin, +3 more

- 23 Apr 2019 -

IEEE Access

TL;DR: Compared to available deep learning models, the proposed ARC-Net demonstrates better segmentation performance with less computational costs and is both effective and efficient in automatic building extraction from high-resolution aerial images.

...read moreread less

Proceedings ArticleDOI

Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation

Masaya Kaneko, +4 more

TL;DR: A new framework to exclude feature points using a mask produced by semantic segmentation enables vSLAM to stably estimate camera motion and can achieve significantly higher accuracy than state-of-the-art methods.

...read moreread less

Collapse

GradientBased Learning Applied to Document Recognition

Citations

Modeling language and cognition with deep unsupervised learning: a tutorial overview

Multi-source Domain Adaptation in the Deep Learning Era: A Systematic Survey.

Broad learning system: A new learning paradigm and system without going deep

ESFNet: Efficient Network for Building Extraction From High-Resolution Aerial Images

Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation

Related Papers (5)

Gradient-based learning applied to document recognition

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet: A large-scale hierarchical image database