scispace - formally typeset
Book ChapterDOI

GradientBased Learning Applied to Document Recognition

TLDR
Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.
Abstract
Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank check is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provides record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

read more

Citations
More filters
Journal ArticleDOI

Modeling language and cognition with deep unsupervised learning: a tutorial overview

TL;DR: It is argued that the focus on deep architectures and generative (rather than discriminative) learning represents a crucial step forward for the connectionist modeling enterprise, because it offers a more plausible model of cortical learning as well as a way to bridge the gap between emergentist connectionist models and structured Bayesian models of cognition.
Posted Content

Multi-source Domain Adaptation in the Deep Learning Era: A Systematic Survey.

TL;DR: This survey defines various MDA strategies and summarize available datasets for evaluation, and compares modern MDA methods in the deep learning era, including latent space transformation and intermediate domain generation.
Proceedings ArticleDOI

Broad learning system: A new learning paradigm and system without going deep

TL;DR: A Broad Learning System is introduced that gives a new paradigm and learning system without the need of deep architecture and can be updated dynamically and incrementally without going through a retraining process.
Journal ArticleDOI

ESFNet: Efficient Network for Building Extraction From High-Resolution Aerial Images

TL;DR: Compared to available deep learning models, the proposed ARC-Net demonstrates better segmentation performance with less computational costs and is both effective and efficient in automatic building extraction from high-resolution aerial images.
Proceedings ArticleDOI

Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation

TL;DR: A new framework to exclude feature points using a mask produced by semantic segmentation enables vSLAM to stably estimate camera motion and can achieve significantly higher accuracy than state-of-the-art methods.
Related Papers (5)