GradientBased Learning Applied to Document Recognition

doi:10.1109/9780470544976.CH9

Book ChapterDOI

GradientBased Learning Applied to Document Recognition

- pp 306-351

TLDR

Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.

Abstract:

Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank check is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provides record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

Citations

PDF

Open Access

More filters

Posted Content

Variational Deep Embedding: A Generative Approach to Clustering.

Zhuxi Jiang, +4 more

TL;DR: Variational Deep Embedding (VaDE), a novel unsupervised generative clustering approach within the framework of Variational Auto-Encoder (VAE), shows its capability of generating highly realistic samples for any specified cluster, without using supervised information during training.

...read moreread less

Posted Content

Characterizing the Decision Boundary of Deep Neural Networks.

Hamid Karimi, +2 more

- 24 Dec 2019 -

arXiv: Learning

TL;DR: The proposed Deep Decision boundary Instance Generation utilizes a method based on adversarial example generation as an effective way of generating samples near the decision boundary of any deep neural network model and introduces a set of important principled characteristics that take advantage of the generated instances near the decided boundary to provide multifaceted understandings of deep neural networks.

...read moreread less

Posted Content

Visual Tracking by Reinforced Decision Making.

Janghoon Choi, +2 more

TL;DR: This paper introduces a novel visual tracking algorithm based on a template selection strategy constructed by deep reinforcement learning methods and shows that the tracking algorithm effectively decides the best template for visual tracking.

...read moreread less

Journal ArticleDOI

Facial Expression Recognition Based on a Hybrid Model Combining Deep and Shallow Features

Xiao Sun, +1 more

- 01 Aug 2019 -

Cognitive Computation

TL;DR: The results of additional cross-database experiments also demonstrate the considerable potential of combining shallow features with deep learning features, and these results are more promising than state-of-the-art models.

...read moreread less

Proceedings Article

Likelihood-free MCMC with Amortized Approximate Ratio Estimators

Joeri Hermans, +2 more

TL;DR: In this paper, a flexible amortized estimator which approximates the likelihood-to-evidence ratio is proposed to address the intractability of the likelihood and the marginal model.

...read moreread less

Collapse

GradientBased Learning Applied to Document Recognition

Citations

Variational Deep Embedding: A Generative Approach to Clustering.

Characterizing the Decision Boundary of Deep Neural Networks.

Visual Tracking by Reinforced Decision Making.

Facial Expression Recognition Based on a Hybrid Model Combining Deep and Shallow Features

Likelihood-free MCMC with Amortized Approximate Ratio Estimators

Related Papers (5)

Gradient-based learning applied to document recognition

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet: A large-scale hierarchical image database