scispace - formally typeset
Book ChapterDOI

GradientBased Learning Applied to Document Recognition

TLDR
Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.
Abstract
Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank check is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provides record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

read more

Citations
More filters
Posted Content

Variational Deep Embedding: A Generative Approach to Clustering.

TL;DR: Variational Deep Embedding (VaDE), a novel unsupervised generative clustering approach within the framework of Variational Auto-Encoder (VAE), shows its capability of generating highly realistic samples for any specified cluster, without using supervised information during training.
Posted Content

Characterizing the Decision Boundary of Deep Neural Networks.

TL;DR: The proposed Deep Decision boundary Instance Generation utilizes a method based on adversarial example generation as an effective way of generating samples near the decision boundary of any deep neural network model and introduces a set of important principled characteristics that take advantage of the generated instances near the decided boundary to provide multifaceted understandings of deep neural networks.
Posted Content

Visual Tracking by Reinforced Decision Making.

TL;DR: This paper introduces a novel visual tracking algorithm based on a template selection strategy constructed by deep reinforcement learning methods and shows that the tracking algorithm effectively decides the best template for visual tracking.
Journal ArticleDOI

Facial Expression Recognition Based on a Hybrid Model Combining Deep and Shallow Features

TL;DR: The results of additional cross-database experiments also demonstrate the considerable potential of combining shallow features with deep learning features, and these results are more promising than state-of-the-art models.
Proceedings Article

Likelihood-free MCMC with Amortized Approximate Ratio Estimators

TL;DR: In this paper, a flexible amortized estimator which approximates the likelihood-to-evidence ratio is proposed to address the intractability of the likelihood and the marginal model.
Related Papers (5)