Book Chapter DOI

Gradient-Based Learning Applied to Document Recognition

TL;DR: Various methods applied to handwritten character recognition are reviewed and compared, and Convolutional Neural Networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.
Abstract
Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank checks is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provide record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.
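The abstract's core building blocks, convolutional layers with local receptive fields and shared weights followed by subsampling, can be sketched in a few lines of NumPy. This is an illustrative toy forward pass, not the paper's LeNet architecture; the input, kernel, and sizes are invented for the example, and "convolution" here means cross-correlation, as is conventional in neural networks.

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2D convolution (cross-correlation): each output unit sees a
    local receptive field of the input, and all units share one kernel."""
    kh, kw = kernel.shape
    H, W = image.shape
    out = np.empty((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def avg_pool2d(x, size=2):
    """Non-overlapping average pooling (subsampling)."""
    H2, W2 = x.shape[0] // size, x.shape[1] // size
    return x[:H2 * size, :W2 * size].reshape(H2, size, W2, size).mean(axis=(1, 3))

# Toy 8x8 input, 3x3 vertical-edge kernel, then 2x2 subsampling.
img = np.arange(64, dtype=float).reshape(8, 8)
kernel = np.array([[1.0, 0.0, -1.0]] * 3)
fmap = conv2d(img, kernel)      # 6x6 feature map
pooled = avg_pool2d(fmap, 2)    # 3x3 after subsampling
```

Weight sharing is what gives the network tolerance to shifts of the input pattern: the same kernel is applied at every position, so a translated digit produces a translated feature map rather than an entirely different activation pattern.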


Citations
Posted Content

What are the Receptive, Effective Receptive, and Projective Fields of Neurons in Convolutional Neural Networks?

TL;DR: This work explains in detail how receptive fields, effective receptive fields and projective fields of neurons in different layers, convolution or pooling, of a Convolutional Neural Network are calculated.
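The receptive-field computation this citing work explains follows a standard recurrence through the stack of convolution and pooling layers: the receptive field grows by (kernel size − 1) times the cumulative stride ("jump") at each layer. A minimal sketch, with an invented function name and layer specification:

```python
def receptive_field(layers):
    """Theoretical receptive field of one top-layer unit, given a list of
    (kernel_size, stride) pairs for each conv/pool layer, bottom to top.
    Recurrence: r += (k - 1) * jump, then jump *= stride."""
    r, jump = 1, 1
    for k, s in layers:
        r += (k - 1) * jump
        jump *= s
    return r

# conv3x3/s1 -> conv3x3/s1 -> pool2x2/s2 -> conv3x3/s1
print(receptive_field([(3, 1), (3, 1), (2, 2), (3, 1)]))  # -> 10
```

Note that this is the theoretical receptive field; the *effective* receptive field studied in the cited work is typically much smaller, since input pixels near the edge of the theoretical field contribute only weakly to the output.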
Proceedings Article DOI

Private-kNN: Practical Differential Privacy for Computer Vision

TL;DR: This work proposes a practically data-efficient scheme based on private release of k-nearest neighbor (kNN) queries, which altogether avoids splitting the training dataset, and achieves comparable or better accuracy than PATE while reducing more than 90% of the privacy loss.
Proceedings Article DOI

Large-scale short-term urban taxi demand forecasting using deep learning

TL;DR: The experimental results show DNNs indeed outperform most traditional machine learning techniques, but such superior results can only be achieved with a properly designed DNN architecture, in which domain knowledge plays a key role.
Posted Content

A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding

TL;DR: Four experiments are reported that are the first rigorous comparisons of simulation-based and CNN-based models, where both approaches are concretely instantiated in algorithms that can run on raw image inputs and produce as outputs physical judgments such as whether a stack of blocks will fall.
Posted Content

Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks

TL;DR: The proposed approach, called Convolutional BoF (CBoF), uses RBF neurons to quantize the information extracted from the convolutional layers and it is able to natively classify images of various sizes as well as to significantly reduce the number of parameters in the network.
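The Bag-of-Features pooling idea can be sketched as soft quantization of convolutional feature vectors against a set of codewords using RBF responses, averaged into a fixed-length histogram. This is a minimal sketch of the general technique, not the CBoF layer itself; the codewords here are random rather than learned, and all names and sizes are invented.

```python
import numpy as np

def rbf_bof_pool(features, codewords, gamma=1.0):
    """Soft Bag-of-Features pooling: compute RBF memberships of each
    feature vector to each codeword, normalize them per feature vector,
    and average into a histogram. The output length equals the number
    of codewords, independent of how many feature vectors come in,
    which is why images of various sizes can be handled natively."""
    # features: (N, D) vectors from a conv layer; codewords: (K, D)
    d2 = ((features[:, None, :] - codewords[None, :, :]) ** 2).sum(-1)
    memberships = np.exp(-gamma * d2)                      # (N, K) RBF responses
    memberships /= memberships.sum(axis=1, keepdims=True)  # soft assignment
    return memberships.mean(axis=0)                        # (K,) histogram

rng = np.random.default_rng(0)
codes = rng.normal(size=(8, 16))                          # 8 codewords, 16-dim
small = rbf_bof_pool(rng.normal(size=(36, 16)), codes)    # "small image"
large = rbf_bof_pool(rng.normal(size=(196, 16)), codes)   # "large image"
```

Both outputs have length 8 regardless of the number of spatial positions, so a fixed-size classifier can sit on top; the parameter count is then tied to the codebook size rather than the feature-map resolution.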