scispace - formally typeset
Book ChapterDOI

GradientBased Learning Applied to Document Recognition

TLDR
Various methods applied to handwritten character recognition are reviewed and compared and Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques.
Abstract
Multilayer Neural Networks trained with the backpropagation algorithm constitute the best example of a successful Gradient-Based Learning technique. Given an appropriate network architecture, Gradient-Based Learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional Neural Networks, that are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called Graph Transformer Networks (GTN), allows such multi-module systems to be trained globally using Gradient-Based methods so as to minimize an overall performance measure. Two systems for on-line handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of Graph Transformer Networks. A Graph Transformer Network for reading bank check is also described. It uses Convolutional Neural Network character recognizers combined with global training techniques to provides record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

read more

Citations
More filters
Journal ArticleDOI

PULP: A Ultra-Low Power Parallel Accelerator for Energy-Efficient and Flexible Embedded Vision

TL;DR: PULP (Parallel processing Ultra-Low Power platform), an architecture built on clusters of tightly-coupled OpenRISC ISA cores, with advanced techniques for fast performance and energy scalability that exploit the capabilities of the STMicroelectronics UTBB FD-SOI 28nm technology is proposed.
Proceedings ArticleDOI

Towards Automated Melanoma Detection With Deep Learning: Data Purification and Augmentation

TL;DR: This work builds deep-learning-based tools for data purification and augmentation to counter-act limitations of the available skin lesion data bases and shows that incorporating these two units into melanoma detection system results in the superior performance over common baselines.
Book ChapterDOI

Multi-dimensional graph convolutional networks

TL;DR: In this article, a multi-dimensional convolutional neural network (mGCN) is proposed to capture rich information in learning node-level representations for multidimensional graphs, where each type of relation is modeled as a dimension.
Posted Content

Understanding Intra-Class Knowledge Inside CNN

TL;DR: To invert the intra-class knowledge inside CNN into more interpretable images, a non-parametric patch prior upon previous CNN visualization models is proposed and it is shown how different "styles" of templates for an object class are organized by CNN in terms of location and content, and represented in a hierarchical and ensemble way.
Proceedings Article

Coupling Adaptive Batch Sizes with Learning Rates

TL;DR: In this article, the authors proposed a method for dynamic batch size adaptation, which estimates the variance of the stochastic gradients and adapts the batch size to decrease the variance proportionally to the value of the objective function, removing the need for a learning rate decrease.
Related Papers (5)