A fast learning algorithm for deep belief nets

doi:10.1162/NECO.2006.18.7.1527

Journal ArticleDOI

A fast learning algorithm for deep belief nets

Geoffrey E. Hinton, +2 more

- 01 Jul 2006 -

Neural Computation

- Vol. 18, Iss: 7, pp 1527-1554

Chats0

TLDR

A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.

Abstract:

We show how to use "complementary priors" to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Application of deep learning to cybersecurity: A survey

Samaneh Mahdavifar, +1 more

- 28 Jun 2019 -

Neurocomputing

TL;DR: This survey focuses on recent DL approaches that have been proposed in the area of cybersecurity, namely intrusion detection, malware detection, phishing/spam detection, and website defacement detection.

...read moreread less

Journal ArticleDOI

Neural networks

Alberto Prieto, +6 more

- 19 Nov 2016 -

Neurocomputing

TL;DR: The development and evolution of different topics related to neural networks is described showing that the field has acquired maturity and consolidation, proven by its competitiveness in solving real-world problems.

...read moreread less

Journal ArticleDOI

Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers

Wenping Hu, +4 more

- 01 Mar 2015 -

Speech Communication

TL;DR: Experimental results on an isolated English word corpus recorded by non-native (L2) English learners show that the proposed GOP measure can improve the performance of GOP based mispronunciation detection approach, i.e., 7.4 % of the precision and recall rate are improved, compared with the conventional GOP estimated from GMM-HMM.

...read moreread less

Journal ArticleDOI

The role of big data analytics in industrial Internet of Things

Muhammad Habib ur Rehman, +5 more

- 01 Oct 2019 -

Future Generation Computer Systems

TL;DR: In this paper, the authors investigated the recent BDA technologies, algorithms and techniques that can lead to the development of intelligent Industrial Internet of Things (IIoT) systems and identified the indispensable challenges that remain to be addressed as future research directions as well.

...read moreread less

Posted Content

Parallel training of DNNs with Natural Gradient and Parameter Averaging

Daniel Povey, +2 more

- 27 Oct 2014 -

arXiv: Neural and Evolutionary Computing

TL;DR: Another method is described, an approximate and efficient implementation of Natural Gradient for Stochastic Gradient Descent (NG-SGD), which seems to allow the periodic-averaging method to work well, as well as substantially improving the convergence of SGD on a single machine.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Book

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

Judea Pearl

TL;DR: Probabilistic Reasoning in Intelligent Systems as mentioned in this paper is a complete and accessible account of the theoretical foundations and computational methods that underlie plausible reasoning under uncertainty, and provides a coherent explication of probability as a language for reasoning with partial belief.

...read moreread less

Journal ArticleDOI

Shape matching and object recognition using shape contexts

Serge Belongie, +2 more

- 01 Apr 2002 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents work on computing shape models that are computationally fast and invariant basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.

...read moreread less

Journal ArticleDOI

Training products of experts by minimizing contrastive divergence

Geoffrey E. Hinton

- 01 Aug 2002 -

Neural Computation

TL;DR: A product of experts (PoE) is an interesting candidate for a perceptual system in which rapid inference is vital and generation is unnecessary because it is hard even to approximate the derivatives of the renormalization term in the combination rule.

...read moreread less

Proceedings ArticleDOI

Best practices for convolutional neural networks applied to visual document analysis

Patrice Y. Simard, +2 more

TL;DR: A set of concrete bestpractices that document analysis researchers can use to get good results with neural networks, including a simple "do-it-yourself" implementation of convolution with a flexible architecture suitable for many visual document problems.

...read moreread less