Top 7 papers published by Ian Goodfellow from Google in 2012

Posted Content•

Theano: new features and speed improvements

[...]

Frédéric Bastien, Pascal Lamblin¹, Razvan Pascanu, James Bergstra, Ian Goodfellow¹, Arnaud Bergeron¹, Nicolas Bouchard, David Warde-Farley¹, Yoshua Bengio¹ - Show less +5 more•Institutions (1)

Université de Montréal¹

23 Nov 2012-arXiv: Symbolic Computation

TL;DR: New features and efficiency improvements to Theano are presented, and benchmarks demonstrating Theano's performance relative to Torch7, a recently introduced machine learning library, and to RNNLM, a C++ library targeted at recurrent neural networks.

...read moreread less

Abstract: Theano is a linear algebra compiler that optimizes a user's symbolically-specified mathematical computations to produce efficient low-level implementations. In this paper, we present new features and efficiency improvements to Theano, and benchmarks demonstrating Theano's performance relative to Torch7, a recently introduced machine learning library, and to RNNLM, a C++ library targeted at recurrent neural networks.

...read moreread less

1,437 citations

Theano: Deep Learning on GPUs with Python

[...]

James Bergstra, Frédéric Bastien, Olivier Breuleux, Pascal Lamblin, Razvan Pascanu, Olivier Delalleau, Guillaume Desjardins, David Warde-Farley, Ian Goodfellow, Arnaud Bergeron, Yoshua Bengio - Show less +7 more

01 Jan 2012

TL;DR: This paper presents Theano, a framework in the Python programming language for defining, optimizing and evaluating expressions involving high-level operations on tensors, and adds automatic symbolic differentiation, GPU support, and faster expression evaluation.

...read moreread less

Abstract: In this paper, we present Theano 1 , a framework in the Python programming language for defining, optimizing and evaluating expressions involving high-level operations on tensors. Theano offers most of NumPy’s functionality, but adds automatic symbolic differentiation, GPU support, and faster expression evaluation. Theano is a general mathematical tool, but it was developed with the goal of facilitating research in deep learning. The Deep Learning Tutorials 2 introduce recent advances in deep learning, and showcase how Theano

...read moreread less

249 citations

Proceedings Article•

Unsupervised and Transfer Learning Challenge: a Deep Learning Approach

[...]

Grégoire Mesnil¹, Yann N. Dauphin¹, Xavier Glorot¹, Salah Rifai¹, Yoshua Bengio¹, Ian Goodfellow¹, Erick Lavoie¹, Xavier Muller¹, Guillaume Desjardins¹, David Warde-Farley¹, Pascal Vincent¹, Aaron Courville¹, James Bergstra² - Show less +9 more•Institutions (2)

Université de Montréal¹, Harvard University²

27 Jun 2012

TL;DR: In this article, the authors describe different kinds of layers they trained for learning representations in the setting of the Unsupervised and Transfer Learning Challenge, and the strategy of their team won the final phase of the challenge.

...read moreread less

Abstract: Learning good representations from a large set of unlabeled data is a particularly challenging task. Recent work (see Bengio (2009) for a review) shows that training deep architectures is a good way to extract such representations, by extracting and disentangling gradually higher-level factors of variation characterizing the input distribution. In this paper, we describe different kinds of layers we trained for learning representations in the setting of the Unsupervised and Transfer Learning Challenge. The strategy of our team won the final phase of the challenge. It combined and stacked different one-layer unsupervised learning algorithms, adapted to each of the five datasets of the competition. This paper describes that strategy and the particular one-layer learning algorithms feeding a simple linear classifier with a tiny number of labeled training samples (1 to 64 per class).

...read moreread less

199 citations

Proceedings Article•

Large-Scale Feature Learning With Spike-and-Slab Sparse Coding

[...]

Ian Goodfellow¹, Yoshua Bengio¹, Aaron Courville¹•Institutions (1)

Université de Montréal¹

26 Jun 2012

TL;DR: This work introduces a new feature learning and extraction procedure based on a factor model the authors call spike-and-slab sparse coding (S3C), and presents a novel inference procedure for appropriate for use with GPUs which allows to dramatically increase both the training set size and the amount of latent factors that S3C may be trained with.

...read moreread less

Abstract: We consider the problem of object recognition with a large number of classes. In order to overcome the low amount of labeled examples available in this setting, we introduce a new feature learning and extraction procedure based on a factor model we call spike-and-slab sparse coding (S3C). Prior work on S3C has not prioritized the ability to exploit parallel architectures and scale S3C to the enormous problem sizes needed for object recognition. We present a novel inference procedure for appropriate for use with GPUs which allows us to dramatically increase both the training set size and the amount of latent factors that S3C may be trained with. We demonstrate that this approach improves upon the supervised learning capabilities of both sparse coding and the spike-and-slab Restricted Boltzmann Machine (ssRBM) on the CIFAR-10 dataset. We use the CIFAR-100 dataset to demonstrate that our method scales to large numbers of classes better than previous methods. Finally, we use our method to win the NIPS 2011 Workshop on Challenges In Learning Hierarchical Models' Transfer Learning Challenge.

...read moreread less

89 citations

Posted Content•

Large-Scale Feature Learning With Spike-and-Slab Sparse Coding

[...]

Ian Goodfellow¹, Aaron Courville¹, Yoshua Bengio¹•Institutions (1)

Université de Montréal¹

27 Jun 2012-arXiv: Learning

TL;DR: Spike-and-slab sparse coding (S3C) as discussed by the authors is a feature learning and extraction procedure based on a factor model for object recognition with a large number of classes.

...read moreread less

Abstract: We consider the problem of object recognition with a large number of classes. In order to overcome the low amount of labeled examples available in this setting, we introduce a new feature learning and extraction procedure based on a factor model we call spike-and-slab sparse coding (S3C). Prior work on S3C has not prioritized the ability to exploit parallel architectures and scale S3C to the enormous problem sizes needed for object recognition. We present a novel inference procedure for appropriate for use with GPUs which allows us to dramatically increase both the training set size and the amount of latent factors that S3C may be trained with. We demonstrate that this approach improves upon the supervised learning capabilities of both sparse coding and the spike-and-slab Restricted Boltzmann Machine (ssRBM) on the CIFAR-10 dataset. We use the CIFAR-100 dataset to demonstrate that our method scales to large numbers of classes better than previous methods. Finally, we use our method to win the NIPS 2011 Workshop on Challenges In Learning Hierarchical Models? Transfer Learning Challenge.

...read moreread less

61 citations

Posted Content•

Spike-and-Slab Sparse Coding for Unsupervised Feature Discovery

[...]

Ian Goodfellow, Aaron Courville, Yoshua Bengio

16 Jan 2012-arXiv: Machine Learning

TL;DR: This work derives a structured variational inference procedure and employs a variational EM training algorithm to improve upon the supervised learning capabilities of both sparse coding and the ssRBM on the CIFAR-10 dataset.

...read moreread less

Abstract: We consider the problem of using a factor model we call {\em spike-and-slab sparse coding} (S3C) to learn features for a classification task. The S3C model resembles both the spike-and-slab RBM and sparse coding. Since exact inference in this model is intractable, we derive a structured variational inference procedure and employ a variational EM training algorithm. Prior work on approximate inference for this model has not prioritized the ability to exploit parallel architectures and scale to enormous problem sizes. We present an inference procedure appropriate for use with GPUs which allows us to dramatically increase both the training set size and the amount of latent factors. We demonstrate that this approach improves upon the supervised learning capabilities of both sparse coding and the ssRBM on the CIFAR-10 dataset. We evaluate our approach's potential for semi-supervised learning on subsets of CIFAR-10. We demonstrate state-of-the art self-taught learning performance on the STL-10 dataset and use our method to win the NIPS 2011 Workshop on Challenges In Learning Hierarchical Models' Transfer Learning Challenge.

...read moreread less

61 citations

Posted Content•

Joint Training of Deep Boltzmann Machines

[...]

Ian Goodfellow, Aaron Courville, Yoshua Bengio

12 Dec 2012-arXiv: Machine Learning

TL;DR: A new method for training deep Boltzmann machines jointly is introduced that requires an initial learning pass that trains the deep BoltZmann machine greedily, one layer at a time, or do not perform well on classifi- cation tasks.

...read moreread less

Abstract: We introduce a new method for training deep Boltzmann machines jointly. Prior methods require an initial learning pass that trains the deep Boltzmann machine greedily, one layer at a time, or do not perform well on classifi- cation tasks.

...read moreread less

30 citations

Showing papers by "Ian Goodfellow published in 2012"