Theano: A Python framework for fast computation of mathematical expressions

- 09 May 2016 -

TLDR

The paper compares the performance of Theano against Torch7 and TensorFlow on several machine learning models, and discusses recently introduced functionalities and improvements.

Abstract:

Theano is a Python library that allows users to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers, especially in the machine learning community, and has shown steady performance improvements. Theano has been actively and continuously developed since 2008; multiple frameworks have been built on top of it, and it has been used to produce many state-of-the-art machine learning models. The present article is structured as follows. Section I provides an overview of the Theano software and its community. Section II presents the principal features of Theano and how to use them, and compares them with other similar projects. Section III focuses on recently introduced functionalities and improvements. Section IV compares the performance of Theano against Torch7 and TensorFlow on several machine learning models. Section V discusses current limitations of Theano and potential ways of improving it.
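
The define, optimize, evaluate workflow named in the abstract can be illustrated with a small sketch. The code below is a toy written in plain Python, not Theano's API: the classes `Var`, `Const`, `Add`, `Mul` and the functions `optimize` and `evaluate` are hypothetical names chosen for this illustration, and the only "optimizations" are constant folding and removal of `x * 1` and `x + 0`. It works on scalars rather than multi-dimensional arrays.

```python
from dataclasses import dataclass
from typing import Dict, Union

# Hypothetical toy classes for illustration only; this is NOT Theano's API.

@dataclass(frozen=True)
class Var:
    name: str

@dataclass(frozen=True)
class Const:
    value: float

@dataclass(frozen=True)
class Add:
    left: "Expr"
    right: "Expr"

@dataclass(frozen=True)
class Mul:
    left: "Expr"
    right: "Expr"

Expr = Union[Var, Const, Add, Mul]


def optimize(e: Expr) -> Expr:
    """Rewrite the expression graph: fold constants, drop x * 1 and x + 0."""
    if isinstance(e, (Var, Const)):
        return e
    left, right = optimize(e.left), optimize(e.right)
    if isinstance(left, Const) and isinstance(right, Const):
        if isinstance(e, Add):
            return Const(left.value + right.value)
        return Const(left.value * right.value)
    if isinstance(e, Add):
        if left == Const(0.0):
            return right
        if right == Const(0.0):
            return left
        return Add(left, right)
    # Remaining case: e is a Mul node.
    if left == Const(1.0):
        return right
    if right == Const(1.0):
        return left
    return Mul(left, right)


def evaluate(e: Expr, env: Dict[str, float]) -> float:
    """Compute the numeric value of the graph for the given variable values."""
    if isinstance(e, Var):
        return env[e.name]
    if isinstance(e, Const):
        return e.value
    left, right = evaluate(e.left, env), evaluate(e.right, env)
    return left + right if isinstance(e, Add) else left * right


# Define: build the symbolic expression (x * 1) + (2 * 3) without computing it.
x = Var("x")
expr = Add(Mul(x, Const(1.0)), Mul(Const(2.0), Const(3.0)))

# Optimize: the graph is simplified to x + 6 before any numbers are supplied.
fast = optimize(expr)

# Evaluate: bind x to a value and compute the result.
result = evaluate(fast, {"x": 4.0})
```

The point of the separation is that the rewrite happens once on the symbolic graph, while evaluation can then be repeated for many inputs on the simplified form.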
