Open Access · Posted Content
Improving Variational Inference with Inverse Autoregressive Flow
TL;DR
This paper proposes a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces, and demonstrates that a novel type of variational autoencoder, coupled with IAF, is competitive with neural autoregressive models in terms of attained log-likelihood on natural images, while allowing significantly faster synthesis.
Abstract
The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. The proposed flow consists of a chain of invertible transformations, where each transformation is based on an autoregressive neural network. In experiments, we show that IAF significantly improves upon diagonal Gaussian approximate posteriors. In addition, we demonstrate that a novel type of variational autoencoder, coupled with IAF, is competitive with neural autoregressive models in terms of attained log-likelihood on natural images, while allowing significantly faster synthesis.
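As a concrete illustration of one such transformation, below is a minimal numpy sketch of a single IAF step. It assumes the shift m and gate pre-activations s come from an autoregressive (e.g. MADE-style) network, so element i of each depends only on z[:i] and the Jacobian stays triangular; the gated form follows the numerically stable update described in the paper, but this is an illustrative sketch, not the authors' implementation.

```python
import numpy as np

def iaf_step(z, m, s):
    """One inverse autoregressive flow step (illustrative sketch).

    z    : latent sample from the previous step, shape (D,)
    m, s : autoregressive network outputs; element i of each may
           depend only on z[:i], so the Jacobian is lower triangular.
    Returns the transformed sample and the log-determinant that is
    subtracted from the approximate posterior's log-density.
    """
    sigma = 1.0 / (1.0 + np.exp(-s))          # sigmoid gate
    z_new = sigma * z + (1.0 - sigma) * m     # gated, GRU-like update
    log_det = np.sum(np.log(sigma))           # triangular Jacobian: sum of log-diagonals
    return z_new, log_det
```

Because the log-determinant reduces to a sum over the gate activations, each step costs time linear in the latent dimension, which is what lets IAF scale to high-dimensional latent spaces.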
Citations
Proceedings Article
Conditional image synthesis with auxiliary classifier GANs
TL;DR: A variant of GANs employing label conditioning is constructed that yields 128 × 128 image samples exhibiting global coherence, and it is demonstrated that high-resolution samples provide class information not present in low-resolution samples.
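A hedged sketch of the generator objective this summary alludes to: a generated sample must both fool the source discriminator and be recognized as its conditioning class by the auxiliary classifier. The argument names are hypothetical stand-ins for the discriminator's outputs on a generated sample.

```python
import numpy as np

def acgan_generator_loss(p_real, class_log_probs, label):
    """AC-GAN-style generator objective (illustrative sketch).

    p_real          : discriminator's probability that the generated
                      sample is real
    class_log_probs : auxiliary classifier's log-probabilities
    label           : class label the generator was conditioned on
    """
    adversarial = -np.log(p_real)          # look real
    auxiliary = -class_log_probs[label]    # look like the right class
    return adversarial + auxiliary
```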
Posted Content
Density estimation using Real NVP
TL;DR: This work extends the space of probabilistic models using real-valued non-volume preserving (real NVP) transformations, a set of powerful invertible and learnable transformations, resulting in an unsupervised learning algorithm with exact log-likelihood computation, exact sampling, exact inference of latent variables, and an interpretable latent space.
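The key mechanism is the affine coupling layer, sketched below under the assumption of hypothetical scale and shift networks s_fn and t_fn: half the variables pass through unchanged and parameterize an invertible affine transform of the other half, so the Jacobian is triangular and the exact log-determinant is a cheap sum.

```python
import numpy as np

def coupling_layer(x, s_fn, t_fn):
    """Real NVP-style affine coupling layer (illustrative sketch)."""
    d = x.shape[0] // 2
    x1, x2 = x[:d], x[d:]
    s, t = s_fn(x1), t_fn(x1)        # scale and shift networks (assumed)
    y2 = x2 * np.exp(s) + t          # invertible given x1
    log_det = np.sum(s)              # exact log |det J|
    return np.concatenate([x1, y2]), log_det
```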
Proceedings Article
Weight normalization: a simple reparameterization to accelerate training of deep neural networks
Tim Salimans, Diederik P. Kingma
TL;DR: A reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction is presented, improving the conditioning of the optimization problem and speeding up convergence of stochastic gradient descent.
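The reparameterization itself is one line; a minimal sketch for a single weight vector follows (the actual method applies this per output unit and trains g and v jointly by gradient descent).

```python
import numpy as np

def weight_norm(v, g):
    """Weight normalization, w = g * v / ||v|| (illustrative sketch):
    the scalar g carries the length of the weight vector and v its
    direction, decoupling the two during optimization."""
    return g * v / np.linalg.norm(v)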
Posted Content
Adversarially Learned Inference
Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Olivier Mastropietro, Alex Lamb, Martin Arjovsky, Aaron Courville, et al.
TL;DR: The adversarially learned inference (ALI) model is introduced, which jointly learns a generation network and an inference network using an adversarial process; the usefulness of the learned representations is confirmed by performance competitive with the state of the art on semi-supervised SVHN and CIFAR10 tasks.
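A hedged sketch of the adversarial objective this describes: the discriminator scores joint (x, z) pairs and must separate encoder pairs (x, z ~ q(z|x)) from generator pairs (x ~ p(x|z), z ~ p(z)). The argument names are hypothetical discriminator outputs.

```python
import numpy as np

def ali_discriminator_loss(d_encoder_pairs, d_generator_pairs):
    """ALI-style discriminator loss over joint (x, z) pairs
    (illustrative sketch). Inputs are the discriminator's
    probabilities on batches of encoder and generator pairs."""
    return (-np.mean(np.log(d_encoder_pairs))
            - np.mean(np.log(1.0 - d_generator_pairs)))
```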
Posted Content
Unsupervised Image-to-Image Translation Networks
TL;DR: In this paper, the authors make a shared-latent space assumption and propose an unsupervised image-to-image translation framework based on Coupled GANs, which achieves state-of-the-art performance on benchmark datasets.
References
Proceedings Article
Deep Residual Learning for Image Recognition
TL;DR: The authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the approach won first place in the ILSVRC 2015 classification task.
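The core idea fits in a few lines; in the sketch below, f stands in for a small stack of convolution, normalization, and nonlinearity layers.

```python
import numpy as np

def residual_block(x, f):
    """Residual learning (illustrative sketch): the block fits a
    residual F(x) and outputs F(x) + x, so an identity mapping is
    trivially representable and very deep networks stay trainable."""
    return f(x) + x
```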
Journal Article
Long short-term memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
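A minimal numpy sketch of one LSTM step, with hypothetical stacked parameters W, U, b (each concatenating the input, forget, output, and candidate blocks); the additive cell-state update is the "constant error carousel" the summary refers to.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step (illustrative sketch). W, U, b stack the
    parameters of the four gates, so W has shape (4H, X) and
    U has shape (4H, H)."""
    i, f, o, g = np.split(W @ x + U @ h + b, 4)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)   # additive carousel
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new
```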
Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: Adam, a method for first-order gradient-based optimization of stochastic objective functions based on adaptive estimates of lower-order moments, is introduced.
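For reference, a sketch of one Adam update following the published pseudocode: m and v are exponential moving averages of the gradient and its square, and the bias correction compensates for their zero initialization.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update (sketch of the published pseudocode);
    t is the 1-based step count."""
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)            # bias-corrected first moment
    v_hat = v / (1 - b2 ** t)            # bias-corrected second moment
    return theta - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```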
Posted Content
Auto-Encoding Variational Bayes
Diederik P. Kingma, Max Welling
TL;DR: A stochastic variational inference and learning algorithm is proposed for directed probabilistic models with intractable posterior distributions that scales to large datasets.
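The central trick, sketched below: sampling z = mu + sigma * eps with eps ~ N(0, I) moves the randomness outside the parameters, so gradients flow through mu and log_var; for a diagonal Gaussian posterior, the KL term of the variational bound is also available in closed form.

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """Reparameterization trick (illustrative sketch):
    z = mu + sigma * eps, with eps ~ N(0, I)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def gaussian_kl(mu, log_var):
    """Closed-form KL(q(z|x) || N(0, I)) for a diagonal Gaussian."""
    return -0.5 * np.sum(1.0 + log_var - mu ** 2 - np.exp(log_var))
```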
Posted Content
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew W. Senior, Koray Kavukcuoglu
TL;DR: This paper proposes WaveNet, a deep neural network for generating audio waveforms that is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones.
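The autoregressive structure is the point: each sample is drawn from a distribution conditioned on all previous samples, so synthesis is sequential, which is the cost the IAF paper above contrasts with its faster synthesis. A hedged sketch, where predict stands in for the dilated causal convolution network and is assumed to return a categorical distribution over quantized amplitude levels:

```python
import numpy as np

def sample_autoregressively(predict, n_samples, rng):
    """WaveNet-style ancestral sampling (illustrative sketch):
    one forward pass per generated sample."""
    samples = []
    for _ in range(n_samples):
        probs = predict(np.array(samples))            # p(x_t | x_<t)
        samples.append(rng.choice(len(probs), p=probs))
    return np.array(samples)
```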