Categorical Reparameterization with Gumbel-Softmax

Open AccessProceedings Article

Categorical Reparameterization with Gumbel-Softmax

Chats0

TLDR

Gumbel-Softmax as mentioned in this paper replaces the non-differentiable samples from a categorical distribution with a differentiable sample from a novel Gumbel softmax distribution, which has the essential property that it can be smoothly annealed into the categorical distributions.

Abstract:

Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with categorical latent variables, and enables large speedups on semi-supervised classification.

Citations

PDF

Open Access

More filters

Posted Content

Improved Training of Wasserstein GANs

Ishaan Gulrajani, +4 more

- 31 Mar 2017 -

arXiv: Learning

TL;DR: This work proposes an alternative to clipping weights: penalize the norm of gradient of the critic with respect to its input, which performs better than standard WGAN and enables stable training of a wide variety of GAN architectures with almost no hyperparameter tuning.

...read moreread less

Proceedings Article

Improved training of wasserstein GANs

Ishaan Gulrajani, +4 more

TL;DR: The authors proposed to penalize the norm of the gradient of the critic with respect to its input to improve the training stability of Wasserstein GANs and achieve stable training of a wide variety of GAN architectures with almost no hyperparameter tuning.

...read moreread less

Posted Content

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Ryan Lowe, +5 more

- 07 Jun 2017 -

arXiv: Learning

TL;DR: An adaptation of actor-critic methods that considers action policies of other agents and is able to successfully learn policies that require complex multi-agent coordination is presented.

...read moreread less

Posted Content

NIPS 2016 Tutorial: Generative Adversarial Networks

Ian Goodfellow

- 31 Dec 2016 -

arXiv: Learning

TL;DR: This report summarizes the tutorial presented by the author at NIPS 2016 on generative adversarial networks (GANs), and describes state-of-the-art image models that combine GANs with other methods.

...read moreread less

Proceedings ArticleDOI

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search

Bichen Wu, +9 more

TL;DR: This work proposes a differentiable neural architecture search (DNAS) framework that uses gradient-based methods to optimize ConvNet architectures, avoiding enumerating and training individual architectures separately as in previous methods.

...read moreread less

Collapse

Neural Computation

Categorical Reparameterization with Gumbel-Softmax

Citations

Improved Training of Wasserstein GANs

Improved training of wasserstein GANs

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

NIPS 2016 Tutorial: Generative Adversarial Networks

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search

Related Papers (5)

Adam: A Method for Stochastic Optimization

Deep Residual Learning for Image Recognition

Attention is All you Need

Generative Adversarial Nets

Long short-term memory