Open Access · Posted Content

Disentangling Factors of Variation via Generative Entangling

TLDR
This work proposes a novel model family based on the spike-and-slab restricted Boltzmann machine, which is generalized to include higher-order interactions among multiple latent variables, and applies it to the task of facial expression classification.
Abstract
Here we propose a novel model family with the objective of learning to disentangle the factors of variation in data. Our approach is based on the spike-and-slab restricted Boltzmann machine, which we generalize to include higher-order interactions among multiple latent variables. Seen from a generative perspective, the multiplicative interactions emulate the entangling of factors of variation; inference in the model can be seen as disentangling these generative factors. Unlike previous attempts at disentangling latent factors, the proposed model is trained using no supervised information regarding the latent factors. We apply our model to the task of facial expression classification.
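The multiplicative "entangling" interactions described in the abstract can be illustrated with a minimal sketch of a factored three-way energy term, a common way to parameterize higher-order interactions in Boltzmann-machine variants. Everything here — the variable names, the dimensions, and the factored parameterization itself — is an illustrative assumption, not the paper's exact formulation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: visible units, two groups of binary latent
# "spike" variables, and F factors for the factored three-way tensor.
nv, nh, ng, nf = 6, 4, 3, 5

# The full three-way tensor W[i, j, k] is approximated as
# sum_f Wv[i, f] * Wh[j, f] * Wg[k, f], so the energy couples all
# three groups multiplicatively through shared factors.
Wv = rng.normal(size=(nv, nf))
Wh = rng.normal(size=(nh, nf))
Wg = rng.normal(size=(ng, nf))

def three_way_energy(v, h, g):
    """Multiplicative energy term: -sum_f (v @ Wv)_f (h @ Wh)_f (g @ Wg)_f."""
    return -np.sum((v @ Wv) * (h @ Wh) * (g @ Wg))

v = rng.normal(size=nv)                           # real-valued visible vector
h = rng.integers(0, 2, size=nh).astype(float)     # binary latent group 1
g = rng.integers(0, 2, size=ng).astype(float)     # binary latent group 2

print(three_way_energy(v, h, g))
```

Because each latent group enters the energy only through a product with the others, generating data "entangles" the groups, while inference over `h` and `g` given `v` plays the role of disentangling them — the generative/inference duality the abstract describes.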


Citations
Proceedings Article

beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework

TL;DR: In this article, a modification of the variational autoencoder (VAE) framework is proposed to learn interpretable factorised latent representations from raw image data in a completely unsupervised manner.
Posted Content

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

TL;DR: InfoGAN as mentioned in this paper is a generative adversarial network that maximizes the mutual information between a small subset of the latent variables and the observation, which can be interpreted as a variation of the Wake-Sleep algorithm.
Proceedings Article

InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

TL;DR: InfoGAN as mentioned in this paper is an information-theoretic extension to the GAN that is able to learn disentangled representations in a completely unsupervised manner, and it also discovers visual concepts that include hair styles, presence of eyeglasses, and emotions on the CelebA face dataset.
Posted Content

A Style-Based Generator Architecture for Generative Adversarial Networks

TL;DR: This article proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.
Proceedings Article

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations

TL;DR: The authors show that the unsupervised learning of disentangled representations is fundamentally impossible without inductive biases on both the models and the data, and suggest that future work on disentanglement learning should be explicit about the role of inductive bias and (implicit) supervision.
References
Proceedings Article (DOI)

A unified architecture for natural language processing: deep neural networks with multitask learning

TL;DR: This work describes a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense using a language model.
Proceedings Article

Deep Boltzmann machines

TL;DR: A new learning algorithm for Boltzmann machines with many layers of hidden variables, made more efficient by a layer-by-layer "pre-training" phase that allows variational inference to be initialized with a single bottom-up pass.
Proceedings Article

An analysis of single-layer networks in unsupervised feature learning

TL;DR: In this paper, the authors show that the number of hidden nodes in the model may be more important to achieving high performance than the learning algorithm or the depth of the model; they apply several off-the-shelf feature learning algorithms (sparse auto-encoders, sparse RBMs, K-means clustering, and Gaussian mixtures) to the CIFAR, NORB, and STL datasets using only single-layer networks.
Proceedings Article (DOI)

Evaluation of local spatio-temporal features for action recognition

TL;DR: It is demonstrated that regular sampling of space-time features consistently outperforms all tested space-time interest point detectors for human actions in realistic settings, and that the ranking of the majority of methods is consistent across different datasets.
Book Chapter (DOI)

Transforming auto-encoders

TL;DR: It is argued that neural networks can be used to learn features that output a whole vector of instantiation parameters and this is a much more promising way of dealing with variations in position, orientation, scale and lighting than the methods currently employed in the neural networks community.