Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

doi:10.1109/CVPR.2017.141

Proceedings ArticleDOI

Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

- pp 1283-1292

TLDR

Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.

Abstract:

The large pose discrepancy between two face images is one of the key challenges in face recognition. Conventional approaches for pose-invariant face recognition either perform face frontalization on, or learn a pose-invariant representation from, a non-frontal face image. We argue that it is more desirable to perform both tasks jointly to allow them to leverage each other. To this end, this paper proposes Disentangled Representation learning-Generative Adversarial Network (DR-GAN) with three distinct novelties. First, the encoder-decoder structure of the generator allows DR-GAN to learn a generative and discriminative representation, in addition to image synthesis. Second, this representation is explicitly disentangled from other face variations such as pose, through the pose code provided to the decoder and pose estimation in the discriminator. Third, DR-GAN can take one or multiple images as the input, and generate one unified representation along with an arbitrary number of synthetic images. Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Facial Expression Recognition: A Survey

Shan Li, +1 more

- 23 Apr 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A comprehensive survey on deep facial expression recognition (FER) can be found in this article, including datasets and algorithms that provide insights into the intrinsic problems of deep FER, including overfitting caused by lack of sufficient training data and expression-unrelated variations, such as illumination, head pose and identity bias.

...read moreread less

Proceedings ArticleDOI

Interpreting the Latent Space of GANs for Semantic Face Editing

Yujun Shen, +3 more

TL;DR: This work proposes a novel framework, called InterFaceGAN, for semantic face editing by interpreting the latent semantics learned by GANs, and finds that the latent code of well-trained generative models actually learns a disentangled representation after linear transformations.

...read moreread less

Journal ArticleDOI

Deep learning on image denoising: An overview.

Chunwei Tian, +5 more

- 01 Nov 2020 -

Neural Networks

TL;DR: A comparative study of deep techniques in image denoising by classifying the deep convolutional neural networks for additive white noisy images, the deep CNNs for real noisy images; the deepCNNs for blind Denoising and the deep network for hybrid noisy images.

...read moreread less

Proceedings ArticleDOI

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

Rui Huang, +3 more

TL;DR: Tang et al. as discussed by the authors proposed a Two-Pathway Generative Adversarial Network (TP-GAN) for photorealistic frontal view synthesis by simultaneously perceiving global structures and local details.

...read moreread less

Proceedings ArticleDOI

Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision

Yaojie Liu, +2 more

TL;DR: This paper argues the importance of auxiliary supervision to guide the learning toward discriminative and generalizable cues, and introduces a new face anti-spoofing database that covers a large range of illumination, subject, and pose variations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Christian Ledig, +10 more

TL;DR: SRGAN as mentioned in this paper proposes a perceptual loss function which consists of an adversarial loss and a content loss, which pushes the solution to the natural image manifold using a discriminator network that is trained to differentiate between the super-resolved images and original photo-realistic images.

...read moreread less

Proceedings ArticleDOI

Deep face recognition

Omkar M. Parkhi, +2 more

TL;DR: It is shown how a very large scale dataset can be assembled by a combination of automation and human in the loop, and the trade off between data purity and time is discussed.

...read moreread less

Posted Content

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Christian Ledig, +10 more

- 15 Sep 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: SRGAN, a generative adversarial network (GAN) for image super-resolution (SR), is presented, to its knowledge, the first framework capable of inferring photo-realistic natural images for 4x upscaling factors and a perceptual loss function which consists of an adversarial loss and a content loss.

...read moreread less

Posted Content

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

Xi Chen, +5 more

- 12 Jun 2016 -

arXiv: Learning

TL;DR: InfoGAN as mentioned in this paper is a generative adversarial network that maximizes the mutual information between a small subset of the latent variables and the observation, which can be interpreted as a variation of the Wake-Sleep algorithm.

...read moreread less

Proceedings Article

InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, +5 more

TL;DR: InfoGAN as mentioned in this paper is an information-theoretic extension to the GAN that is able to learn disentangled representations in a completely unsupervised manner, and it also discovers visual concepts that include hair styles, presence of eyeglasses, and emotions on the CelebA face dataset.

...read moreread less

Collapse

Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

Citations

Deep Facial Expression Recognition: A Survey

Interpreting the Latent Space of GANs for Semantic Face Editing

Deep learning on image denoising: An overview.

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision

References

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Deep face recognition

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

Related Papers (5)

Generative Adversarial Nets

Deep Residual Learning for Image Recognition

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

FaceNet: A unified embedding for face recognition and clustering

Image-to-Image Translation with Conditional Adversarial Networks