scispace - formally typeset
Proceedings ArticleDOI

Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

TLDR
Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.
Abstract: 
The large pose discrepancy between two face images is one of the key challenges in face recognition. Conventional approaches for pose-invariant face recognition either perform face frontalization on, or learn a pose-invariant representation from, a non-frontal face image. We argue that it is more desirable to perform both tasks jointly to allow them to leverage each other. To this end, this paper proposes Disentangled Representation learning-Generative Adversarial Network (DR-GAN) with three distinct novelties. First, the encoder-decoder structure of the generator allows DR-GAN to learn a generative and discriminative representation, in addition to image synthesis. Second, this representation is explicitly disentangled from other face variations such as pose, through the pose code provided to the decoder and pose estimation in the discriminator. Third, DR-GAN can take one or multiple images as the input, and generate one unified representation along with an arbitrary number of synthetic images. Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Deep Facial Expression Recognition: A Survey

TL;DR: A comprehensive survey on deep facial expression recognition (FER) can be found in this article, including datasets and algorithms that provide insights into the intrinsic problems of deep FER, including overfitting caused by lack of sufficient training data and expression-unrelated variations, such as illumination, head pose and identity bias.
Proceedings ArticleDOI

Interpreting the Latent Space of GANs for Semantic Face Editing

TL;DR: This work proposes a novel framework, called InterFaceGAN, for semantic face editing by interpreting the latent semantics learned by GANs, and finds that the latent code of well-trained generative models actually learns a disentangled representation after linear transformations.
Journal ArticleDOI

Deep learning on image denoising: An overview.

TL;DR: A comparative study of deep techniques in image denoising by classifying the deep convolutional neural networks for additive white noisy images, the deep CNNs for real noisy images; the deepCNNs for blind Denoising and the deep network for hybrid noisy images.
Proceedings ArticleDOI

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

TL;DR: Tang et al. as discussed by the authors proposed a Two-Pathway Generative Adversarial Network (TP-GAN) for photorealistic frontal view synthesis by simultaneously perceiving global structures and local details.
Proceedings ArticleDOI

Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision

TL;DR: This paper argues the importance of auxiliary supervision to guide the learning toward discriminative and generalizable cues, and introduces a new face anti-spoofing database that covers a large range of illumination, subject, and pose variations.
References
More filters
Proceedings ArticleDOI

Fisher vector encoded deep convolutional features for unconstrained face verification

TL;DR: Evaluations on two challenging verification datasets show that the proposed FV-DCNN method is able to capture the salient local features and also performs well when compared to many state-of-the-art face verification methods.
Proceedings ArticleDOI

Improving face recognition with a quality-based probabilistic framework

TL;DR: This paper addresses the problem of developing facial image quality metrics that are predictive of the performance of existing biometric matching algorithms and incorporating the quality estimates into the recognition decision process to improve overall performance by employing a Bayesian network.
Proceedings ArticleDOI

Optimal Pose for Face Recognition

TL;DR: A number of algorithms are used to evaluate face recognition performance when various poses are used for training, and it is found that the 3/4 view is the best, justified by the discrimination power of different regions on the face, computed from both the appearance and the geometry of these regions.
Posted Content

Ways of Conditioning Generative Adversarial Networks

Hanock Kwak, +1 more
- 04 Nov 2016 - 
TL;DR: This work proposes novel methods of conditioning generative adversarial networks (GANs) that achieve state-of-the-art results on MNIST and CIFAR-10 and introduces two models: an information retrieving model that extracts conditional information from the samples, and a spatial bilinear pooling model that forms Bilinear features derived from the spatial cross product of an image and a condition vector.
Related Papers (5)