scispace - formally typeset
Proceedings ArticleDOI

Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

TLDR
Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.
Abstract
The large pose discrepancy between two face images is one of the key challenges in face recognition. Conventional approaches for pose-invariant face recognition either perform face frontalization on, or learn a pose-invariant representation from, a non-frontal face image. We argue that it is more desirable to perform both tasks jointly to allow them to leverage each other. To this end, this paper proposes Disentangled Representation learning-Generative Adversarial Network (DR-GAN) with three distinct novelties. First, the encoder-decoder structure of the generator allows DR-GAN to learn a generative and discriminative representation, in addition to image synthesis. Second, this representation is explicitly disentangled from other face variations such as pose, through the pose code provided to the decoder and pose estimation in the discriminator. Third, DR-GAN can take one or multiple images as the input, and generate one unified representation along with an arbitrary number of synthetic images. Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Learning from Dances: Pose-Invariant Re-Identification for Multi-Person Tracking

TL;DR: This work proposes an end-to-end deep learning framework Sparse-Temporal ReID Network, which not only realizes human pose disentanglement in an image recovery manner, but also makes efficient linkages between the identical subjects via a unique Sparse temporal identity sampling technique across time steps.
Journal ArticleDOI

The Untold Secrets of WiFi-Calling Services: Vulnerabilities, Attacks, and Countermeasures

TL;DR: It is disclosed that current Wi-Fi calling security is not bullet-proof and three vulnerabilities are uncovered, and two proof-of-concept attacks are devised: telephony harassment or denial of voice service and user privacy leakage; both can bypass the existing security defenses.
Journal ArticleDOI

PGM-face: Pose-guided margin loss for cross-pose face recognition

TL;DR: Compared with pose-robust face representation learning methods, the proposed Pose-Guided Margin Loss (PGM-Face) extends the dimensions of the linear transformation matrix for each class to estimate the head poses, therefore the learned features of each class are soft clustered guided by the head pose.
Journal ArticleDOI

Frontal-Centers Guided Face: Boosting Face Recognition by Learning Pose-Invariant Features

TL;DR: This work proposes a novel Frontal-Centers Guided Loss (FCGFace) to obtain highly discriminative features for face recognition, capable of adaptively adjusting the distribution of profile face features and narrowing the gap between them and frontal face features during different training stages to form compact identity clusters.
Proceedings ArticleDOI

A Hybrid Network for Facial Age Progression and Regression Learning

TL;DR: Experiments show that the proposed network can generates better facial age images with more age traits compared with other state-of-the-art approaches.
References
More filters
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI

Generative Adversarial Nets

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Journal ArticleDOI

Representation Learning: A Review and New Perspectives

TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.
Proceedings ArticleDOI

FaceNet: A unified embedding for face recognition and clustering

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Posted Content

Conditional Generative Adversarial Nets

Mehdi Mirza, +1 more
- 06 Nov 2014 - 
TL;DR: The conditional version of generative adversarial nets is introduced, which can be constructed by simply feeding the data, y, to the generator and discriminator, and it is shown that this model can generate MNIST digits conditioned on class labels.
Related Papers (5)