Proceedings ArticleDOI
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition
Luan Tran,Xi Yin,Xiaoming Liu +2 more
- pp 1283-1292
TLDR
Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.Abstract:
The large pose discrepancy between two face images is one of the key challenges in face recognition. Conventional approaches for pose-invariant face recognition either perform face frontalization on, or learn a pose-invariant representation from, a non-frontal face image. We argue that it is more desirable to perform both tasks jointly to allow them to leverage each other. To this end, this paper proposes Disentangled Representation learning-Generative Adversarial Network (DR-GAN) with three distinct novelties. First, the encoder-decoder structure of the generator allows DR-GAN to learn a generative and discriminative representation, in addition to image synthesis. Second, this representation is explicitly disentangled from other face variations such as pose, through the pose code provided to the decoder and pose estimation in the discriminator. Third, DR-GAN can take one or multiple images as the input, and generate one unified representation along with an arbitrary number of synthetic images. Quantitative and qualitative evaluation on both controlled and in-the-wild databases demonstrate the superiority of DR-GAN over the state of the art.read more
Citations
More filters
Posted Content
Self-Supervised Learning Disentangled Group Representation as Feature.
TL;DR: Wang et al. as mentioned in this paper proposed an iterative partition-based invariant risk minimization (IP-IRM) algorithm, which successfully grounds the abstract semantics and the group acting on them into concrete contrastive learning.
Journal ArticleDOI
Toward Robust Facial Authentication for Low-Power Edge-AI Consumer Devices
TL;DR: In this paper , the authors explored a number of approaches to test the robustness of a state-of-the-art facial recognition (FR) technique, Arcface, for low-power, on-device facial authentication.
MaLP: Manipulation Localization Using a Proactive Scheme – Supplementary material –
TL;DR: In this article , the authors train MaLP for 150, 000 iterations with a batch size of 4 and the learning rate is 1e−5 for all networks, except for the transformer which uses AdamW.
Journal ArticleDOI
CADA-GAN: Context-Aware GAN with Data Augmentation
TL;DR: This paper proposed CADA-GAN, a C ontext-a-ware GAN that allows optimal feature extraction, with added robustness from additional D ata-A ugmentation, adapted from the popular StyleGAN2-Ada model, with attention on augmentation and segmentation of the parent images.
Journal ArticleDOI
Cross-Domain Attention and Center Loss for Sketch Re-Identification
TL;DR: Wang et al. as mentioned in this paper proposed a cross-domain attention (CDA) mechanism, which uses different ways of splitting feature maps in its two branches and calculates the relationship between different parts in the sketch images and RGB photos.
References
More filters
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Journal ArticleDOI
Representation Learning: A Review and New Perspectives
TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.
Proceedings ArticleDOI
FaceNet: A unified embedding for face recognition and clustering
TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Posted Content
Conditional Generative Adversarial Nets
Mehdi Mirza,Simon Osindero +1 more
TL;DR: The conditional version of generative adversarial nets is introduced, which can be constructed by simply feeding the data, y, to the generator and discriminator, and it is shown that this model can generate MNIST digits conditioned on class labels.