Deep Learning Face Attributes in the Wild

doi:10.1109/ICCV.2015.425

Proceedings Article (Open Access)

Ziwei Liu, +3 more

- pp 3730-3738

TLDR

A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.

Abstract:

Predicting face attributes in the wild is challenging due to complex face variations. We propose a novel deep learning framework for attribute prediction in the wild. It cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags but pre-trained differently. LNet is pre-trained on massive general object categories for face localization, while ANet is pre-trained on massive face identities for attribute prediction. This framework not only outperforms the state of the art by a large margin, but also reveals valuable facts about learning face representations. (1) It shows how the performance of face localization (LNet) and attribute prediction (ANet) can be improved by different pre-training strategies. (2) It reveals that although the filters of LNet are fine-tuned only with image-level attribute tags, their response maps over entire images strongly indicate face locations. This fact enables training LNet for face localization with only image-level annotations, but without face bounding boxes or landmarks, which are required by all attribute recognition works. (3) It also demonstrates that the high-level hidden neurons of ANet automatically discover semantic concepts after pre-training with massive face identities, and that such concepts are significantly enriched after fine-tuning with attribute tags. Each attribute can be well explained by a sparse linear combination of these concepts.
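Point (2) of the abstract says that response maps over the whole image strongly indicate where the face is. The sketch below is only an illustration of that idea, not the paper's localization procedure: it assumes a response map is given as a 2D grid of non-negative numbers and turns it into a bounding box by keeping the cells above a fraction of the peak response. The function name `box_from_response_map`, the `rel_threshold` parameter, and the toy map are all hypothetical.

```python
def box_from_response_map(response, rel_threshold=0.5):
    """Return (top, left, bottom, right) enclosing all cells whose value
    is at least rel_threshold times the peak response.

    Illustrative assumption: `response` is a non-empty list of equal-length
    rows of non-negative numbers. This is not the method used in the paper.
    """
    peak = max(max(row) for row in response)
    cutoff = rel_threshold * peak
    # Rows and columns that contain at least one strong response.
    rows = [r for r, row in enumerate(response)
            if any(v >= cutoff for v in row)]
    cols = [c for c in range(len(response[0]))
            if any(row[c] >= cutoff for row in response)]
    return rows[0], cols[0], rows[-1], cols[-1]


# Toy 6x8 map: a strong 3x3 blob (the "face") plus one weak stray response.
toy = [[0.0] * 8 for _ in range(6)]
for r in range(2, 5):
    for c in range(3, 6):
        toy[r][c] = 1.0
toy[0][0] = 0.2  # below the cutoff, so it is ignored

print(box_from_response_map(toy))  # (2, 3, 4, 5)
```

The relative threshold is a design choice made for this sketch; it keeps the example independent of the absolute scale of the responses.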

Citations

Posted Content

GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium

Martin Heusel, +5 more

- 26 Jun 2017 -

arXiv: Learning

TL;DR: Proposes a two time-scale update rule (TTUR) for training GANs with stochastic gradient descent on arbitrary GAN loss functions; TTUR uses separate learning rates for the discriminator and the generator.

Journal Article

Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks

Kaipeng Zhang, +3 more

- 26 Aug 2016 -

IEEE Signal Processing Letters

TL;DR: Proposes a deep cascaded multitask framework that exploits the inherent correlation between face detection and alignment to improve the performance of both. It uses a cascaded architecture with three stages of carefully designed deep convolutional networks to predict face and landmark locations in a coarse-to-fine manner.

Book Chapter

A Discriminative Feature Learning Approach for Deep Face Recognition

Yandong Wen, +3 more

TL;DR: This paper proposes a new supervision signal, called center loss, for the face recognition task. It simultaneously learns a center for the deep features of each class and penalizes the distances between the deep features and their corresponding class centers.

Proceedings Article

StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation

Yunjey Choi, +5 more

TL;DR: StarGAN is a unified model architecture that performs image-to-image translation across multiple domains using only a single model. This leads to higher-quality translated images than existing models, as well as the ability to flexibly translate an input image to any desired target domain.

Proceedings Article

InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, +5 more

TL;DR: InfoGAN is an information-theoretic extension to the GAN that learns disentangled representations in a completely unsupervised manner. On the CelebA face dataset, it discovers visual concepts that include hair styles, presence of eyeglasses, and emotions.

References

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: A deep convolutional neural network achieves state-of-the-art performance on ImageNet classification. It consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

Proceedings Article

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

Journal Article

LIBLINEAR: A Library for Large Linear Classification

Rong-En Fan, +4 more

- 01 Jun 2008 -

Journal of Machine Learning Research

TL;DR: LIBLINEAR is an open source library for large-scale linear classification that supports logistic regression and linear support vector machines and provides easy-to-use command-line tools and library calls for users and developers.

Proceedings Article

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Yaniv Taigman, +3 more

TL;DR: This work revisits both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a face representation from a nine-layer deep neural network.

Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments

Gary B. Huang, +3 more

TL;DR: The database contains labeled face photographs spanning the range of conditions typically encountered in everyday life, and exhibits “natural” variability in factors such as pose, lighting, race, accessories, occlusions, and background.

Related Papers (5)

Generative Adversarial Nets

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Image-to-Image Translation with Conditional Adversarial Networks

Auto-Encoding Variational Bayes