High Resolution Face Editing with Masked GAN Latent Code Optimization.

Martin Pernus, +2 more

- 20 Mar 2021 -

arXiv: Computer Vision and Pattern Recog...

TLDR

MaskFaceGAN edits face images by directly optimizing the latent code of a pre-trained Generative Adversarial Network (i.e., StyleGAN2) with respect to several constraints that ensure preservation of relevant image content, generation of the targeted facial attributes, and spatially selective treatment of local image areas.

Abstract:

Face editing represents a popular research topic within the computer vision and image processing communities. While significant progress has been made recently in this area, existing solutions: (i) are still largely focused on low-resolution images, (ii) often generate editing results with visual artefacts, or (iii) lack fine-grained control and alter multiple (entangled) attributes at once when trying to generate the desired facial semantics. In this paper, we aim to address these issues through a novel attribute editing approach called MaskFaceGAN. The proposed approach is based on an optimization procedure that directly optimizes the latent code of a pre-trained (state-of-the-art) Generative Adversarial Network (i.e., StyleGAN2) with respect to several constraints that ensure: (i) preservation of relevant image content, (ii) generation of the targeted facial attributes, and (iii) spatially selective treatment of local image areas. The constraints are enforced with the help of a (differentiable) attribute classifier and face parser that provide the necessary reference information for the optimization procedure. MaskFaceGAN is evaluated in extensive experiments on the CelebA-HQ, Helen and SiblingsDB-HQf datasets and in comparison with several state-of-the-art techniques from the literature, i.e., StarGAN, AttGAN, STGAN, and two versions of InterFaceGAN. Our experimental results show that the proposed approach is able to edit face images with respect to several facial attributes with unprecedented image quality and at high resolutions (1024x1024), while exhibiting considerably fewer problems with attribute entanglement than competing solutions. The source code is made freely available from: this https URL.
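
The optimization idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the linear map A stands in for the pre-trained generator, a logistic model with weights v stands in for the differentiable attribute classifier, the binary mask is given directly rather than produced by a face parser, and plain gradient descent is used as the optimizer. All names (masked_latent_edit, lam, lr, steps) are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def masked_latent_edit(A, x, mask, v, w0, lam=1.0, lr=0.05, steps=500):
    """Toy masked latent-code optimization (illustrative assumptions only).

    A    : (d, k) matrix, hypothetical linear "generator" G(w) = A @ w
    x    : (d,) original image, flattened
    mask : (d,) binary mask, 1 = region to edit, 0 = region to preserve
    v    : (d,) weights of a hypothetical logistic attribute classifier
    w0   : (k,) initial latent code
    """
    w = w0.copy()
    keep = 1.0 - mask
    for _ in range(steps):
        img = A @ w
        # Preservation term: squared error outside the edited region.
        # For a binary mask, keep * keep == keep, hence the single factor.
        grad_img = 2.0 * keep * (img - x)
        # Attribute term: cross-entropy toward "attribute present",
        # evaluated on the masked region only.
        p = sigmoid(v @ (mask * img))
        grad_img = grad_img + lam * (p - 1.0) * mask * v
        # Chain rule through the linear generator, then a gradient step.
        w = w - lr * (A.T @ grad_img)
    return w


d = 6
# Off-diagonal entries make latent directions affect every pixel,
# a crude stand-in for attribute entanglement.
A = np.eye(d) + 0.1 * np.ones((d, d))
x = np.linspace(0.0, 1.0, d)
mask = np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])
v = np.ones(d)
w0 = np.linalg.solve(A, x)  # latent code that reproduces x exactly
w_star = masked_latent_edit(A, x, mask, v, w0)
edited = A @ w_star
```

In this sketch the weight lam trades off attribute strength against preservation of the unmasked pixels; a real generator is nonlinear, so gradients would come from automatic differentiation rather than the closed-form expressions used here.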

References

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

Journal Article

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: This work proposes a new framework for estimating generative models via an adversarial process, in which two models are simultaneously trained: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

Proceedings Article

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu, +3 more

TL;DR: CycleGAN learns a mapping G : X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss.

Book Chapter

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Justin Johnson, +2 more

TL;DR: This paper proposes the use of perceptual loss functions for training feed-forward networks for image style transfer, where a feed-forward network is trained to solve, in real time, the optimization problem proposed by Gatys et al.

Proceedings Article

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, +2 more

TL;DR: This paper proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.

arXiv: Computer Vision and Pattern Recog...

Conditional Image Generation and Manipulation for User-Specified Content

David Stap, +3 more

- 11 May 2020 -

Related Papers (5)

ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes

Towards automatic image editing: learning to see another you

The GAN That Warped: Semantic Attribute Editing With Unpaired Data

Learning Residual Images for Face Attribute Manipulation

Conditional Image Generation and Manipulation for User-Specified Content