Open Access Proceedings Article

MaskGAN: Towards Diverse and Interactive Facial Image Manipulation

TLDR
MaskGAN enables diverse and interactive face manipulation by learning a style mapping between a free-form, user-modified mask and a target image.
Abstract
Facial image manipulation has achieved great progress in recent years. However, previous methods either operate on a predefined set of face attributes or leave users little freedom to interactively manipulate images. To overcome these drawbacks, we propose a novel framework termed MaskGAN, enabling diverse and interactive face manipulation. Our key insight is that semantic masks serve as a suitable intermediate representation for flexible face manipulation with fidelity preservation. MaskGAN has two main components: 1) Dense Mapping Network (DMN) and 2) Editing Behavior Simulated Training (EBST). Specifically, DMN learns a style mapping between a free-form, user-modified mask and a target image, enabling diverse generation results. EBST models the user editing behavior on the source mask, making the overall framework more robust to various manipulated inputs. To this end, it introduces dual-editing consistency as an auxiliary supervision signal. To facilitate extensive studies, we construct a large-scale high-resolution face dataset with fine-grained mask annotations named CelebAMask-HQ. MaskGAN is comprehensively evaluated on two challenging tasks: attribute transfer and style copy, demonstrating superior performance over other state-of-the-art methods. The code, models, and dataset are available at https://github.com/switchablenorms/CelebAMask-HQ.
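The abstract's two components lend themselves to a compact sketch. Below is a minimal, hypothetical rendering of how DMN and EBST could interact during training; the module interface, the `simulate_edit` helper, and the untouched-region interpretation of dual-editing consistency are illustrative assumptions, not the authors' released code.

```python
import torch.nn.functional as F

# Hypothetical sketch of the training signal described in the abstract.
# DMN maps a (user-modified mask, target image) pair to an edited face;
# EBST simulates user edits on the source mask and adds a dual-editing
# consistency loss. All names below are illustrative assumptions.

def ebst_step(dmn, source_img, source_mask, simulate_edit, w_consistency=1.0):
    """One EBST step: two simulated edits of the same source mask should
    produce images that agree wherever neither edit touched the mask."""
    mask_a = simulate_edit(source_mask)   # first simulated user edit
    mask_b = simulate_edit(source_mask)   # second, independent edit

    out_a = dmn(mask_a, source_img)       # style-mapped generation for edit A
    out_b = dmn(mask_b, source_img)       # style-mapped generation for edit B

    # Pixels whose label is unchanged by both edits ([B, 1, H, W]).
    same_a = (mask_a == source_mask).all(dim=1, keepdim=True)
    same_b = (mask_b == source_mask).all(dim=1, keepdim=True)
    untouched = (same_a & same_b).float()

    # Dual-editing consistency as an auxiliary supervision signal.
    return w_consistency * F.l1_loss(out_a * untouched, out_b * untouched)
```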

Citations
Proceedings Article

SEAN: Image Synthesis With Semantic Region-Adaptive Normalization

TL;DR: Semantic Region-Adaptive Normalization (SEAN) is a simple but effective building block for generative adversarial networks conditioned on segmentation masks that describe the semantic regions in the desired output image.
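As a rough sketch of the mechanism (an assumed PyTorch interface, not the SEAN authors' code): normalized activations are modulated by per-region scale and shift parameters derived from one style code per semantic region.

```python
import torch
import torch.nn as nn

class RegionAdaptiveNorm(nn.Module):
    """Hypothetical sketch of SEAN-style normalization: activations are
    instance-normalized, then modulated by per-region scale/shift values
    computed from one style code per semantic region."""
    def __init__(self, channels, style_dim):
        super().__init__()
        self.norm = nn.InstanceNorm2d(channels, affine=False)
        self.to_gamma = nn.Linear(style_dim, channels)
        self.to_beta = nn.Linear(style_dim, channels)

    def forward(self, x, seg, styles):
        # x:      [B, C, H, W] activations
        # seg:    [B, R, H, W] one-hot (float) segmentation masks
        # styles: [B, R, style_dim] one style code per region
        h = self.norm(x)
        gamma = self.to_gamma(styles)     # [B, R, C]
        beta = self.to_beta(styles)       # [B, R, C]
        # Scatter each region's (gamma, beta) over that region's pixels.
        gamma_map = torch.einsum('brhw,brc->bchw', seg, gamma)
        beta_map = torch.einsum('brhw,brc->bchw', seg, beta)
        return h * (1 + gamma_map) + beta_map
```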
Posted Content

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

TL;DR: The latent style space of StyleGAN2, a state-of-the-art architecture for image generation, is explored, and StyleSpace, the space of channel-wise style parameters, is shown to be significantly more disentangled than the other intermediate latent spaces explored by previous works.
Proceedings Article

Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content

TL;DR: This work proposes a novel visual try-on network, the Adaptive Content Generating and Preserving Network (ACGPN), which generates photo-realistic images with much better perceptual quality and richer fine details.
Proceedings Article

PD-GAN: Probabilistic Diverse GAN for Image Inpainting

TL;DR: PD-GAN modulates the deep features of input random noise from coarse to fine by injecting an initially restored image and the hole regions at multiple scales, generating multiple inpainting results with diverse and visually realistic content.
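A heavily simplified sketch of that idea, under the assumption that the modulation works SPADE-style from a resized image prior; the module name and layer choices are illustrative, not the paper's code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PriorModulation(nn.Module):
    """Hedged sketch of the PD-GAN idea: decoder features that start from
    random noise are modulated at each scale by scale/shift maps predicted
    from the (downsampled) coarsely restored image and the hole mask."""
    def __init__(self, feat_ch, img_ch=3, hidden=64):
        super().__init__()
        self.shared = nn.Conv2d(img_ch + 1, hidden, 3, padding=1)
        self.to_gamma = nn.Conv2d(hidden, feat_ch, 3, padding=1)
        self.to_beta = nn.Conv2d(hidden, feat_ch, 3, padding=1)

    def forward(self, feat, restored_img, hole_mask):
        # Resize the image prior to this scale, then predict modulation maps.
        prior = torch.cat([restored_img, hole_mask], dim=1)
        prior = F.interpolate(prior, size=feat.shape[2:], mode='bilinear',
                              align_corners=False)
        h = F.relu(self.shared(prior))
        return feat * (1 + self.to_gamma(h)) + self.to_beta(h)
```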
References
Proceedings Article

Generative Image Inpainting with Contextual Attention

TL;DR: Yu et al. propose a new deep generative model-based approach that can not only synthesize novel image structures but also explicitly utilize surrounding image features as references during network training to make better predictions.
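The core mechanism, reconstructing hole features as an attention-weighted sum of known-region features, can be sketched as follows. This uses 1x1 "patches" for brevity and is only an assumed simplification; the paper uses larger patches inside a two-stage network.

```python
import torch
import torch.nn.functional as F

def contextual_attention(feat, hole_mask, temperature=10.0):
    """Simplified sketch of contextual attention: features inside the hole
    are rebuilt as a softmax-weighted sum of features from known pixels."""
    b, c, h, w = feat.shape
    flat = feat.flatten(2)                          # [B, C, H*W]
    known = (1 - hole_mask).flatten(2)              # [B, 1, H*W], 1 = known

    q = F.normalize(flat, dim=1)                    # cosine similarity via
    sim = torch.einsum('bci,bcj->bij', q, q)        # normalized dot products

    # Hole pixels may not serve as attention sources.
    sim = sim.masked_fill(known == 0, float('-inf'))
    attn = F.softmax(sim * temperature, dim=-1)     # weights over sources j

    out = torch.einsum('bij,bcj->bci', attn, flat)  # borrow known features
    out = out.view(b, c, h, w)
    return feat * (1 - hole_mask) + out * hole_mask # only fill the hole
```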
Proceedings Article

Compositing digital images

TL;DR: Introduces four-channel pictures in which a matte (alpha) component is computed alongside the color channels, and discusses guidelines for generating elements and the arithmetic for their arbitrary compositing.
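The paper's central "over" operator with premultiplied alpha is compact enough to state directly:

```python
import numpy as np

def over(fg, bg):
    """Porter-Duff 'over' with premultiplied alpha. fg, bg are float
    arrays of shape [..., 4] holding (R, G, B, A), with color channels
    premultiplied by A; the same formula covers color and alpha."""
    a_fg = fg[..., 3:4]
    return fg + (1.0 - a_fg) * bg

# 50%-opaque red (premultiplied) over opaque blue -> [0.5, 0.0, 0.5, 1.0]
print(over(np.array([0.5, 0.0, 0.0, 0.5]), np.array([0.0, 0.0, 1.0, 1.0])))
```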
Book Chapter

Interactive facial feature localization

TL;DR: An improvement to the Active Shape Model is proposed that allows for greater independence among the facial components and improves on the appearance fitting step by introducing a Viterbi optimization process that operates along the facial contours.
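The Viterbi step is a standard dynamic program; a generic sketch follows, with hypothetical unary/pairwise scores standing in for the paper's actual energy terms along the contour.

```python
import numpy as np

def viterbi_contour(unary, pairwise):
    """Pick one candidate position per contour point. unary[t, k] scores
    candidate k at point t; pairwise[k, j] scores moving from candidate k
    to candidate j between neighboring points."""
    T, K = unary.shape
    score = np.zeros((T, K))
    back = np.zeros((T, K), dtype=int)
    score[0] = unary[0]
    for t in range(1, T):
        trans = score[t - 1][:, None] + pairwise   # [K, K] transition scores
        back[t] = trans.argmax(axis=0)             # best predecessor per j
        score[t] = unary[t] + trans.max(axis=0)
    path = [int(score[-1].argmax())]               # backtrace best path
    for t in range(T - 1, 0, -1):
        path.append(int(back[t][path[-1]]))
    return path[::-1]
```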
Journal Article

FaceWarehouse: A 3D Facial Expression Database for Visual Computing

TL;DR: FaceWarehouse, a database of 3D facial expressions for visual computing applications, provides a much richer matching collection of expressions than previous datasets, enabling depiction of most human facial actions.
Proceedings Article

Free-Form Image Inpainting With Gated Convolution

TL;DR: Yu et al. propose a generative image inpainting system that completes images with free-form masks and guidance, based on gated convolutions learned from millions of images without additional labeling effort.
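The gated convolution itself is a small building block: a feature branch and a learned soft gate multiplied elementwise, so the layer can learn where (for example, inside a free-form hole) features should pass through. A minimal sketch:

```python
import torch
import torch.nn as nn

class GatedConv2d(nn.Module):
    """Minimal sketch of a gated convolution: output = act(conv_f(x)) *
    sigmoid(conv_g(x)), i.e. features modulated by a learned soft mask."""
    def __init__(self, in_ch, out_ch, kernel_size=3, padding=1):
        super().__init__()
        self.feature = nn.Conv2d(in_ch, out_ch, kernel_size, padding=padding)
        self.gate = nn.Conv2d(in_ch, out_ch, kernel_size, padding=padding)
        self.act = nn.ELU()

    def forward(self, x):
        return self.act(self.feature(x)) * torch.sigmoid(self.gate(x))
```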