A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras, Samuli Laine, Timo Aila
- pp 4396-4405
TLDR
This paper proposes an alternative generator architecture for GANs, borrowing from style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.
Abstract:
We propose an alternative generator architecture for generative adversarial networks, borrowing from style transfer literature. The new architecture leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images (e.g., freckles, hair), and it enables intuitive, scale-specific control of the synthesis. The new generator improves the state-of-the-art in terms of traditional distribution quality metrics, leads to demonstrably better interpolation properties, and also better disentangles the latent factors of variation. To quantify interpolation quality and disentanglement, we propose two new, automated methods that are applicable to any generator architecture. Finally, we introduce a new, highly varied and high-quality dataset of human faces.
Citations
Proceedings Article
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or
TL;DR: The pixel2style2pixel (pSp) framework is based on a novel encoder network that directly generates a series of style vectors, which are fed into a pretrained StyleGAN generator, forming the extended $\mathcal{W}+$ latent space.
Proceedings Article
SynSin: End-to-End View Synthesis From a Single Image
TL;DR: This work proposes a novel differentiable point cloud renderer that is used to transform a latent 3D point cloud of features into the target view and outperforms baselines and prior work on the Matterport, Replica, and RealEstate10K datasets.
Posted Content
HoloGAN: Unsupervised learning of 3D representations from natural images
TL;DR: HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner and is shown to be able to generate images with similar or higher visual quality than other generative models.
Posted Content
Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics
TL;DR: This work presents a new large-scale, challenging DeepFake video dataset, Celeb-DF, which contains 5,639 high-quality DeepFake videos of celebrities generated using an improved synthesis process, and conducts a comprehensive evaluation of DeepFake detection methods and datasets to demonstrate the escalated level of challenges posed by Celeb-DF.
Posted Content
Closed-Form Factorization of Latent Semantics in GANs
Yujun Shen, Bolei Zhou
TL;DR: This work examines the internal representation learned by GANs to reveal the underlying variation factors in an unsupervised manner and proposes a closed-form factorization algorithm for latent semantic discovery by directly decomposing the pre-trained weights.
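
The idea of decomposing pre-trained weights in the summary above can be illustrated with a minimal sketch. It assumes the weights of interest form a single matrix A that maps a latent code to the generator's first features, and that a useful editing direction is the unit vector n that maximizes ||A n||, i.e. the top eigenvector of AᵀA. The matrix `A`, the function names, and the use of power iteration (chosen here only to avoid a library dependency) are illustrative assumptions, not the authors' code.

```python
import math
import random


def top_direction(weight, iters=200, seed=0):
    """Return a unit vector n that maximizes ||weight @ n||.

    weight is a list of rows with shape (out_dim, latent_dim).
    The result is the top eigenvector of A^T A, found by power iteration
    (an assumption made for this sketch; an eigen-solver would also work).
    """
    latent_dim = len(weight[0])
    rng = random.Random(seed)
    n = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
    for _ in range(iters):
        # y = A n
        y = [sum(a * b for a, b in zip(row, n)) for row in weight]
        # n = A^T y, then renormalize to unit length
        n = [sum(weight[i][j] * y[i] for i in range(len(weight)))
             for j in range(latent_dim)]
        norm = math.sqrt(sum(v * v for v in n))
        n = [v / norm for v in n]
    return n


def edit(latent, direction, alpha):
    """Move a latent code along a discovered direction: z' = z + alpha * n."""
    return [z + alpha * d for z, d in zip(latent, direction)]


# Hypothetical toy weight: latent axis 0 affects the output far more than axis 1.
A = [[3.0, 0.0], [0.0, 1.0]]
direction = top_direction(A)
shifted = edit([0.5, 0.5], direction, alpha=2.0)
```

With this toy matrix the recovered direction aligns with latent axis 0 (up to sign), the axis the weights amplify most. Further directions would correspond to the remaining eigenvectors in order of decreasing eigenvalue.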