Large Scale GAN Training for High Fidelity Natural Image Synthesis

Open AccessProceedings Article

Large Scale GAN Training for High Fidelity Natural Image Synthesis

TLDR

It is found that applying orthogonal regularization to the generator renders it amenable to a simple "truncation trick," allowing fine control over the trade-off between sample fidelity and variety by reducing the variance of the Generator's input.

Abstract:

Despite recent progress in generative image modeling, successfully generating high-resolution, diverse samples from complex datasets such as ImageNet remains an elusive goal. To this end, we train Generative Adversarial Networks at the largest scale yet attempted, and study the instabilities specific to such scale. We find that applying orthogonal regularization to the generator renders it amenable to a simple "truncation trick," allowing fine control over the trade-off between sample fidelity and variety by reducing the variance of the Generator's input. Our modifications lead to models which set the new state of the art in class-conditional image synthesis. When trained on ImageNet at 128x128 resolution, our models (BigGANs) achieve an Inception Score (IS) of 166.5 and Frechet Inception Distance (FID) of 7.4, improving over the previous best IS of 52.52 and FID of 18.6.

Citations

PDF

Open Access

More filters

Posted Content

Denoising Diffusion Probabilistic Models

Jonathan Ho, +2 more

- 19 Jun 2020 -

arXiv: Learning

TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.

...read moreread less

Posted Content

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

- 03 Dec 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work redesigns the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images, and thereby redefines the state of the art in unconditional image modeling.

...read moreread less

Proceedings ArticleDOI

Semantic Image Synthesis With Spatially-Adaptive Normalization

Taesung Park, +3 more

TL;DR: S spatially-adaptive normalization is proposed, a simple but effective layer for synthesizing photorealistic images given an input semantic layout that allows users to easily control the style and content of image synthesis results as well as create multi-modal results.

...read moreread less

Posted Content

Self-Attention Generative Adversarial Networks

Han Zhang, +3 more

- 21 May 2018 -

arXiv: Machine Learning

TL;DR: Self-Attention Generative Adversarial Network (SAGAN) as mentioned in this paper uses attention-driven, long-range dependency modeling for image generation tasks and achieves state-of-the-art results.

...read moreread less

Proceedings ArticleDOI

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

...read moreread less

Collapse

Large Scale GAN Training for High Fidelity Natural Image Synthesis

Citations

Denoising Diffusion Probabilistic Models

Analyzing and Improving the Image Quality of StyleGAN

Semantic Image Synthesis With Spatially-Adaptive Normalization

Self-Attention Generative Adversarial Networks

Analyzing and Improving the Image Quality of StyleGAN

Related Papers (5)

Generative Adversarial Nets

A Style-Based Generator Architecture for Generative Adversarial Networks

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Image-to-Image Translation with Conditional Adversarial Networks

Deep Residual Learning for Image Recognition