GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

Open Access · Proceedings Article

- Vol. 30, pp 6626-6637

TLDR

This paper proposes a two time-scale update rule (TTUR) for training GANs with stochastic gradient descent on arbitrary GAN loss functions. TTUR uses separate learning rates for the discriminator and the generator.
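As a rough illustration of what separate learning rates mean in practice, the sketch below runs simultaneous gradient updates on a toy two-player game. Everything in it is an assumption made for illustration and not taken from the paper: the objective V(d, g) = d*g - 0.5*d^2, the function name `ttur_toy`, the default step sizes, and the choice to give the discriminator the larger learning rate.

```python
import numpy as np


def ttur_toy(lr_d=0.1, lr_g=0.01, steps=2000, noise=0.0, seed=0):
    """Two time-scale updates on the toy game V(d, g) = d*g - 0.5*d**2.

    Hypothetical setup: the scalar d plays the discriminator (ascends V)
    and the scalar g plays the generator (descends V).
    """
    rng = np.random.default_rng(seed)
    d, g = 1.0, 1.0
    for _ in range(steps):
        # Stochastic gradients: exact gradient plus optional Gaussian noise.
        grad_d = (g - d) + noise * rng.standard_normal()  # dV/dd
        grad_g = d + noise * rng.standard_normal()        # dV/dg
        # Simultaneous update, each player with its own learning rate.
        d, g = d + lr_d * grad_d, g - lr_g * grad_g
    return d, g


if __name__ == "__main__":
    print(ttur_toy())
```

With the default settings and no noise, both parameters shrink toward (0, 0), the equilibrium of this toy game. Real GAN training would replace the scalar gradients with network gradients and typically use two optimizer instances, one per network, each with its own learning rate.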

Abstract:

Generative Adversarial Networks (GANs) excel at creating realistic images with complex models for which maximum likelihood is infeasible. However, the convergence of GAN training has still not been proved. We propose a two time-scale update rule (TTUR) for training GANs with stochastic gradient descent on arbitrary GAN loss functions. TTUR has an individual learning rate for both the discriminator and the generator. Using the theory of stochastic approximation, we prove that the TTUR converges under mild assumptions to a stationary local Nash equilibrium. The convergence carries over to the popular Adam optimization, for which we prove that it follows the dynamics of a heavy ball with friction and thus prefers flat minima in the objective landscape. For the evaluation of the performance of GANs at image generation, we introduce the "Fréchet Inception Distance" (FID), which captures the similarity of generated images to real ones better than the Inception Score. In experiments, TTUR improves learning for DCGANs and Improved Wasserstein GANs (WGAN-GP), outperforming conventional GAN training on CelebA, CIFAR-10, SVHN, LSUN Bedrooms, and the One Billion Word Benchmark.
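The abstract names the FID without giving its formula. The Fréchet distance between two Gaussians with means m, m_w and covariances C, C_w is d^2 = ||m - m_w||^2 + Tr(C + C_w - 2 (C C_w)^(1/2)), and the FID applies it to feature statistics of real and generated images. The sketch below computes that quantity for two arrays of feature vectors. The function name `frechet_distance` and the random stand-in features are assumptions for illustration; in an actual FID evaluation the features would come from an Inception network, which is not reproduced here.

```python
import numpy as np


def frechet_distance(feats_a, feats_b):
    """Frechet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (n_samples, n_features).
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)
    # Tr((C_a C_b)^(1/2)) equals the sum of square roots of the eigenvalues
    # of C_a C_b, which are real and non-negative for covariance matrices.
    # Clip small negative values caused by floating-point error.
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    real = rng.standard_normal((500, 4))       # stand-in for real-image features
    fake = rng.standard_normal((500, 4)) + 0.5  # stand-in for generated features
    print(frechet_distance(real, fake))
```

Identical feature sets give a distance of zero up to rounding, and shifting one set by a constant vector adds the squared length of that vector, since the covariances are unchanged.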

Citations

Posted Content

Denoising Diffusion Probabilistic Models

Jonathan Ho, +2 more

- 19 Jun 2020 -

arXiv: Learning

TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.

Posted Content

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

- 03 Dec 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work redesigns the generator normalization, revisits progressive growing, and regularizes the generator to encourage good conditioning in the mapping from latent codes to images, thereby redefining the state of the art in unconditional image modeling.

Proceedings Article

Semantic Image Synthesis With Spatially-Adaptive Normalization

Taesung Park, +3 more

TL;DR: Spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images from an input semantic layout, is proposed. It lets users control the style and content of the synthesis results and create multi-modal results.

Posted Content

Self-Attention Generative Adversarial Networks

Han Zhang, +3 more

- 21 May 2018 -

arXiv: Machine Learning

TL;DR: The Self-Attention Generative Adversarial Network (SAGAN) uses attention-driven, long-range dependency modeling for image generation tasks and achieves state-of-the-art results.

Proceedings Article

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

Related Papers (5)

Generative Adversarial Nets

Image-to-Image Translation with Conditional Adversarial Networks

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Adam: A Method for Stochastic Optimization

Deep Residual Learning for Image Recognition