Semantic Image Synthesis With Spatially-Adaptive Normalization
Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu
pp. 2337–2346
TLDR
Spatially-adaptive normalization is proposed, a simple but effective layer for synthesizing photorealistic images given an input semantic layout, which allows users to easily control the style and content of image synthesis results as well as create multi-modal results.
Abstract:
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the network, forcing the network to memorize the information throughout all the layers. Instead, we propose using the input layout for modulating the activations in normalization layers through a spatially-adaptive, learned affine transformation. Experiments on several challenging datasets demonstrate the superiority of our method compared to existing approaches, regarding both visual fidelity and alignment with input layouts. Finally, our model allows users to easily control the style and content of image synthesis results as well as create multi-modal results. Code is available upon publication.
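The modulation described in the abstract can be sketched in NumPy. This is a minimal, illustrative version, not the paper's implementation: the function name and the single 1x1-conv simplification are assumptions (the paper predicts the modulation parameters with a small convolutional network), but the structure matches the description: parameter-free normalization followed by a spatially-varying affine transform predicted from the layout.

```python
import numpy as np

def spade_sketch(x, segmap, w_gamma, w_beta, eps=1e-5):
    """Illustrative SPADE-style layer (hypothetical helper, not the paper's code).

    x:       activations, shape (C, H, W)
    segmap:  one-hot semantic layout, shape (L, H, W)
    w_gamma, w_beta: (C, L) weights of a 1x1 "conv" that predicts the
                     per-pixel scale and shift from the layout
    """
    # Parameter-free normalization (per-channel, instance-norm style)
    mu = x.mean(axis=(1, 2), keepdims=True)
    sigma = x.std(axis=(1, 2), keepdims=True)
    x_norm = (x - mu) / (sigma + eps)

    # Spatially-varying scale and shift predicted from the semantic layout;
    # a 1x1 convolution over a one-hot map reduces to this einsum
    gamma = np.einsum("cl,lhw->chw", w_gamma, segmap)
    beta = np.einsum("cl,lhw->chw", w_beta, segmap)

    return (1.0 + gamma) * x_norm + beta
```

Because gamma and beta vary per pixel with the layout, different semantic regions receive different affine transforms, which is what lets the layout steer the activations instead of being memorized from the input.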
Citations
Journal Article
A study report on "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks"
Posted Content
Self-Attention Generative Adversarial Networks
TL;DR: The Self-Attention Generative Adversarial Network (SAGAN) introduces attention-driven, long-range dependency modeling for image generation tasks and achieves state-of-the-art results.
Proceedings ArticleDOI
Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
TL;DR: This work proposes an efficient algorithm to embed a given image into the latent space of StyleGAN, which enables semantic image editing operations that can be applied to existing photographs.
Posted Content
Taming Transformers for High-Resolution Image Synthesis
TL;DR: It is demonstrated how combining the inductive bias of CNNs with the expressivity of transformers enables modeling, and thereby synthesizing, high-resolution images.
Posted Content
StarGAN v2: Diverse Image Synthesis for Multiple Domains
TL;DR: StarGAN v2, a single framework that addresses both the limited diversity of translated images and the need for separate models per domain, is proposed and shows significantly improved results over the baselines.
References
Proceedings ArticleDOI
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele
TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.
Proceedings ArticleDOI
A Style-Based Generator Architecture for Generative Adversarial Networks
TL;DR: This paper proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.
Proceedings Article
Wasserstein Generative Adversarial Networks
TL;DR: This work introduces a new algorithm named WGAN, an alternative to traditional GAN training that can improve the stability of learning, get rid of problems like mode collapse, and provide meaningful learning curves useful for debugging and hyperparameter searches.
Posted Content
GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium
Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, Günter Klambauer, Sepp Hochreiter
TL;DR: In this article, a two time-scale update rule (TTUR) is proposed for training GANs with stochastic gradient descent on arbitrary GAN loss functions, using separate learning rates for the discriminator and the generator.
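The core of the TTUR summarized above is just that the discriminator and generator are updated with different learning rates. A minimal plain-SGD sketch follows; the function name and the specific rates (4e-4 for D, 1e-4 for G, a common pairing in practice) are illustrative assumptions, not values taken from this page.

```python
# Two time-scale update rule (TTUR) sketch: one SGD step where the
# discriminator and generator each have their own learning rate.
# The rates below are assumed example values, not from the paper page.
LR_D = 4e-4  # discriminator learning rate (typically the larger one)
LR_G = 1e-4  # generator learning rate

def ttur_step(d_params, g_params, d_grads, g_grads):
    """Apply one plain-SGD update to each network at its own time scale."""
    d_params = [p - LR_D * g for p, g in zip(d_params, d_grads)]
    g_params = [p - LR_G * g for p, g in zip(g_params, g_grads)]
    return d_params, g_params
```

In a real training loop the same idea is usually realized by constructing two optimizers, one per network, each with its own learning rate.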
Posted Content
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
TL;DR: This work presents an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples, and introduces a cycle consistency loss to push F(G(X)) ≈ X (and vice versa).
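The cycle consistency loss mentioned in the summary (pushing F(G(X)) ≈ X and G(F(Y)) ≈ Y) can be written compactly. A minimal NumPy sketch under the assumption of an L1 reconstruction penalty, as in the CycleGAN formulation; the function name is illustrative.

```python
import numpy as np

def cycle_consistency_loss(x, y, G, F):
    """L1 cycle loss: G maps domain X -> Y, F maps Y -> X.
    Penalizes failure of F(G(x)) to reconstruct x and of G(F(y))
    to reconstruct y."""
    forward_cycle = np.abs(F(G(x)) - x).mean()
    backward_cycle = np.abs(G(F(y)) - y).mean()
    return forward_cycle + backward_cycle
```

If G and F are exact inverses of each other, the loss is zero; any mismatch between the two mappings contributes a per-pixel L1 penalty.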