ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Open AccessPosted Content

ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Xintao Wang, +8 more

- 01 Sep 2018 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

This work thoroughly study three key components of SRGAN – network architecture, adversarial loss and perceptual loss, and improves each of them to derive an Enhanced SRGAN (ESRGAN), which achieves consistently better visual quality with more realistic and natural textures than SRGAN.

Abstract:

The Super-Resolution Generative Adversarial Network (SRGAN) is a seminal work that is capable of generating realistic textures during single image super-resolution. However, the hallucinated details are often accompanied with unpleasant artifacts. To further enhance the visual quality, we thoroughly study three key components of SRGAN - network architecture, adversarial loss and perceptual loss, and improve each of them to derive an Enhanced SRGAN (ESRGAN). In particular, we introduce the Residual-in-Residual Dense Block (RRDB) without batch normalization as the basic network building unit. Moreover, we borrow the idea from relativistic GAN to let the discriminator predict relative realness instead of the absolute value. Finally, we improve the perceptual loss by using the features before activation, which could provide stronger supervision for brightness consistency and texture recovery. Benefiting from these improvements, the proposed ESRGAN achieves consistently better visual quality with more realistic and natural textures than SRGAN and won the first place in the PIRM2018-SR Challenge. The code is available at this https URL .

Citations

PDF

Open Access

More filters

Posted Content

The Six Fronts of the Generative Adversarial Networks

Alceu Bissoto, +2 more

- 29 Oct 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The target is to be an entry point to this vast literature, and to be able to update experienced researchers to the newest techniques, by splitting that incoming wave into six "fronts": Architectural Contributions, Conditional Techniques, Normalization and Constraint Contributions, Loss Functions, Image-to-image Translations, and Validation Metrics.

...read moreread less

Journal ArticleDOI

Unsupervised multi-domain multimodal image-to-image translation with explicit domain-constrained disentanglement.

Weihao Xia, +2 more

- 01 Nov 2020 -

Neural Networks

TL;DR: In this article, a unified framework for learning to generate diverse outputs using unpaired training data and allowing for simultaneous multi-domain translation via a single model is proposed, which outperforms or is comparable with the state-of-the-art methods for various applications.

...read moreread less

Posted Content

Structure Preserving Compressive Sensing MRI Reconstruction using Generative Adversarial Networks

Puneesh Deora, +3 more

- 14 Oct 2019 -

arXiv: Image and Video Processing

TL;DR: A novel generative adversarial network (GAN) based framework for CS-MRI reconstruction is proposed, leveraging a combination of patch-based discriminator and structural similarity index based loss that outperforms state-of-the-art methods in terms of quality of reconstruction and robustness to noise.

...read moreread less

Journal ArticleDOI

DeepBedMap: a deep neural network for resolving the bed topography of Antarctica

Wei Ji Leong, +1 more

- 05 Nov 2020 -

The Cryosphere

TL;DR: DeepBedMap as discussed by the authors is based on an adapted architecture of the Enhanced Super-Resolution Generative Adversarial Network (ESGRAN) to minimize per-pixel elevation errors while producing realistic topography.

...read moreread less

Journal ArticleDOI

Toward Real-World Super-Resolution via Adaptive Downsampling Models.

Sanghyun Son, +4 more

- 24 Aug 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: In this paper, an adaptive data loss (ADL) is proposed for the downsampling network, which can be adaptively learned from given LR images and updated in the training loops.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less