Open Access · Posted Content

Guided Disentanglement in Generative Networks

TLDR
This paper presents a comprehensive method for disentangling physics-based traits in image-to-image translation, guiding the learning process with neural or physical models and integrating adversarial estimation with genetic algorithms to achieve disentanglement.
Abstract
Image-to-image translation (i2i) networks suffer from entanglement effects in the presence of physics-related phenomena in the target domain (such as occlusions, fog, etc.), lowering translation quality and variability. In this paper, we present a comprehensive method for disentangling physics-based traits in the translation, guiding the learning process with neural or physical models. For the latter, we integrate adversarial estimation and genetic algorithms to correctly achieve disentanglement. The results show that our approach dramatically increases performance in many challenging image-translation scenarios.
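As a rough illustration of the "genetic algorithms" component mentioned in the abstract: a minimal sketch, assuming the physical model exposes a single scalar parameter (e.g. a fog density) and the adversarial discriminator provides a score to minimise. The function names, the parameter, and the stand-in fitness function below are hypothetical illustrations, not the paper's implementation.

```python
import numpy as np

def genetic_estimate(fitness, bounds, pop_size=30, generations=50, seed=0):
    """Minimal genetic algorithm: evolve a scalar physics parameter
    (e.g. fog density) to minimise a fitness score. In the paper's
    setting that score would come from an adversarial discriminator;
    here it is a stand-in function."""
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    pop = rng.uniform(lo, hi, size=pop_size)
    for _ in range(generations):
        scores = np.array([fitness(p) for p in pop])
        # selection: keep the best half as parents
        parents = pop[np.argsort(scores)[: pop_size // 2]]
        # crossover: average random parent pairs
        pairs = rng.choice(parents, size=(pop_size, 2))
        children = pairs.mean(axis=1)
        # mutation: small Gaussian noise, clipped back into bounds
        children += rng.normal(0.0, 0.05 * (hi - lo), size=pop_size)
        pop = np.clip(children, lo, hi)
    return pop[np.argmin([fitness(p) for p in pop])]

# stand-in "discriminator": pretends the real domain has fog density 0.3
best = genetic_estimate(lambda beta: (beta - 0.3) ** 2, bounds=(0.0, 1.0))
```

The population converges near the parameter value that the stand-in score rewards, which is the role the adversarial estimate would play in guiding the physical model.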


References
Proceedings ArticleDOI

Image-to-Image Translation with Conditional Adversarial Networks

TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.
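The pix2pix generator objective combines an adversarial term with an L1 reconstruction term; a minimal sketch with NumPy stand-ins for the networks (the function name, `lam` default, and scalar discriminator score are simplifications of ours, not the paper's code):

```python
import numpy as np

def pix2pix_objective(d_fake, fake, target, lam=100.0):
    """pix2pix-style generator objective (sketch): a non-saturating GAN
    term on the discriminator's probability for the generated image,
    plus a lambda-weighted L1 term pulling the output toward the
    ground-truth image. `d_fake` is a probability in (0, 1]."""
    gan_term = -np.log(d_fake)               # encourage D to score fakes as real
    l1_term = np.mean(np.abs(fake - target))  # structured reconstruction penalty
    return gan_term + lam * l1_term
```

The L1 term is what lets the conditional GAN stay faithful to the input layout while the adversarial term sharpens textures.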
Proceedings ArticleDOI

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

TL;DR: CycleGAN learns a mapping G : X → Y such that the distribution of images from G(X) is indistinguishable from the distribution of Y under an adversarial loss, coupling it with an inverse mapping F : Y → X and a cycle-consistency loss that enforces F(G(X)) ≈ X.
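The cycle-consistency idea can be sketched in a few lines; `G`, `F`, and the toy data below are stand-ins for the learned networks, assuming both mappings act elementwise on an image array:

```python
import numpy as np

def cycle_consistency_loss(x, G, F):
    """L1 cycle loss from CycleGAN: translating to the other domain and
    back, F(G(x)), should reconstruct the original x."""
    return np.mean(np.abs(F(G(x)) - x))

# toy mappings: G doubles intensities, F halves them (perfect inverses)
x = np.linspace(0.0, 1.0, 16)
loss = cycle_consistency_loss(x, G=lambda v: 2.0 * v, F=lambda v: v / 2.0)
```

Perfect inverses give zero cycle loss; in training, minimising this term is what constrains the otherwise underdetermined unpaired translation.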
Proceedings ArticleDOI

Pyramid Scene Parsing Network

TL;DR: This paper exploits global context information through region-based context aggregation with a pyramid pooling module in the proposed pyramid scene parsing network (PSPNet), producing high-quality results on the scene parsing task.
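The pyramid pooling module can be sketched as average-pooling one feature map into several coarse grids; a minimal single-channel version (real PSPNet then upsamples and concatenates these with the original features, which this sketch omits):

```python
import numpy as np

def pyramid_pool(feature, bins=(1, 2, 4)):
    """PSPNet-style pyramid pooling (sketch): average-pool a 2-D feature
    map into b-by-b grids for each bin size, summarising context at
    several scales from fully global (1x1) to finer regions."""
    h, w = feature.shape
    pooled = []
    for b in bins:
        grid = np.zeros((b, b))
        for i in range(b):
            for j in range(b):
                # average over this cell of the b-by-b partition
                grid[i, j] = feature[i*h//b:(i+1)*h//b, j*w//b:(j+1)*w//b].mean()
        pooled.append(grid)
    return pooled

pooled = pyramid_pool(np.ones((4, 4)))
```

The 1×1 bin is a plain global average; larger bins keep coarse spatial layout, which is what lets the network reason about context beyond its receptive field.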
Proceedings ArticleDOI

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

TL;DR: This work combines existing fine-grained visualizations to create a high-resolution class-discriminative visualization, Guided Grad-CAM, and applies it to image classification, image captioning, and visual question answering (VQA) models, including ResNet-based architectures.
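The core Grad-CAM computation is compact enough to sketch directly; the arrays below stand in for a convolutional layer's activations and the gradients of a class score with respect to them:

```python
import numpy as np

def grad_cam(feature_maps, gradients):
    """Grad-CAM heatmap: global-average-pool the gradients per channel
    to get importance weights, take the weighted sum of the feature
    maps, then ReLU to keep only regions that contribute positively
    to the class score. Inputs have shape (C, H, W)."""
    weights = gradients.mean(axis=(1, 2))               # (C,) channel weights
    cam = np.tensordot(weights, feature_maps, axes=1)   # (H, W) weighted sum
    return np.maximum(cam, 0.0)                         # ReLU
```

Guided Grad-CAM, as the TL;DR notes, then multiplies this coarse class-discriminative map with a fine-grained guided-backpropagation visualization.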
Proceedings ArticleDOI

The Cityscapes Dataset for Semantic Urban Scene Understanding

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.