Semantic Image Inpainting with Deep Generative Models

doi:10.1109/CVPR.2017.728

Open AccessProceedings ArticleDOI

Semantic Image Inpainting with Deep Generative Models

Raymond A. Yeh, +6 more

- pp 6882-6890

Chats0

TLDR

A novel method for semantic image inpainting, which generates the missing content by conditioning on the available data, and successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.

Abstract:

Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results due to the lack of high level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning on the available data. Given a trained generative model, we search for the closest encoding of the corrupted image in the latent image manifold using our context and prior losses. This encoding is then passed through the generative model to infer the missing content. In our method, inference is possible irrespective of how the missing content is structured, while the state-of-the-art learning based method requires specific information about the holes in the training phase. Experiments on three datasets show that our method successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

X-Reality Museums: Unifying the Virtual and Real World Towards Realistic Virtual Museums

George Margetis, +4 more

- 31 Dec 2020 -

Applied Sciences

TL;DR: The presented work proposes the synthesis of augmented, virtual and mixed reality technologies to provide unified X-Reality experiences in realistic virtual museums, engaging visitors in an interactive and seamless fusion of physical and virtual worlds that will feature virtual agents exhibiting naturalistic behavior.

...read moreread less

Proceedings ArticleDOI

Deepfake Video Detection Based on Spatial, Spectral, and Temporal Inconsistencies Using Multimodal Deep Learning

John K. Lewis, +7 more

TL;DR: In this article, the Discrete Cosine Transform (DCT) was used to capture spectral features of individual frames of a video frame to distinguish real and fake video frames, achieving 61.95% accuracy on the Facebook Deepfake Detection Challenge (DFDC) dataset.

...read moreread less

Proceedings ArticleDOI

MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin, +3 more

TL;DR: This paper imitates the process of painters' conjecture, and proposes to introduce the text description into the image inpainting task for the first time, which provides abundant guidance information for image restoration through the fusion of multimodal features.

...read moreread less

Posted Content

Multi-level Encoder-Decoder Architectures for Image Restoration.

Indra Deep Mastan, +1 more

- 01 May 2019 -

arXiv: Image and Video Processing

TL;DR: In this article, a multi-level extension of the encoder-decoder network is proposed to investigate the relationship between image restoration and network construction independent of learning, and the proposed framework allows various network structures by modifying the following network components: skip links, cascading of the network input into intermediate layers, and network depth.

...read moreread less

Journal ArticleDOI

P+: Extended Textual Conditioning in Text-to-Image Generation

Daniel Cohen-Or, +3 more

- 16 Mar 2023 -

arXiv.org

TL;DR: This article introduced an Extended Textual Conditioning space in text-to-image models, referred to as $P+$ , which consists of multiple textual conditions, derived from per-layer prompts, each corresponding to a layer of the denoising U-net of the diffusion model.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Journal ArticleDOI

Image quality assessment: from error visibility to structural similarity

Zhou Wang, +3 more

- 01 Apr 2004 -

IEEE Transactions on Image Processing

TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Journal Article

Visualizing Data using t-SNE

Laurens van der Maaten, +1 more

- 01 Jan 2008 -

Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

...read moreread less

Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

...read moreread less

Collapse

arXiv: Learning

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu, +3 more

Semantic Image Inpainting with Deep Generative Models

Citations

X-Reality Museums: Unifying the Virtual and Real World Towards Realistic Virtual Museums

Deepfake Video Detection Based on Spatial, Spectral, and Temporal Inconsistencies Using Multimodal Deep Learning

MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Multi-level Encoder-Decoder Architectures for Image Restoration.

P+: Extended Textual Conditioning in Text-to-Image Generation

References

Adam: A Method for Stochastic Optimization

Image quality assessment: from error visibility to structural similarity

Generative Adversarial Nets

Visualizing Data using t-SNE

Auto-Encoding Variational Bayes

Related Papers (5)

Generative Adversarial Nets

Image-to-Image Translation with Conditional Adversarial Networks

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Deep Residual Learning for Image Recognition

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks