Open Access · Proceedings Article

Semantic Image Inpainting with Deep Generative Models

TLDR
This paper proposes a novel method for semantic image inpainting that generates the missing content by conditioning on the available data. The method successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming state-of-the-art methods.
Abstract
Semantic image inpainting is a challenging task in which large missing regions have to be filled based on the available visual data. Existing methods that extract information from only a single image generally produce unsatisfactory results due to the lack of high-level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning on the available data. Given a trained generative model, we search for the closest encoding of the corrupted image in the latent image manifold using our context and prior losses. This encoding is then passed through the generative model to infer the missing content. In our method, inference is possible irrespective of how the missing content is structured, while the state-of-the-art learning-based method requires specific information about the holes in the training phase. Experiments on three datasets show that our method successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.
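
The latent search described in the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not the paper's implementation: the "generator" is a hypothetical fixed linear map `W`, a squared error on the known pixels stands in for the context loss, and an L2 penalty on the code `z` stands in for the prior loss. The abstract does not define either loss, so treat these choices as placeholders. The final compositing step, which keeps known pixels and fills the hole from the generator output, is likewise an assumption.

```python
import numpy as np

def inpaint(y, mask, W, lam=1e-3, steps=500):
    """Toy latent search: find z so that W @ z matches y on known pixels.

    Assumptions (not from the paper): linear generator W, squared-error
    context loss on masked pixels, and lam * ||z||^2 as a stand-in prior.
    """
    # Step size 1/L, where L bounds the curvature of the quadratic objective.
    lr = 1.0 / (2.0 * (np.linalg.norm(W, 2) ** 2 + lam))
    z = np.zeros(W.shape[1])
    for _ in range(steps):
        residual = mask * (W @ z - y)  # error on known pixels only
        grad = 2.0 * (W.T @ residual) + 2.0 * lam * z
        z -= lr * grad
    generated = W @ z
    # Keep known pixels as given; fill the hole from the generator output.
    return mask * y + (1.0 - mask) * generated, z

# Hypothetical data: a 64-pixel "image" lying on a 4-dimensional manifold.
rng = np.random.default_rng(0)
W = rng.normal(size=(64, 4))
x_true = W @ rng.normal(size=4)
mask = np.ones(64)
mask[20:40] = 0.0          # 1 = known pixel, 0 = missing pixel
y = mask * x_true          # corrupted observation

restored, z_hat = inpaint(y, mask, W)
hole_error = np.max(np.abs(restored - x_true)[mask == 0])
```

Note that the mask enters only at inference time. The generator is never told where the hole is, which mirrors the abstract's point that inference does not depend on how the missing content is structured.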

