Semantic Image Inpainting with Deep Generative Models
Raymond A. Yeh,Chen Chen,Teck Yian Lim,Alexander G. Schwing,Alexander G. Schwing,Mark Hasegawa-Johnson,Minh N. Do +6 more
- pp 6882-6890
Reads0
Chats0
TLDR
A novel method for semantic image inpainting, which generates the missing content by conditioning on the available data, and successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.Abstract:
Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results due to the lack of high level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning on the available data. Given a trained generative model, we search for the closest encoding of the corrupted image in the latent image manifold using our context and prior losses. This encoding is then passed through the generative model to infer the missing content. In our method, inference is possible irrespective of how the missing content is structured, while the state-of-the-art learning based method requires specific information about the holes in the training phase. Experiments on three datasets show that our method successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.read more
Citations
More filters
Posted Content
Making Images Real Again: A Comprehensive Survey on Deep Image Composition.
TL;DR: Wang et al. as mentioned in this paper summarized the datasets and methods for the above research directions and discussed the limitations and potential directions to facilitate the future research for image composition, including image harmonization, object placement, and geometry inconsistency.
Journal ArticleDOI
Diffusion map particle systems for generative modeling
TL;DR: In this article , diffusion maps are used to approximate the generator of the Langevin diffusion process from samples, and hence to learn the underlying data-generating manifold, which enables efficient sampling from the target distribution given a suitable choice of kernel, which is constructed via a spectral approximation of the generator, computed with diffusion maps.
Journal ArticleDOI
Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling
TL;DR: Li et al. as discussed by the authors proposed Shortcut Sampling for Diffusion (SSD), a pipeline for solving inverse problems, where the key concept of SSD is to find the "Embryo", a transitional state that bridges the measurement image y and the restored image x.
Book ChapterDOI
Deep Dictionary Learning for Inpainting
Karthik Seemakurthy,Angshul Majumdar,Jayavardhana Gubbi,N. K. Sandeep,Ashley Varghese,Smita N. Deshpande,M. Girish Chandra,P. Balamurali +7 more
TL;DR: In this article, an alternating minimization (AM) approach is proposed to derive the dictionaries and their corresponding sparse coefficients at each level of the DDL framework for multispectral image inpainting.
Posted Content
SimMIM: A Simple Framework for Masked Image Modeling
TL;DR: SimMIM as discussed by the authors is a simple framework for masked image modeling without special designs such as block-wise masking and tokenization via discrete VAE or clustering, which shows that simple designs of each component have revealed very strong representation learning performance.
References
More filters
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI
Image quality assessment: from error visibility to structural similarity
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
Journal ArticleDOI
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Journal Article
Visualizing Data using t-SNE
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma,Max Welling +1 more
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.