Semantic Image Inpainting with Deep Generative Models
Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do
- pp 6882-6890
TLDR
A novel method for semantic image inpainting that generates the missing content by conditioning on the available data. It predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming state-of-the-art methods.
Abstract:
Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods that extract information from only a single image generally produce unsatisfactory results due to the lack of high-level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning on the available data. Given a trained generative model, we search for the closest encoding of the corrupted image in the latent image manifold using our context and prior losses. This encoding is then passed through the generative model to infer the missing content. In our method, inference is possible irrespective of how the missing content is structured, while the state-of-the-art learning-based method requires specific information about the holes in the training phase. Experiments on three datasets show that our method successfully predicts information in large missing regions and achieves pixel-level photorealism, significantly outperforming the state-of-the-art methods.
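The latent search described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation. The linear generator `generate`, the squared-error context loss on known pixels, and the L2 penalty on the latent code (standing in for the prior loss, whose form the abstract does not give) are all assumptions made for illustration, as are the names `find_encoding` and `inpaint`.

```python
# Toy sketch of latent-space inpainting. Everything here is illustrative:
# the linear "generator", the squared-error context loss, and the L2 latent
# penalty are assumed stand-ins, not the networks or losses from the paper.

def generate(W, b, z):
    """Hypothetical generator G(z) = W z + b mapping a latent code to pixels."""
    return [sum(w * zj for w, zj in zip(row, z)) + bi for row, bi in zip(W, b)]


def find_encoding(W, b, y, mask, lam=1e-3, lr=0.1, steps=200):
    """Gradient descent on z for the assumed objective
    sum_i mask_i * (G(z)_i - y_i)^2 + lam * ||z||^2."""
    z = [0.0] * len(W[0])
    for _ in range(steps):
        g = generate(W, b, z)
        # Residual on known pixels only; missing pixels contribute nothing.
        resid = [m * (gi - yi) for m, gi, yi in zip(mask, g, y)]
        grad = [
            2.0 * sum(r * row[j] for r, row in zip(resid, W)) + 2.0 * lam * z[j]
            for j in range(len(z))
        ]
        z = [zj - lr * gj for zj, gj in zip(z, grad)]
    return z


def inpaint(W, b, y, mask, **kwargs):
    """Keep known pixels from y and fill the holes with G(z) at the found z."""
    z = find_encoding(W, b, y, mask, **kwargs)
    g = generate(W, b, z)
    return [yi if m else gi for m, yi, gi in zip(mask, y, g)]


# A six-pixel "image" generated from a two-dimensional latent code.
W = [[1, 0], [0, 1], [1, 1], [1, -1], [2, 1], [1, 2]]
b = [0.0] * 6
z_true = [0.5, -1.0]
x_true = generate(W, b, z_true)

mask = [1, 1, 1, 1, 0, 0]  # 1 = pixel observed, 0 = pixel missing
y = [xi if m else 0.0 for xi, m in zip(x_true, mask)]  # corrupted image

x_hat = inpaint(W, b, y, mask)
print(x_hat)
```

Because the mask only enters the loss at inference time, the same generator can be reused for any hole shape, which mirrors the abstract's point that inference does not depend on how the missing content is structured.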
Citations
Posted Content
Swapping Autoencoder for Deep Image Manipulation
TL;DR: The Swapping Autoencoder is proposed, a deep model designed specifically for image manipulation, rather than random sampling, that can be used to manipulate real input images in various ways, including texture swapping, local and global editing, and latent code vector arithmetic.
Proceedings Article
UCTGAN: Diverse Image Inpainting Based on Unsupervised Cross-Space Translation
TL;DR: The Unsupervised Cross-space Translation Generative Adversarial Network (UCTGAN) is presented. It consists mainly of three network modules (a conditional encoder module, a manifold projection module, and a generation module), which are combined to learn a one-to-one image mapping between two spaces in an unsupervised way.
Proceedings Article
Extreme 3D Face Reconstruction: Seeing Through Occlusions
TL;DR: In this article, a deep convolutional encoder-decoder is used to estimate a coarse 3D face shape that acts as a foundation; this foundation is then separately layered with details represented by a bump map.
Book Chapter
Deep Video Generation, Prediction and Completion of Human Action Sequences
TL;DR: In this paper, a two-stage framework is proposed to generate human action videos with no constraints or an arbitrary number of constraints. It uniformly addresses three problems: video generation given no input frames, video prediction given the first few frames, and video completion given the first and last frames.
Journal Article
High-fidelity facial reflectance and geometry inference from an unconstrained image
Shugo Yamaguchi, Shunsuke Saito, Koki Nagano, Yajie Zhao, Weikai Chen, Kyle Olszewski, Shigeo Morishima, Hao Li
TL;DR: A deep learning-based technique to infer high-quality facial reflectance and geometry given a single unconstrained image of the subject, which may contain partial occlusions and arbitrary illumination conditions, and demonstrates the rendering of high-fidelity 3D avatars from a variety of subjects captured under different lighting conditions.
References
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal Article
Image quality assessment: from error visibility to structural similarity
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, and it is compared with both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
Journal Article
Generative Adversarial Nets
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously trained: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Journal Article
Visualizing Data using t-SNE
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma, Max Welling
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.