Open Access Proceedings ArticleDOI

A Generative Model of People in Clothing

TLDR
The first image-based generative model of people in clothing for the full body is presented; it sidesteps the commonly used complex graphics rendering pipeline and the need for high-quality 3D scans of dressed people, and is instead learned from a large image database.
Abstract
We present the first image-based generative model of people in clothing for the full body. We sidestep the commonly used complex graphics rendering pipeline and the need for high-quality 3D scans of dressed people. Instead, we learn generative models from a large image database. The main challenge is to cope with the high variance in human pose, shape and appearance. For this reason, pure image-based approaches have not been considered so far. We show that this challenge can be overcome by splitting the generation process into two parts. First, we learn to generate a semantic segmentation of the body and clothing. Second, we learn a conditional model on the resulting segments that creates realistic images. The full model is differentiable and can be conditioned on pose, shape or color. The results are samples of people in different clothing items and styles. The proposed model can generate entirely new people with realistic clothing. In several experiments we present encouraging results that suggest an entirely data-driven approach to people generation is possible.
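To make the two-stage split concrete, here is a minimal PyTorch sketch of the idea; the module names, layer sizes, and conditioning code below are illustrative assumptions, not the paper's actual architecture. Stage one maps a pose/shape/appearance code to a per-pixel segmentation, and stage two translates that segmentation into an RGB image; because both stages are differentiable, the whole pipeline can be conditioned and trained end to end.

```python
import torch
import torch.nn as nn

class SegmentGenerator(nn.Module):
    """Stage 1 (sketch): map a pose/shape/appearance code to a semantic
    segmentation of body and clothing regions."""
    def __init__(self, cond_dim=128, num_classes=12, size=64):
        super().__init__()
        self.num_classes, self.size = num_classes, size
        self.net = nn.Sequential(
            nn.Linear(cond_dim, 256), nn.ReLU(),
            nn.Linear(256, num_classes * size * size),
        )

    def forward(self, cond):
        logits = self.net(cond).view(-1, self.num_classes, self.size, self.size)
        return torch.softmax(logits, dim=1)   # per-pixel class probabilities

class ImageGenerator(nn.Module):
    """Stage 2 (sketch): translate the segmentation into an RGB image."""
    def __init__(self, num_classes=12):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(num_classes, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, seg):
        return self.net(seg)

# End-to-end: both stages are differentiable, so gradients flow from the
# rendered image all the way back to the conditioning input.
cond = torch.randn(4, 128)                 # pose/shape/appearance code
segmentation = SegmentGenerator()(cond)    # (4, 12, 64, 64)
image = ImageGenerator()(segmentation)     # (4, 3, 64, 64) RGB samples
```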


Citations
Journal ArticleDOI

Deep video portraits

TL;DR: In this paper, a generative neural network with a novel space-time architecture is proposed to transfer the full 3D head position, head rotation, face expression, eye gaze, and eye blinking from a source actor to a portrait video of a target actor.
Proceedings ArticleDOI

Everybody Dance Now

TL;DR: This paper presents a simple method for “do as I do” motion transfer: given a source video of a person dancing, that performance can be transferred to a novel (amateur) target after only a few minutes of footage of the target subject performing standard moves.
Proceedings Article

Pose Guided Person Image Generation

TL;DR: A pose guided person generation network (PG²) is proposed that can synthesize person images in arbitrary poses, based on an image of that person and a novel pose.
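The core conditioning idea, reduced to a sketch (names and shapes below are illustrative, not the PG² architecture): concatenate the reference image with target-pose keypoint heatmaps along the channel dimension and decode an image of the same person in the new pose.

```python
import torch
import torch.nn as nn

class PoseConditionedGenerator(nn.Module):
    """Minimal sketch of pose-conditioned person synthesis: reference
    appearance and target-pose heatmaps are fused channel-wise and
    decoded into a new image of that person in the target pose."""
    def __init__(self, pose_channels=18):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3 + pose_channels, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, ref_image, pose_heatmaps):
        return self.net(torch.cat([ref_image, pose_heatmaps], dim=1))

ref = torch.randn(1, 3, 128, 64)      # reference appearance image
pose = torch.randn(1, 18, 128, 64)    # target-pose keypoint heatmaps
out = PoseConditionedGenerator()(ref, pose)   # (1, 3, 128, 64)
```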
Proceedings ArticleDOI

Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation

TL;DR: Neural Body Fitting (NBF) integrates a statistical body model as a layer within a CNN, leveraging both reliable bottom-up body part segmentation and robust top-down body model constraints.
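A hedged sketch of the "body model as a layer" idea: a CNN regresses body-model parameters from an image, and a differentiable layer converts them into 3D joints so that joint losses backpropagate through the whole stack. The linear placeholder below only stands in for the real statistical body model, which is far richer.

```python
import torch
import torch.nn as nn

class BodyModelLayer(nn.Module):
    """Placeholder differentiable body model: pose/shape parameters are
    mapped to 3D joint positions via a fixed linear basis (illustrative
    only; the actual statistical body model is more complex)."""
    def __init__(self, param_dim=82, num_joints=24):
        super().__init__()
        self.basis = nn.Parameter(torch.randn(param_dim, num_joints * 3),
                                  requires_grad=False)
        self.num_joints = num_joints

    def forward(self, params):
        return (params @ self.basis).view(-1, self.num_joints, 3)

class NeuralBodyFitSketch(nn.Module):
    """A CNN regresses body-model parameters; the body-model layer turns
    them into joints, so a joint loss can train the network end to end."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, 82),
        )
        self.body_model = BodyModelLayer()

    def forward(self, image):
        return self.body_model(self.encoder(image))

joints3d = NeuralBodyFitSketch()(torch.randn(2, 3, 224, 224))  # (2, 24, 3)
```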
Proceedings ArticleDOI

Disentangled Person Image Generation

TL;DR: A novel two-stage reconstruction pipeline is proposed that learns a disentangled representation of the aforementioned image factors while generating novel person images. The model can manipulate the foreground, background and pose of the input image, and can also sample new embedding features to produce targeted manipulations that provide more control over the generation process.
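The disentanglement idea in sketch form (illustrative names and dimensions, not the paper's architecture): foreground, background and pose each receive their own embedding, and swapping or resampling one embedding manipulates only that factor in the generated image.

```python
import torch
import torch.nn as nn

class DisentangledPersonSketch(nn.Module):
    """Separate embeddings for foreground appearance, background and pose
    are concatenated and decoded into an image; changing one embedding
    changes only the corresponding factor."""
    def __init__(self, emb_dim=64):
        super().__init__()
        self.fg_enc = nn.Linear(256, emb_dim)
        self.bg_enc = nn.Linear(256, emb_dim)
        self.pose_enc = nn.Linear(36, emb_dim)   # e.g. 18 2-D keypoints
        self.decoder = nn.Sequential(
            nn.Linear(3 * emb_dim, 512), nn.ReLU(),
            nn.Linear(512, 3 * 32 * 32), nn.Tanh(),
        )

    def forward(self, fg_feat, bg_feat, pose):
        z = torch.cat([self.fg_enc(fg_feat),
                       self.bg_enc(bg_feat),
                       self.pose_enc(pose)], dim=-1)
        return self.decoder(z).view(-1, 3, 32, 32)

# Resampling only the pose input changes the pose while the appearance
# and background embeddings stay fixed.
img = DisentangledPersonSketch()(torch.randn(1, 256),
                                 torch.randn(1, 256),
                                 torch.randn(1, 36))
```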
References
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
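For reference, the Adam update itself is compact. The NumPy sketch below follows the published update rule; the learning rate and decay constants are the commonly used defaults.

```python
import numpy as np

def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update: adaptive estimates of the first moment (m) and
    second raw moment (v) of the gradient, with bias correction."""
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad**2
    m_hat = m / (1 - beta1**t)          # bias-corrected first moment
    v_hat = v / (1 - beta2**t)          # bias-corrected second moment
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v

# Toy example: minimize f(x) = x^2 with gradient 2x.
x, m, v = np.array(5.0), 0.0, 0.0
for t in range(1, 1001):
    x, m, v = adam_step(x, 2 * x, m, v, t)
```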
Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: Ronneberger et al. proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently; it can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
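A toy two-level U-Net in PyTorch illustrates the defining pattern: a contracting path and an expanding path joined by skip connections. Layer widths and depth here are arbitrary, far smaller than the published network.

```python
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(),
    )

class TinyUNet(nn.Module):
    """Two-level U-Net sketch: the skip connection concatenates encoder
    features with the upsampled decoder features before further convs."""
    def __init__(self, in_ch=1, num_classes=2):
        super().__init__()
        self.enc1 = conv_block(in_ch, 32)
        self.enc2 = conv_block(32, 64)
        self.pool = nn.MaxPool2d(2)
        self.up = nn.ConvTranspose2d(64, 32, 2, stride=2)
        self.dec1 = conv_block(64, 32)       # 64 = 32 (skip) + 32 (upsampled)
        self.head = nn.Conv2d(32, num_classes, 1)

    def forward(self, x):
        e1 = self.enc1(x)                    # full-resolution features
        e2 = self.enc2(self.pool(e1))        # half-resolution features
        d1 = self.dec1(torch.cat([self.up(e2), e1], dim=1))
        return self.head(d1)                 # per-pixel class logits

logits = TinyUNet()(torch.randn(1, 1, 64, 64))   # (1, 2, 64, 64)
```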
Journal ArticleDOI

Generative Adversarial Nets

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than from G.
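The adversarial game can be sketched in a few lines: a toy 1-D example in which G and D are tiny MLPs trained with opposing binary cross-entropy objectives (the non-saturating generator loss is used here). The data distribution and network sizes are placeholders.

```python
import torch
import torch.nn as nn

# G maps noise to samples; D scores samples as real vs. generated.
G = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))
D = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1))
bce = nn.BCEWithLogitsLoss()
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)

for step in range(200):
    real = torch.randn(64, 1) * 0.5 + 2.0      # "data": samples from N(2, 0.5)
    fake = G(torch.randn(64, 8))

    # Discriminator: push real toward 1, generated samples toward 0.
    d_loss = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator: fool the discriminator (non-saturating loss).
    g_loss = bce(D(fake), torch.ones(64, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```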
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
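The operation itself is simple to state: normalize each feature over the mini-batch, then rescale and shift with learned parameters. The sketch below covers only training-time behavior and omits the running statistics used at inference.

```python
import torch

def batch_norm(x, gamma, beta, eps=1e-5):
    """Normalize each feature over the mini-batch, then apply the learnable
    scale (gamma) and shift (beta)."""
    mean = x.mean(dim=0, keepdim=True)
    var = x.var(dim=0, unbiased=False, keepdim=True)
    x_hat = (x - mean) / torch.sqrt(var + eps)
    return gamma * x_hat + beta

x = torch.randn(128, 16)                       # a mini-batch of activations
out = batch_norm(x, gamma=torch.ones(16), beta=torch.zeros(16))
```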
Proceedings Article

Auto-Encoding Variational Bayes

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
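A minimal VAE sketch showing the two ingredients the TL;DR alludes to: the reparameterization trick that makes sampling differentiable, and the evidence lower bound (reconstruction term plus KL divergence). Dimensions and the single-layer architecture are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyVAE(nn.Module):
    """Encoder outputs mean/log-variance of q(z|x); a sample is drawn with
    the reparameterization trick; the decoder reconstructs x from z."""
    def __init__(self, x_dim=784, z_dim=20):
        super().__init__()
        self.enc = nn.Linear(x_dim, 2 * z_dim)
        self.dec = nn.Linear(z_dim, x_dim)

    def forward(self, x):
        mu, logvar = self.enc(x).chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # reparameterization
        return self.dec(z), mu, logvar

def negative_elbo(x, recon, mu, logvar):
    # Reconstruction error + KL(q(z|x) || N(0, I)).
    rec = F.binary_cross_entropy_with_logits(recon, x, reduction='sum')
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return rec + kl

x = torch.rand(32, 784)                        # e.g. flattened images in [0, 1]
recon, mu, logvar = TinyVAE()(x)
loss = negative_elbo(x, recon, mu, logvar)     # minimize to maximize the ELBO
```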