Skeleton-Aided Articulated Motion Generation

doi:10.1145/3123266.3123277

Open AccessProceedings ArticleDOI

Skeleton-Aided Articulated Motion Generation

Yichao Yan, +4 more

- pp 199-207

Chats0

TLDR

This work makes the first attempt to generate articulated human motion sequence from a single image by utilizing paired inputs including human skeleton information as motion embedding and a single human image as appearance reference to generate novel motion frames based on the conditional GAN infrastructure.

Abstract:

This work makes the first attempt to generate articulated human motion sequence from a single image. On one hand, we utilize paired inputs including human skeleton information as motion embedding and a single human image as appearance reference, to generate novel motion frames based on the conditional GAN infrastructure. On the other hand, a triplet loss is employed to pursue appearance smoothness between consecutive frames. As the proposed framework is capable of jointly exploiting the image appearance space and articulated/kinematic motion space, it generates realistic articulated motion sequence, in contrast to most previous video generation methods which yield blurred motion effects. We test our model on two human action datasets including KTH and Human3.6M, and the proposed framework generates very promising results on both datasets.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Pose Transferrable Person Re-identification

Jinxian Liu, +5 more

TL;DR: A pose-transferrable person ReID framework which utilizes posetransferred sample augmentations (i.e., with ID supervision) to enhance ReID model training, and achieves great performance improvement, and outperforms most state-of-the-art methods without elaborate designing the ReIDs.

...read moreread less

Proceedings ArticleDOI

Variational Convolutional Neural Network Pruning

Chenglong Zhao, +5 more

TL;DR: Variational technique is introduced to estimate distribution of a newly proposed parameter, called channel saliency, based on which redundant channels can be removed from model via a simple criterion, and results in significant size reduction and computation saving.

...read moreread less

Proceedings ArticleDOI

Towards Multi-Pose Guided Virtual Try-On Network

Haoye Dong, +7 more

TL;DR: Li et al. as mentioned in this paper proposed a multi-pose guided virtual try-on system, which enables clothes to transfer onto a person with diverse poses by using a conditional human parsing network to match both the desired pose and the desired clothes shape.

...read moreread less

Book ChapterDOI

Deep Video Generation, Prediction and Completion of Human Action Sequences

Haoye Cai, +3 more

TL;DR: In this paper, a two-stage framework is proposed to generate human action videos with no constraints or arbitrary number of constraints, which uniformly addresses the three problems: video generation given no input frames, video prediction given the first few frames, and video completion given the last and last frames.

...read moreread less

Proceedings ArticleDOI

Deep Kinematics Analysis for Monocular 3D Human Pose Estimation

Jingwei Xu, +5 more

TL;DR: It is shown that optimizing the kinematics structure of noisy 2D inputs is critical to obtain accurate 3D estimations and targeted ablation study shows that each former step is critical for the latter one to obtain promising results.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Posted Content

Image-to-Image Translation with Conditional Adversarial Networks

Phillip Isola, +3 more

- 21 Nov 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Conditional Adversarial Network (CA) as discussed by the authors is a general-purpose solution to image-to-image translation problems, which can be used to synthesize photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.

...read moreread less

Posted Content

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Martín Abadi, +39 more

- 01 Jan 2015 -

arXiv: Distributed, Parallel, and Cluste...

TL;DR: The TensorFlow interface and an implementation of that interface that is built at Google are described, which has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields.

...read moreread less

Collapse

Skeleton-Aided Articulated Motion Generation

Citations

Pose Transferrable Person Re-identification

Variational Convolutional Neural Network Pruning

Towards Multi-Pose Guided Virtual Try-On Network

Deep Video Generation, Prediction and Completion of Human Action Sequences

Deep Kinematics Analysis for Monocular 3D Human Pose Estimation

References

Long short-term memory

U-Net: Convolutional Networks for Biomedical Image Segmentation

Generative Adversarial Nets

Image-to-Image Translation with Conditional Adversarial Networks

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Related Papers (5)

Generative Adversarial Nets

Image-to-Image Translation with Conditional Adversarial Networks

Conditional Generative Adversarial Nets

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

U-Net: Convolutional Networks for Biomedical Image Segmentation