Efficient Human Pose Estimation from Single Depth Images

doi:10.1109/TPAMI.2012.241

Journal Article

Jamie Shotton, +10 more

- 01 Dec 2013 -

IEEE Transactions on Pattern Analysis an...

- Vol. 35, Iss: 12, pp 2821-2840

Abstract:

We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, realistic, and highly varied synthetic set of training images. This allows us to learn models that are largely invariant to factors such as pose, body shape, field-of-view cropping, and clothing. Our first approach employs an intermediate body parts representation, designed so that an accurate per-pixel classification of the parts will localize the joints of the body. The second approach instead directly regresses the positions of body joints. By using simple depth pixel comparison features and parallelizable decision forests, both approaches can run super-real time on consumer hardware. Our evaluation investigates many aspects of our methods, and compares the approaches to each other and to the state of the art. Results on silhouettes suggest broader applicability to other imaging modalities.
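
The "simple depth pixel comparison features" mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly described form in which two probe offsets are scaled by the inverse depth at the reference pixel and the two probed depths are subtracted. The function names, the (row, column) offset convention, and the background constant are hypothetical choices for illustration, not taken from this page.

```python
import numpy as np

# Assumed large value returned for probes that fall outside the image.
BACKGROUND_DEPTH = 1e6


def probe_depth(depth, row, col):
    """Return the depth at (row, col), or BACKGROUND_DEPTH if out of bounds."""
    height, width = depth.shape
    if 0 <= row < height and 0 <= col < width:
        return float(depth[row, col])
    return BACKGROUND_DEPTH


def depth_comparison_feature(depth, pixel, u, v):
    """Difference of two depth probes around `pixel`.

    `u` and `v` are (row, col) offsets. Both are divided by the depth at
    `pixel`, so the probes cover a smaller pixel area for distant surfaces.
    """
    row, col = pixel
    d = float(depth[row, col])
    r1 = row + int(round(u[0] / d))
    c1 = col + int(round(u[1] / d))
    r2 = row + int(round(v[0] / d))
    c2 = col + int(round(v[1] / d))
    return probe_depth(depth, r1, c1) - probe_depth(depth, r2, c2)
```

In a decision forest, each split node would compare one such feature value against a learned threshold; because every pixel is evaluated independently, the computation parallelizes naturally.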

Citations

Proceedings Article

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

Proceedings Article

Learning from Simulated and Unsupervised Images through Adversarial Training

Ashish Shrivastava, +5 more

TL;DR: SimGAN uses an adversarial network similar to Generative Adversarial Networks (GANs), but with synthetic images as inputs instead of random vectors, and achieves state-of-the-art results on the MPIIGaze dataset without any labeled real data.

Proceedings Article

SUN RGB-D: A RGB-D scene understanding benchmark suite

Shuran Song, +2 more

TL;DR: This paper introduces an RGB-D benchmark suite aimed at advancing the state of the art in all major scene understanding tasks, and presents a dataset that makes it possible to train data-hungry algorithms for these tasks, evaluate them using meaningful 3D metrics, avoid overfitting to a small test set, and study cross-sensor bias.

Book Chapter

Playing for Data: Ground Truth from Computer Games

Stephan R. Richter, +3 more

TL;DR: In this paper, the authors present an approach to rapidly create pixel-accurate semantic label maps for images extracted from modern computer games, which enables rapid propagation of semantic labels within and across images synthesized by the game, without access to the source code or the content.

Journal Article

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields

Fayao Liu, +3 more

- 01 Oct 2016 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents a deep convolutional neural field model for estimating depth from single monocular images that jointly exploits the capacity of deep CNNs and continuous CRFs, and proposes a deep structured learning scheme that learns the unary and pairwise potentials of the continuous CRF in a unified deep CNN framework.

References

Journal Article

Random Forests

Leo Breiman

TL;DR: Internal estimates monitor error, strength, and correlation; these are used to show the response to increasing the number of features used in the forest, and the same ideas are also applicable to regression.

Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Trevor Hastie, +2 more

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.

Journal Article

Induction of Decision Trees

J. R. Quinlan

- 25 Mar 1986 -

Machine Learning

TL;DR: This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, describes one such system, ID3, in detail, and discusses a reported shortcoming of the basic algorithm.

Journal Article

Mean shift: a robust approach toward feature space analysis

Dorin Comaniciu, +1 more

- 01 May 2002 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The paper proves the convergence of a recursive mean shift procedure to the nearest stationary point of the underlying density function and, thus, its utility in detecting the modes of the density.

Journal Article

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

David Ruppert

- 01 Jun 2004 -

Journal of the American Statistical Asso...

TL;DR: The Elements of Statistical Learning: Data Mining, Inference, and Prediction is a popular book on data mining and machine learning, focusing on data mining, inference, and prediction.

Related Papers (5)

Real-time human pose recognition in parts from single depth images

Random Forests

DeepPose: Human Pose Estimation via Deep Neural Networks

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition