2D Human Pose Estimation: New Benchmark and State of the Art Analysis

doi:10.1109/CVPR.2014.471

Proceedings ArticleDOI

2D Human Pose Estimation: New Benchmark and State of the Art Analysis

Mykhaylo Andriluka, +3 more

- pp 3686-3693

Chats0

TLDR

A novel benchmark "MPII Human Pose" is introduced that makes a significant advance in terms of diversity and difficulty, a contribution that is required for future developments in human body models.

Abstract:

Human pose estimation has made significant progress during the last years. However current datasets are limited in their coverage of the overall pose estimation challenges. Still these serve as the common sources to evaluate, train and compare different models on. In this paper we introduce a novel benchmark "MPII Human Pose" that makes a significant advance in terms of diversity and difficulty, a contribution that we feel is required for future developments in human body models. This comprehensive dataset was collected using an established taxonomy of over 800 human activities [1]. The collected images cover a wider variety of human activities than previous datasets including various recreational, occupational and householding activities, and capture people from a wider range of viewpoints. We provide a rich set of labels including positions of body joints, full 3D torso and head orientation, occlusion labels for joints and body parts, and activity labels. For each image we provide adjacent video frames to facilitate the use of motion information. Given these rich annotations we perform a detailed analysis of leading human pose estimation approaches and gaining insights for the success and failures of these methods.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Mask R-CNN

Kaiming He, +3 more

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.

...read moreread less

Proceedings Article

Mask R-CNN

Kaiming He, +3 more

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.

...read moreread less

Proceedings ArticleDOI

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Zhe Cao, +3 more

TL;DR: Part Affinity Fields (PAFs) as discussed by the authors uses a nonparametric representation to learn to associate body parts with individuals in the image and achieves state-of-the-art performance on the MPII Multi-Person benchmark.

...read moreread less

Book ChapterDOI

Stacked Hourglass Networks for Human Pose Estimation

Alejandro Newell, +2 more

TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.

...read moreread less

Posted Content

Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

Zhe Cao, +3 more

- 24 Nov 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents an approach to efficiently detect the 2D pose of multiple people in an image using a nonparametric representation, which it refers to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

Mark Everingham, +4 more

- 01 Jun 2010 -

International Journal of Computer Vision

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.

...read moreread less

Journal ArticleDOI

2011 Compendium of Physical Activities: a second update of codes and MET values.

Barbara E. Ainsworth, +9 more

- 01 Aug 2011 -

Medicine and Science in Sports and Exerc...

TL;DR: The 2011 Compendium is an update of a system for quantifying the energy cost of adult human PA and is a living document that is moving in the direction of being 100% evidence based.

...read moreread less

Journal ArticleDOI

Pictorial Structures for Object Recognition

Pedro F. Felzenszwalb, +1 more

- 01 Jan 2005 -

International Journal of Computer Vision

TL;DR: A computationally efficient framework for part-based modeling and recognition of objects, motivated by the pictorial structure models introduced by Fischler and Elschlager, that allows for qualitative descriptions of visual appearance and is suitable for generic recognition problems.

...read moreread less

Journal ArticleDOI

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

Catalin Ionescu, +3 more

- 01 Jul 2014 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, is introduced for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and algorithms.

...read moreread less

Journal ArticleDOI

HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion

Leonid Sigal, +2 more

- 01 Mar 2010 -

International Journal of Computer Vision

TL;DR: A baseline algorithm for 3D articulated tracking that uses a relatively standard Bayesian framework with optimization in the form of Sequential Importance Resampling and Annealed Particle Filtering is described, and a variety of likelihood functions, prior models of human motion and the effects of algorithm parameters are explored.

...read moreread less

2D Human Pose Estimation: New Benchmark and State of the Art Analysis

Citations

Mask R-CNN

Mask R-CNN

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Stacked Hourglass Networks for Human Pose Estimation

Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

References

The Pascal Visual Object Classes (VOC) Challenge

2011 Compendium of Physical Activities: a second update of codes and MET values.

Pictorial Structures for Object Recognition

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion

Related Papers (5)

Stacked Hourglass Networks for Human Pose Estimation

Deep Residual Learning for Image Recognition

Microsoft COCO: Common Objects in Context

Convolutional Pose Machines

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields