PoseTrack: Joint Multi-person Pose Estimation and Tracking

doi:10.1109/CVPR.2017.495

Open Access Proceedings Article

- pp 4654-4663

TLDR

This work proposes a novel method that jointly models multi-person pose estimation and tracking in a single formulation. It also introduces the challenging Multi-Person PoseTrack dataset and a completely unconstrained evaluation protocol that makes no assumptions about the scale, size, location, or number of persons.

Abstract:

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied directly to this problem, since it requires solving the problem of person association over time in addition to estimating the pose of each person. We therefore propose a novel method that jointly models multi-person pose estimation and tracking in a single formulation. To this end, we represent body joint detections in a video by a spatio-temporal graph and solve an integer linear program to partition the graph into sub-graphs that correspond to plausible body pose trajectories for each person. The proposed approach implicitly handles occlusion and truncation of persons. Since the problem has not been addressed quantitatively in the literature, we introduce a challenging Multi-Person PoseTrack dataset and propose a completely unconstrained evaluation protocol that does not make any assumptions about the scale, size, location, or number of persons. Finally, we evaluate the proposed approach and several baseline methods on our new dataset.
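
The graph-partitioning idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the detections, the hand-written `edge_cost` function, and the pixel thresholds are all hypothetical, and brute-force enumeration of partitions stands in for the integer linear program (enumerating only valid partitions plays the role of the ILP's consistency constraints). The paper's actual costs and any detection-selection variables are not modelled here.

```python
import math
from itertools import combinations

# Hypothetical joint detections: (frame, joint_type, x, y).
# The position in the list serves as the node id in the spatio-temporal graph.
DETECTIONS = [
    (0, "head", 10.0, 10.0),
    (0, "head", 50.0, 12.0),
    (1, "head", 11.0, 10.5),
    (1, "head", 49.0, 12.5),
    (0, "neck", 10.0, 20.0),
    (0, "neck", 50.0, 22.0),
]


def edge_cost(a, b):
    """Toy cost of putting two detections in the same person (negative = attractive)."""
    (fa, ta, xa, ya), (fb, tb, xb, yb) = a, b
    dist = math.hypot(xa - xb, ya - yb)
    if ta == tb:
        # Same joint type, within a frame or across frames: nearby detections attract.
        return dist - 5.0
    if fa == fb:
        # Spatial edge between different joint types: reward an assumed ~10 px limb length.
        return abs(dist - 10.0) - 5.0
    return 0.0  # no edge between different joint types in different frames


def set_partitions(items):
    """Yield every partition of `items` into non-empty blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        yield [[first]] + partition  # `first` starts its own block
        for i in range(len(partition)):  # or joins an existing block
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]


def partition_cost(partition, detections):
    """Sum the costs of all edges whose endpoints share a block."""
    return sum(
        edge_cost(detections[i], detections[j])
        for block in partition
        for i, j in combinations(block, 2)
    )


def best_partition(detections):
    """Exhaustively find the minimum-cost partition (feasible only for tiny graphs)."""
    nodes = list(range(len(detections)))
    best = min(set_partitions(nodes), key=lambda p: partition_cost(p, detections))
    return sorted(sorted(block) for block in best)


tracks = best_partition(DETECTIONS)
print(tracks)  # [[0, 2, 4], [1, 3, 5]]
```

Each resulting block groups the head and neck of one person in frame 0 with that person's head in frame 1, so a single partition yields both the per-frame pose and the association over time. Exhaustive search grows with the Bell number of the node count, which is why a real system would need an ILP solver or a heuristic instead.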

Citations

Proceedings Article

Deep High-Resolution Representation Learning for Human Pose Estimation

Ke Sun, +3 more

TL;DR: This paper proposes a network that maintains high-resolution representations through the whole process of human pose estimation and empirically demonstrates the effectiveness of the network through the superior pose estimation results over two benchmark datasets: the COCO keypoint detection dataset and the MPII Human Pose dataset.

Journal Article

DeepPoseKit, a software toolkit for fast and robust animal pose estimation using deep learning

Jacob M. Graving, +10 more

- 01 Oct 2019 -

eLife

TL;DR: A new easy-to-use software toolkit, DeepPoseKit, is introduced that addresses animal pose estimation problems using an efficient multi-scale deep-learning model, called Stacked DenseNet, and a fast GPU-based peak-detection algorithm for estimating keypoint locations with subpixel precision.

Journal Article

Survey on Emotional Body Gesture Recognition

Fatemeh Noroozi, +5 more

- 01 Apr 2021 -

IEEE Transactions on Affective Computing

TL;DR: In this paper, the authors present a comprehensive survey of body gesture recognition methods, discuss multi-modal approaches that combine speech or face with body gestures for improved emotion recognition, and define a complete framework for automatic emotional body gesture recognition.

Journal Article

Monocular human pose estimation: A survey of deep learning-based methods

Yucheng Chen, +2 more

- 01 Mar 2020 -

Computer Vision and Image Understanding

TL;DR: This survey extensively reviews the deep learning-based 2D and 3D human pose estimation methods published since 2014, summarizes the challenges, main frameworks, benchmark datasets, evaluation metrics, and performance comparisons, and discusses some promising future research directions.

Proceedings Article

ArtTrack: Articulated Multi-Person Tracking in the Wild

Eldar Insafutdinov, +6 more

TL;DR: In this article, the authors propose an approach for articulated tracking of multiple people in unconstrained videos, which is based on a model that resembles existing architectures for single-frame pose estimation but is substantially faster.

References

Book Chapter

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset that aims to advance the state of the art in object recognition by placing it in the context of the broader question of scene understanding, gathering images of complex everyday scenes containing common objects in their natural context.

Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 04 Jun 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Faster R-CNN introduces a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-CNN for detection.

Proceedings Article

Faster R-CNN: towards real-time object detection with region proposal networks

Shaoqing Ren, +3 more

TL;DR: Ren et al. propose a region proposal network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals.

Book Chapter

Stacked Hourglass Networks for Human Pose Estimation

Alejandro Newell, +2 more

TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.

Proceedings Article

Convolutional Pose Machines

Shih-En Wei, +3 more

TL;DR: In this paper, a convolutional network is incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation, which can implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation.

Related Papers (5)

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Microsoft COCO: Common Objects in Context

Simple Baselines for Human Pose Estimation and Tracking

Stacked Hourglass Networks for Human Pose Estimation

2D Human Pose Estimation: New Benchmark and State of the Art Analysis