Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression

doi:10.1109/CVPR46437.2021.01444

Open Access, Proceedings Article

Zigang Geng, +4 more

- pp 14676-14686

TLDR

The authors propose disentangled keypoint regression (DEKR), a method that adopts adaptive convolutions through a pixel-wise spatial transformer to activate the pixels in the keypoint regions and learn representations from them.

Abstract:

In this paper, we are interested in the bottom-up paradigm of estimating human poses from an image. We study the dense keypoint regression framework, which was previously inferior to the keypoint detection and grouping framework. Our motivation is that regressing keypoint positions accurately requires learning representations that focus on the keypoint regions. We present a simple yet effective approach, named disentangled keypoint regression (DEKR). We adopt adaptive convolutions through a pixel-wise spatial transformer to activate the pixels in the keypoint regions and accordingly learn representations from them. We use a multi-branch structure for separate regression: each branch learns a representation with dedicated adaptive convolutions and regresses one keypoint. The resulting disentangled representations are able to attend to their respective keypoint regions, and thus the keypoint regression is spatially more accurate. We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods and achieves superior bottom-up pose estimation results on two benchmark datasets, COCO and CrowdPose. The code and models are available at https://github.com/HRNet/DEKR.
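The multi-branch, one-keypoint-per-branch idea in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not say how channels are assigned to branches or how regressed values map to positions, so the equal channel split, the (dx, dy) offset parametrization relative to a reference pixel, and the helper names `split_channels` and `decode_pose` are all assumptions made for illustration. The adaptive convolutions themselves are omitted.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def split_channels(num_channels: int, num_keypoints: int) -> List[range]:
    """Partition channel indices into one equal group per keypoint branch.

    Assumption: each branch gets a disjoint, equally sized slice of channels.
    """
    if num_channels % num_keypoints != 0:
        raise ValueError("channels must divide evenly across branches")
    width = num_channels // num_keypoints
    return [range(k * width, (k + 1) * width) for k in range(num_keypoints)]


def decode_pose(center: Point, offsets: List[Point]) -> List[Point]:
    """Convert per-keypoint (dx, dy) offsets into absolute positions.

    Assumption: every branch regresses an offset relative to the same
    reference pixel `center`, so keypoint k sits at center + offsets[k].
    """
    cx, cy = center
    return [(cx + dx, cy + dy) for dx, dy in offsets]


# Hypothetical toy setup: 3 keypoints sharing 6 feature channels.
groups = split_channels(6, 3)

# Hypothetical offsets produced by the three branches at pixel (10, 20).
pose = decode_pose((10.0, 20.0), [(-2.0, -5.0), (0.0, 0.0), (3.5, 4.0)])
```

Here `groups` assigns channels 0-1, 2-3, and 4-5 to the three branches, and `pose` holds one absolute (x, y) position per keypoint. Keeping the branches separate means that each keypoint's regression depends only on its own slice of the representation.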

Citations

Journal Article

Revealing the Dark Secrets of Masked Image Modeling

Zhenda Xie, +5 more

- 26 May 2022 -

arXiv.org

TL;DR: This paper compares MIM with the long-dominant supervised pre-trained models from two perspectives, visualizations and experiments, to uncover their key representational differences, and finds that MIM models can perform significantly better than their supervised counterparts on geometric and motion tasks with weak semantics or on fine-grained classification tasks.

Proceedings Article

End-to-End Multi-Person Pose Estimation with Transformers

Dahu Shi, +4 more

TL;DR: The proposed PETR method views pose estimation as a hierarchical set prediction problem, removing the need for many hand-crafted modules such as RoI cropping, NMS, and grouping post-processing. It largely overcomes the feature misalignment difficulty in pose estimation and improves performance considerably.

Proceedings Article

Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

Yihan Wang, +4 more

TL;DR: This paper designs LitePose, an efficient single-branch architecture for pose estimation, and introduces two simple approaches to enhance its capacity: a fusion deconv head and large kernel conv.

Proceedings Article

Fast and Flexible Human Pose Estimation with HyperPose

Yixiao Guo, +4 more

- 26 Aug 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: HyperPose provides expressive Python APIs that let developers easily customise pose estimation algorithms for their applications, along with a model inference engine highly optimized for real-time pose estimation.

Proceedings ArticleDOI

NIMBLE: A Non-rigid Hand Model with Bones and Muscles

Yuwei Li, +8 more

- 09 Feb 2022 -

ACM Transactions on Graphics

TL;DR: NIMBLE is a novel parametric hand model that includes the missing key components, bringing 3D hand models to a new level of realism. By enforcing the inner bones and muscles to match anatomic and kinematic rules, NIMBLE can animate 3D hands to new poses at unprecedented realism.
