Modeling sense disambiguation of human pose: recognizing action at a distance by key poses

doi:10.1007/978-3-642-19315-6_19

Book Chapter

- pp 244-255

TLDR

The chapter proposes a methodology for recognizing actions at a distance by observing human poses and deriving descriptors that capture their motion patterns, and shows the efficacy of the approach compared with the state of the art.

Abstract:

We propose a methodology for recognizing actions at a distance by observing human poses and deriving descriptors that capture their motion patterns. Human poses often carry a strong visual sense (intended meaning) that describes the related action unambiguously. Identifying the intended meaning of a pose is challenging, however, because poses vary widely, and this variation leads to visual sense ambiguity. From a large vocabulary of poses (visual words), we prune the ambiguous poses and extract key poses (or key words) using a centrality measure of graph connectivity [1]. Under this framework, finding the key poses for a given sense (i.e., action type) amounts to constructing a graph with poses as vertices and then identifying the most "important" vertices in the graph (following centrality theory). Results on four standard activity recognition datasets show the efficacy of our approach compared with the state of the art.
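
The graph construction described in the abstract can be illustrated with a minimal sketch. The abstract does not say which centrality measure, pose descriptor, or edge weighting the authors use, so everything concrete here is an assumption: poses are toy descriptor vectors, edges are weighted by cosine similarity, and centrality is a PageRank-style power iteration. The function names (`build_pose_graph`, `pagerank`, `select_key_poses`) are hypothetical, and the pruning of poses that are ambiguous across action types is not modeled.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two descriptor vectors, clamped to be non-negative."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return max(0.0, dot / norm) if norm else 0.0


def build_pose_graph(poses):
    """Weighted adjacency matrix: one vertex per pose, no self-loops.

    Assumption: edge weight = cosine similarity of the pose descriptors.
    """
    n = len(poses)
    return [
        [cosine_similarity(poses[i], poses[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]


def pagerank(weights, damping=0.85, iterations=100):
    """PageRank-style centrality by power iteration on a weighted graph.

    Assumption: a stand-in for the centrality measure of graph connectivity
    cited in the abstract. Vertices with no outgoing weight simply pass on
    no score in this sketch.
    """
    n = len(weights)
    out_strength = [sum(row) for row in weights]
    rank = [1.0 / n] * n
    for _ in range(iterations):
        new_rank = []
        for j in range(n):
            incoming = sum(
                rank[i] * weights[i][j] / out_strength[i]
                for i in range(n)
                if out_strength[i] > 0
            )
            new_rank.append((1.0 - damping) / n + damping * incoming)
        rank = new_rank
    return rank


def select_key_poses(poses, top_k):
    """Indices of the top_k most central poses, most central first."""
    scores = pagerank(build_pose_graph(poses))
    order = sorted(range(len(poses)), key=lambda i: scores[i], reverse=True)
    return order[:top_k]


if __name__ == "__main__":
    # Hypothetical descriptors for one action type: three similar poses and one outlier.
    toy_poses = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.2], [0.0, 1.0]]
    print(select_key_poses(toy_poses, top_k=2))
```

In this toy setting the outlier pose is weakly connected to the rest of the graph and receives the lowest score, so it is not selected as a key pose. A real system would build one such graph per action type from descriptors extracted from video.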

Citations

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Covers probability distributions and linear models for regression and classification, along with a discussion of combining models, in the context of machine learning.

Journal Article

Selecting Key Poses on Manifold for Pairwise Action Recognition

Xianbin Cao, +3 more

- 01 Feb 2012 -

IEEE Transactions on Industrial Informat...

TL;DR: A novel approach to key pose selection is proposed: it models the descriptor space with a manifold learning technique to recover the geometric structure of the descriptors on a lower-dimensional manifold, and develops a PageRank-based centrality measure.

Journal Article

Recognizing Human Action at a Distance in Video by Key Poses

Snehasis Mukherjee, +2 more

- 05 Apr 2011 -

IEEE Transactions on Circuits and System...

TL;DR: A graph-theoretic technique recognizes human actions at a distance in a video by modeling the visual senses associated with poses, and introduces a “meaningful” threshold on the centrality measure that selects key poses for each action type.

Proceedings Article

Recognizing interaction between human performers using 'key pose doublet'

Snehasis Mukherjee, +2 more

TL;DR: A graph-theoretic approach recognizes interactions between two human performers in a video clip; it applies the same centrality measure to all possible combinations of the two performers' key poses to select the set of 'key pose doublets' that best represent the corresponding action.

Journal Article

Region-based Mixture Models for human action recognition in low-resolution videos

Ying Zhao, +5 more

- 19 Jul 2017 -

Neurocomputing

TL;DR: The Layered Elastic Motion Tracking (LEMT) method is adopted, a hybrid feature representation is presented that integrates shape and motion features, and a Region-based Mixture Model (RMM) is proposed for action classification.

References

Proceedings Article

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

Juan Carlos Niebles, +2 more

TL;DR: The approach is not only able to classify different actions, but also to localize different actions simultaneously in a novel and complex video sequence.

Proceedings Article

Recognizing realistic actions from videos

Jingen Liu, +2 more

TL;DR: This paper presents a systematic framework for recognizing realistic actions from videos “in the wild”. It uses motion statistics to acquire stable motion features and clean static features, and uses PageRank to mine the most informative static features.

Journal Article

Visual Word Ambiguity

Jan C. van Gemert, +3 more

- 01 Jul 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: It is demonstrated that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model, and the proposed model performs consistently.

Proceedings Article

A Hierarchical Model of Shape and Appearance for Human Action Classification

Juan Carlos Niebles, +1 more

TL;DR: A hierarchical model is proposed that can be characterized as a constellation of bags-of-features and that combines both spatial and spatial-temporal features; it is shown to improve classification performance over bag-of-features models.

Proceedings Article

Single View Human Action Recognition using Key Pose Matching and Viterbi Path Searching

Fengjun Lv, +1 more

TL;DR: Each action is modeled as a series of synthetic 2D human poses rendered from a wide range of viewpoints, and the constraints on transitions between the synthetic poses are represented by a graph model called Action Net.

Related Papers (5)

Recognizing Human Actions by Their Pose

Recognizing Human Actions Using Key Poses

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

Recognizing human actions from still images with latent poses

Pose-Based Two-Stream Relational Networks for Action Recognition in Videos