scispace - formally typeset
Book ChapterDOI

Modeling sense disambiguation of human pose: recognizing action at a distance by key poses

TLDR
A methodology for recognizing actions at a distance by watching the human poses and deriving descriptors that capture the motion patterns of the poses and shows the efficacy of this approach when compared to the present state of the art.
Abstract
We propose a methodology for recognizing actions at a distance by watching the human poses and deriving descriptors that capture the motion patterns of the poses. Human poses often carry a strong visual sense (intended meaning) which describes the related action unambiguously. But identifying the intended meaning of poses is a challenging task because of their variability and such variations in poses lead to visual sense ambiguity. From a large vocabulary of poses (visual words) we prune out ambiguous poses and extract key poses (or key words) using centrality measure of graph connectivity [1]. Under this framework, finding the key poses for a given sense (i.e., action type) amounts to constructing a graph with poses as vertices and then identifying the most "important" vertices in the graph (following centrality theory). The results on four standard activity recognition datasets show the efficacy of our approach when compared to the present state of the art.

read more

Citations
More filters

Pattern Recognition and Machine Learning

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Journal ArticleDOI

Selecting Key Poses on Manifold for Pairwise Action Recognition

TL;DR: A novel approach for key poses selection is proposed, which models the descriptor space utilizing a manifold learning technique to recover the geometric structure of the descriptors on a lower dimensional manifold and develops a PageRank-based centrality measure.
Journal ArticleDOI

Recognizing Human Action at a Distance in Video by Key Poses

TL;DR: A graph theoretic technique for recognizing human actions at a distance in a video by modeling the visual senses associated with poses and introduces a “meaningful” threshold on centrality measure that selects key poses for each action type.
Proceedings ArticleDOI

Recognizing interaction between human performers using 'key pose doublet'

TL;DR: A graph theoretic approach for recognizing interactions between two human performers present in a video clip and applies the same centrality measure on all possible combinations of the key poses of the two performers to select the set of 'key pose doublets' that best represent the corresponding action.
Journal ArticleDOI

Region-based Mixture Models for human action recognition in low-resolution videos

TL;DR: The Layered Elastic Motion Tracking (LEMT) method is adopted, a hybrid feature representation is presented to integrate both of the shape and motion features, and a Region-based Mixture Model (RMM) is proposed to be utilized for action classification.
References
More filters
Proceedings ArticleDOI

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words.

TL;DR: The approach is not only able to classify different actions, but also to localize different actions simultaneously in a novel and complex video sequence.
Proceedings ArticleDOI

Recognizing realistic actions from videos .

TL;DR: This paper presents a systematic framework for recognizing realistic actions from videos “in the wild”, and uses motion statistics to acquire stable motion features and clean static features, and PageRank is used to mine the most informative static features.
Journal ArticleDOI

Visual Word Ambiguity

TL;DR: It is demonstrated that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model, and the proposed model performs consistently.
Proceedings ArticleDOI

A Hierarchical Model of Shape and Appearance for Human Action Classification

TL;DR: A hierarchical model that can be characterized as a constellation of bags-of-features and that is able to combine both spatial and spatial-temporal features is proposed and shown to improve the classification performance over bag of feature models.
Proceedings ArticleDOI

Single View Human Action Recognition using Key Pose Matching and Viterbi Path Searching

TL;DR: Each action is modeled as a series of synthetic 2D human poses rendered from a wide range of viewpoints and the constraints on transition of the synthetic poses is represented by a graph model called Action Net.
Related Papers (5)