View invariant human action recognition using histograms of 3D joints

doi:10.1109/CVPRW.2012.6239233

Proceedings ArticleDOI

View invariant human action recognition using histograms of 3D joints

Lu Xia, +2 more

- pp 20-27

Chats0

TLDR

This paper presents a novel approach for human action recognition with histograms of 3D joint locations (HOJ3D) as a compact representation of postures and achieves superior results on the challenging 3D action dataset.

Abstract:

In this paper, we present a novel approach for human action recognition with histograms of 3D joint locations (HOJ3D) as a compact representation of postures. We extract the 3D skeletal joint locations from Kinect depth maps using Shotton et al.'s method [6]. The HOJ3D computed from the action depth sequences are reprojected using LDA and then clustered into k posture visual words, which represent the prototypical poses of actions. The temporal evolutions of those visual words are modeled by discrete hidden Markov models (HMMs). In addition, due to the design of our spherical coordinate system and the robust 3D skeleton estimation from Kinect, our method demonstrates significant view invariance on our 3D action dataset. Our dataset is composed of 200 3D sequences of 10 indoor activities performed by 10 individuals in varied views. Our method is real-time and achieves superior results on the challenging 3D action dataset. We also tested our algorithm on the MSR Action 3D dataset and our algorithm outperforms Li et al. [25] on most of the cases.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Hierarchical recurrent neural network for skeleton based action recognition

Yong Du, +2 more

TL;DR: This paper proposes an end-to-end hierarchical RNN for skeleton based action recognition, and demonstrates that the model achieves the state-of-the-art performance with high computational efficiency.

...read moreread less

Journal ArticleDOI

Enhanced Computer Vision With Microsoft Kinect Sensor: A Review

Jungong Han, +3 more

- 25 Jun 2013 -

IEEE Transactions on Systems, Man, and C...

TL;DR: A comprehensive review of recent Kinect-based computer vision algorithms and applications covering topics including preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping.

...read moreread less

Proceedings ArticleDOI

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

Raviteja Vemulapalli, +2 more

TL;DR: A new skeletal representation that explicitly models the 3D geometric relationships between various body parts using rotations and translations in 3D space is proposed and outperforms various state-of-the-art skeleton-based human action recognition approaches.

...read moreread less

Book ChapterDOI

Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition

Jun Liu, +3 more

TL;DR: This paper introduces new gating mechanism within LSTM to learn the reliability of the sequential input data and accordingly adjust its effect on updating the long-term context information stored in the memory cell, and proposes a more powerful tree-structure based traversal method.

...read moreread less

Journal ArticleDOI

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

Jun Liu, +5 more

- 01 Oct 2020 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a large-scale dataset for RGB+D human action recognition, which is collected from 106 distinct subjects and contains more than 114 thousand video samples and 8 million frames, and investigates a novel one-shot 3D activity recognition problem on this dataset.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Proceedings ArticleDOI

Real-time human pose recognition in parts from single depth images

Jamie Shotton, +7 more

TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.

...read moreread less

Proceedings ArticleDOI

Recognizing human actions: a local SVM approach

Christian Schüldt, +2 more

TL;DR: This paper construct video representations in terms of local space-time features and integrate such representations with SVM classification schemes for recognition and presents the presented results of action recognition.

...read moreread less

Journal ArticleDOI

The recognition of human movement using temporal templates

Aaron F. Bobick, +1 more

- 01 Mar 2001 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A view-based approach to the representation and recognition of human movement is presented, and a recognition method matching temporal templates against stored instances of views of known actions is developed.

...read moreread less

Proceedings ArticleDOI

Behavior recognition via sparse spatio-temporal features

Piotr Dollár, +3 more

TL;DR: It is shown that the direct 3D counterparts to commonly used 2D interest point detectors are inadequate, and an alternative is proposed, and a recognition algorithm based on spatio-temporally windowed data is devised.

...read moreread less

Collapse

View invariant human action recognition using histograms of 3D joints

Citations

Hierarchical recurrent neural network for skeleton based action recognition

Enhanced Computer Vision With Microsoft Kinect Sensor: A Review

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

References

A tutorial on hidden Markov models and selected applications in speech recognition

Real-time human pose recognition in parts from single depth images

Recognizing human actions: a local SVM approach

The recognition of human movement using temporal templates

Behavior recognition via sparse spatio-temporal features

Related Papers (5)

Mining actionlet ensemble for action recognition with depth cameras

Action recognition based on a bag of 3D points

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences

Hierarchical recurrent neural network for skeleton based action recognition