RGB-D-based action recognition datasets

doi:10.1016/J.PATCOG.2016.05.019

Open Access Journal Article

Jing Zhang, +4 more

- 01 Dec 2016 -

Pattern Recognition

- Vol. 60, pp 86-105

TLDR

This article presents a comprehensive review of the most commonly used RGB-D video datasets for action recognition, including 27 single-view, 10 multi-view, and 7 multi-person datasets.

About:

This article was published in Pattern Recognition on 2016-12-01 and is currently open access. It has received 244 citations to date.

Citations

Journal Article

Recent advances in convolutional neural networks

Jiuxiang Gu, +10 more

- 01 May 2018 -

Pattern Recognition

TL;DR: A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNN on different aspects, namely, layer design, activation function, loss function, regularization, optimization and fast computation.

Posted Content

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Amir Shahroudy, +3 more

- 11 Apr 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, a large-scale dataset for RGB+D human action recognition was introduced with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects.

Proceedings Article

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Amir Shahroudy, +3 more

TL;DR: This paper introduces a large-scale dataset for RGB+D human action recognition with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects. It also proposes a new recurrent neural network structure that models the long-term temporal correlation of the features for each body part and uses them for better action classification.

Posted Content

Recent Advances in Convolutional Neural Networks

Jiuxiang Gu, +11 more

- 22 Dec 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper details the improvements of CNN on different aspects, including layer design, activation function, loss function, regularization, optimization and fast computation, and introduces various applications of convolutional neural networks in computer vision, speech and natural language processing.

Journal Article

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

Jun Liu, +5 more

- 01 Oct 2020 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a large-scale dataset for RGB+D human action recognition, which is collected from 106 distinct subjects and contains more than 114 thousand video samples and 8 million frames, and investigates a novel one-shot 3D activity recognition problem on this dataset.

References

Proceedings Article

Large-Scale Video Classification with Convolutional Neural Networks

Andrej Karpathy, +5 more

TL;DR: This work studies multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggests a multiresolution, foveated architecture as a promising way of speeding up the training.

Proceedings Article

Actions as space-time shapes

M. Blank, +4 more

TL;DR: The method is fast, does not require video alignment and is applicable in many scenarios where the background is known, and the robustness of the method is demonstrated to partial occlusions, non-rigid deformations, significant changes in scale and viewpoint, high irregularities in the performance of an action and low quality video.

Proceedings Article

ActivityNet: A large-scale video benchmark for human activity understanding

Fabian Caba Heilbron, +3 more

TL;DR: This paper introduces ActivityNet, a new large-scale video benchmark for human activity understanding that aims at covering a wide range of complex human activities that are of interest to people in their daily living.

Proceedings Article

Hierarchical recurrent neural network for skeleton based action recognition

Yong Du, +2 more

TL;DR: This paper proposes an end-to-end hierarchical RNN for skeleton based action recognition, and demonstrates that the model achieves the state-of-the-art performance with high computational efficiency.

Related Papers (5)

Mining actionlet ensemble for action recognition with depth cameras

Action recognition based on a bag of 3D points

View invariant human action recognition using histograms of 3D joints

Hierarchical recurrent neural network for skeleton based action recognition

3D Convolutional Neural Networks for Human Action Recognition