An Extensive Analysis of the Vision-based Deep Learning Techniques for Action Recognition

doi:10.14569/IJACSA.2021.0120276

Open Access, Journal Article

Manasa R, +2 more

- 01 Jan 2021 -

International Journal of Advanced Comput...

- Vol. 12, Iss: 2

TLDR

This paper summarizes the evolution of action localization, classification, and detection algorithms applied to data from vision-based sensors, and reviews the datasets used for these tasks.

Abstract:

Action recognition involves localizing and classifying actions in a video over a sequence of frames. It can be thought of as an image classification task extended in time: the information obtained across many frames is aggregated to produce the action classification output. Applications of action recognition systems range from assistance in healthcare systems to human-machine interaction. Action recognition has proven to be a challenging task because it poses many obstacles, including high computation cost, the need to capture extended context, the design of complex architectures, and a lack of benchmark datasets. Increasing the efficiency of human action recognition algorithms can significantly improve the likelihood of deploying them in real-world scenarios. This paper summarizes the evolution of various action localization, classification, and detection algorithms applied to data from vision-based sensors. We also review the datasets that have been used for action classification, localization, and detection. We further explore the areas of action classification and temporal and spatiotemporal action detection, which use convolutional neural networks, recurrent neural networks, or a combination of both.
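The idea of treating action recognition as image classification extended over time can be illustrated with a minimal sketch. It assumes a per-frame classifier has already produced class scores for each frame; the function names, labels, and score values below are hypothetical and are not taken from the paper. Averaging per-frame probabilities is only one simple aggregation strategy among those a survey of this kind covers.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_frame_scores(frame_logits):
    """Average per-frame class probabilities into one video-level distribution."""
    if not frame_logits:
        raise ValueError("need at least one frame")
    per_frame = [softmax(f) for f in frame_logits]
    num_classes = len(per_frame[0])
    return [
        sum(p[c] for p in per_frame) / len(per_frame)
        for c in range(num_classes)
    ]


def classify_video(frame_logits, labels):
    """Return the label with the highest averaged probability and that probability."""
    probs = aggregate_frame_scores(frame_logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical labels and per-frame scores, for illustration only.
labels = ["walk", "run", "jump"]
frame_logits = [
    [2.0, 0.5, 0.1],  # frame 1 favors "walk"
    [1.5, 1.7, 0.2],  # frame 2 slightly favors "run"
    [2.2, 0.3, 0.0],  # frame 3 favors "walk"
]
label, confidence = classify_video(frame_logits, labels)
print(label, round(confidence, 3))
```

Here a single ambiguous frame does not change the video-level decision, which is the practical benefit of aggregating over frames rather than classifying each frame in isolation.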

Citations

Journal Article

Cross Domain Action Recognition Based on Deep Dual Auto-Encoder Network

儿良肖

- 01 Jan 2022 -

软件工程与应用 (Software Engineering and Applications)

References

Proceedings Article

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

Proceedings Article

Learning Spatiotemporal Features with 3D Convolutional Networks

Du Tran, +5 more

TL;DR: The learned features, namely C3D (Convolutional 3D), with a simple linear classifier outperform state-of-the-art methods on 4 different benchmarks and are comparable with current best methods on the other 2 benchmarks.

Proceedings Article

Two-Stream Convolutional Networks for Action Recognition in Videos

Karen Simonyan, +1 more

TL;DR: This work proposes a two-stream ConvNet architecture which incorporates spatial and temporal networks and demonstrates that a ConvNet trained on multi-frame dense optical flow is able to achieve very good performance in spite of limited training data.

Proceedings Article

Large-Scale Video Classification with Convolutional Neural Networks

Andrej Karpathy, +5 more

TL;DR: This work studies multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggests a multiresolution, foveated architecture as a promising way of speeding up the training.
