Recognizing Human Activities in Videos Using Improved Dense Trajectories over LSTM

doi:10.1007/978-981-13-0020-2_8

Book ChapterDOI

Recognizing Human Activities in Videos Using Improved Dense Trajectories over LSTM

- pp 78-88

TLDR

This work proposes a deep learning based technique to classify actions based on Long Short Term Memory networks, and extends the proposed framework with an efficient motion feature, to enable handling significant camera motion.

Abstract:

We propose a deep learning based technique to classify actions based on Long Short Term Memory (LSTM) networks. The proposed scheme first learns spatial temporal features from the video, using an extension of the Convolutional Neural Networks (CNN) to 3D. A Recurrent Neural Network (RNN) is then trained to classify each sequence considering the temporal evolution of the learned features for each time step. Experimental results on the CMU MoCap, UCF 101, Hollywood 2 dataset show the efficacy of the proposed approach. We extend the proposed framework with an efficient motion feature, to enable handling significant camera motion. The proposed approach outperforms the existing deep models for each dataset.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

RCCNet: An Efficient Convolutional Neural Network for Histological Routine Colon Cancer Nuclei Classification

S. H. Shabbeer Basha, +5 more

TL;DR: In this article, the authors proposed an efficient convolutional neural network (CNN) based architecture for classification of histological routine colon cancer nuclei named as RCCNet, which consists of 1, 512, 868 learnable parameters which are significantly less compared to the popular CNN models such as AlexNet, CIFAR-VGG, GoogLeNet, and WRN.

...read moreread less

Book ChapterDOI

Human Action Recognition from 3D Landmark Points of the Performer

Snehasis Mukherjee, +1 more

TL;DR: In this article, the 3D landmark points of the human pose were extracted from a single image and then used as features for action recognition by applying an autoencoder architecture followed by a regression layer.

...read moreread less

Journal ArticleDOI

Human Activity Recognition With Low-Resolution Infrared Array Sensor Using Semi-Supervised Cross-Domain Neural Networks for Indoor Environment

Cunyi Yin, +6 more

- 01 Jul 2023 -

IEEE Internet of Things Journal

TL;DR: In this article , a semi-supervised cross-domain neural network (SCDNN) based on low-resolution infrared sensor is proposed for accurately identifying human activity despite changes in the environment at a low cost.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Action recognition with trajectory-pooled deep-convolutional descriptors

Limin Wang, +2 more

TL;DR: This paper presents a new video representation, called trajectory-pooled deep-convolutional descriptor (TDD), which shares the merits of both hand-crafted features and deep-learned features, and achieves superior performance to the state of the art on these datasets.

...read moreread less

Book ChapterDOI

Convolutional learning of spatio-temporal features

Graham W. Taylor, +3 more

TL;DR: A model that learns latent representations of image sequences from pairs of successive images is introduced, allowing it to scale to realistic image sizes whilst using a compact parametrization.

...read moreread less

DOI

MoSIFT: Recognizing Human Actions in Surveillance Videos

Ming-yu Chen, +1 more

TL;DR: This paper proposes an algorithm called MoSIFT, which detects interest points and encodes not only their local appearance but also explicitly models local motion, and introduces a bigram model to construct a correlation between local features to capture the more global structure of actions.

...read moreread less

Proceedings ArticleDOI

First Person Action Recognition Using Deep Learned Descriptors

Suriya Singh, +2 more

TL;DR: This work proposes convolutional neural networks (CNNs) for end to end learning and classification of wearer's actions and shows that the proposed network can generalize and give state of the art performance on various disparate egocentric action datasets.

...read moreread less

Journal ArticleDOI

Semantic human activity recognition: A literature review

Maryam Ziaeefard, +1 more

- 01 Aug 2015 -

Pattern Recognition

TL;DR: An overview of state-of-the-art methods in activity recognition using semantic features, including a semantic space including the most popular semantic features of an action namely the human body, attributes, related objects, and scene context, is presented.

...read moreread less

Recognizing Human Activities in Videos Using Improved Dense Trajectories over LSTM

Citations

RCCNet: An Efficient Convolutional Neural Network for Histological Routine Colon Cancer Nuclei Classification

Human Action Recognition from 3D Landmark Points of the Performer

Human Activity Recognition With Low-Resolution Infrared Array Sensor Using Semi-Supervised Cross-Domain Neural Networks for Indoor Environment

References

Action recognition with trajectory-pooled deep-convolutional descriptors

Convolutional learning of spatio-temporal features

MoSIFT: Recognizing Human Actions in Surveillance Videos

First Person Action Recognition Using Deep Learned Descriptors

Semantic human activity recognition: A literature review

Related Papers (5)

Deep Convolutional and LSTM Neural Network Architectures on Leap Motion Hand Tracking Data Sequences

Two deep approaches for ADL recognition: A multi-scale LSTM and a CNN-LSTM with a 3D matrix skeleton representation

Sequential deep learning for human action recognition

Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification

A parallel-fusion RNN-LSTM architecture for image caption generation