Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition

doi:10.1109/TPAMI.2016.2537337

Journal ArticleDOI

Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition

An-An Liu, +3 more

- 01 Jan 2017 -

IEEE Transactions on Pattern Analysis an...

- Vol. 39, Iss: 1, pp 102-114

Chats0

TLDR

This paper forms the objective function into the group-wise least square loss regularized by low rank and sparsity with respect to two latent variables, model parameters and grouping information, for joint optimization and can attain both optimal action models and group discovery by alternating iteratively.

Abstract:

This paper proposes a hierarchical clustering multi-task learning (HC-MTL) method for joint human action grouping and recognition. Specifically, we formulate the objective function into the group-wise least square loss regularized by low rank and sparsity with respect to two latent variables, model parameters and grouping information, for joint optimization. To handle this non-convex optimization, we decompose it into two sub-tasks, multi-task learning and task relatedness discovery. First, we convert this non-convex objective function into the convex formulation by fixing the latent grouping information. This new objective function focuses on multi-task learning by strengthening the shared-action relationship and action-specific feature learning. Second, we leverage the learned model parameters for the task relatedness measure and clustering. In this way, HC-MTL can attain both optimal action models and group discovery by alternating iteratively. The proposed method is validated on three kinds of challenging datasets, including six realistic action datasets (Hollywood2, YouTube, UCF Sports, UCF50, HMDB51 $\&$ UCF101), two constrained datasets (KTH $\&$ TJU), and two multi-view datasets (MV-TJU $\&$ IXMAS). The extensive experimental results show that: 1) HC-MTL can produce competing performances to the state of the arts for action recognition and grouping; 2) HC-MTL can overcome the difficulty in heuristic action grouping simply based on human knowledge; 3) HC-MTL can avoid the possible inconsistency between the subjective action grouping depending on human knowledge and objective action grouping based on the feature subspace distributions of multiple actions. Comparison with the popular clustered multi-task learning further reveals that the discovered latent relatedness by HC-MTL aids inducing the group-wise multi-task learning and boosts the performance. To the best of our knowledge, ours is the first work that breaks the assumption that all actions are either independent for individual learning or correlated for joint modeling and proposes HC-MTL for automated, joint action grouping and modeling.

Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition

Citations

A Survey on Multi-Task Learning

A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks

Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features

Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction

Automated pulmonary nodule detection in CT images using deep convolutional neural networks

References

On Spectral Clustering: Analysis and an algorithm

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild

3D Convolutional Neural Networks for Human Action Recognition

3D Convolutional Neural Networks for Human Action Recognition

Learning realistic human actions from movies

Related Papers (5)

Deep Residual Learning for Image Recognition

HMDB: A large video database for human motion recognition

ImageNet Classification with Deep Convolutional Neural Networks

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild

Learning realistic human actions from movies