Learning and fusing multiple hidden substages for action quality assessment

doi:10.1016/J.KNOSYS.2021.107388

Journal Article

Li-Jia Dong, +5 more

11 Oct 2021

Knowledge-Based Systems

Vol. 229, Article 107388

TLDR

In this article, a learning and fusion network of multiple hidden substages is proposed to assess athletic performance: videos are segmented into five substages by temporal semantic segmentation, a fully-connected-network-based hidden regression model predicts the score of each substage, and these scores are fused into the overall score.

Abstract:

Many existing methods for action quality assessment use single-stage score regression networks that are not well tailored to, or well justified for, the evaluation task. In this work, we seek an action quality assessment method for sports competitions that conforms to objective evaluation rules and field experience. To this end, we define three assessment scenarios: the overall-score-guided scenario, the execution-score-guided scenario, and the difficulty-level-based overall-score-guided scenario. We propose a learning and fusion network of multiple hidden substages that assesses athletic performance by segmenting each video into five substages using temporal semantic segmentation. The feature of each video segment is extracted by five feature backbone networks with shared weights, and a fully-connected-network-based hidden regression model predicts the score of each substage; these scores are then fused into the overall score. We evaluate the proposed method on the UNLV-Diving dataset. The comparison results show that the proposed method, which is based on the objective evaluation rules of sports competitions, outperforms a regression model trained directly on the overall score. The proposed multiple-substage network is more accurate than the single-stage score regression network and achieves state-of-the-art performance, which indicates that objective evaluation rules and field experience are beneficial for building an accurate and reasonable action quality assessment model.
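The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the boundary indices stand in for the output of the temporal semantic segmentation, `shared_backbone` replaces the shared-weight feature networks with simple mean pooling, each head is a single linear layer rather than a fully connected network, and fusing by summation and multiplying by a difficulty level are assumptions made only for illustration. All function names are hypothetical.

```python
from typing import Callable, List, Optional, Sequence

Vector = List[float]


def split_into_substages(frames: Sequence[Vector],
                         boundaries: Sequence[int]) -> List[List[Vector]]:
    """Cut a frame sequence at the given indices (hypothetical segmentation output)."""
    cuts = [0, *boundaries, len(frames)]
    return [list(frames[a:b]) for a, b in zip(cuts, cuts[1:])]


def shared_backbone(segment: Sequence[Vector]) -> Vector:
    """Stand-in for the shared-weight feature extractor: mean-pool frame features.

    The same function is applied to every substage, which mimics weight sharing.
    """
    if not segment:
        raise ValueError("empty substage segment")
    n = len(segment)
    return [sum(column) / n for column in zip(*segment)]


def make_head(weights: Vector, bias: float) -> Callable[[Vector], float]:
    """One linear regression head per substage (placeholder for the FC network)."""
    return lambda feature: sum(w * x for w, x in zip(weights, feature)) + bias


def assess(frames: Sequence[Vector],
           boundaries: Sequence[int],
           heads: Sequence[Callable[[Vector], float]],
           difficulty: Optional[float] = None) -> float:
    """Score each substage, then fuse the substage scores into one overall score."""
    segments = split_into_substages(frames, boundaries)
    if len(segments) != len(heads):
        raise ValueError("need exactly one regression head per substage")
    substage_scores = [head(shared_backbone(seg))
                       for head, seg in zip(heads, segments)]
    fused = sum(substage_scores)  # assumed fusion rule: plain summation
    # Assumed difficulty-based variant: scale the fused score by the difficulty level.
    return fused * difficulty if difficulty is not None else fused


# Toy input: 10 frames with 2-dimensional features, cut into five substages.
frames = [[float(i), 1.0] for i in range(10)]
boundaries = [2, 4, 6, 8]
heads = [make_head([1.0, 0.0], 0.0) for _ in range(5)]

overall = assess(frames, boundaries, heads)
overall_with_difficulty = assess(frames, boundaries, heads, difficulty=2.0)
```

In a trained model the heads would be learned from score annotations; the toy weights here only show how five substage predictions combine into one overall prediction.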

Citations

Functional movement screen dataset collected with two Azure Kinect depth sensors

Skeleton-based deep pose feature learning for action quality assessment on figure skating videos

Pairwise Contrastive Learning Network for Action Quality Assessment

Gaussian guided frame sequence encoder network for action quality assessment
