Journal of Computer Applications

    Next Articles

Action quality assessment model based on new trajectory-guided perceptual learning with X3D

  

  • Received:2025-02-20 Revised:2025-04-16 Online:2025-04-24 Published:2025-04-24

基于X3D的新型轨迹引导感知学习的动作质量评估模型

张四中,刘建阳,李林峰   

  1. 西南交通大学机械工程学院
  • 通讯作者: 刘建阳

Abstract: Action quality assessment has attracted many researchers as a challenging visual task. Current research methods mainly focus on improving the feature extraction capability of backbone networks, ignoring the effect of motion trajectories. However, the consistency of the movement is also an important factor for evaluating the execution of the movement in real competitions. In this paper we design a novel trajectory-guided perceptual learning framework for action quality assessment, which uses trajectory descriptors to guide the model to perceptually learn action coherence information, and to achieve interactive learning between different information. Aiming at the lack of trajectory labels in the current dataset, we design an unsupervised optical flow trajectory extraction method based on the Farneback optical flow method to obtain motion trajectory information, and use the acquired optical flow trajectory features as cue words to guide the model to learn the video features perceptually. Finally a learnable spline curves of the KAN network is used to fit the data distribution of the mixture of features, so as to establish a more accurate mapping relationship. We conduct experiments on MTL-AQA, AQA-7, FineDiving and JIGSAWS datasets, and the spearman rank correlation coefficients on the above datasets can reach 0.9101 (MTL-AQA), 0.912 (AQA-7), 0.882 (FineDiving), and 0.98 (JIGSAWS), respectively.

Key words: trajectory, attention mechanism, guided perception learning, consistency of action, action quality assessment

摘要: 动作质量评估作为一项极具挑战性的视觉任务吸引了众多研究者的目光。当前研究方法主要集中于提升骨干网络的特征提取能力,忽略了运动轨迹的影响,而现实中动作的连贯性也是评价动作执行情况的重要因素。本文通过引入轨迹信息,设计一种新型轨迹引导感知学习的动作质量评估模型,利用轨迹描述符引导模型感知学习动作连贯性信息,实现不同信息之间的交互学习。针对当前数据集缺乏轨迹标签的问题,设计一种基于Farneback光流法的无监督光流轨迹提取方法获取运动轨迹信息,并将获取的光流轨迹特征作为引导词,引导模型感知学习视频特征。最后利用KAN网络的可学习样条曲线拟合了混合特征的数据分布,建立更精确的映射关系。模型在MTL-AQA、AQA-7、FineDiving、JIGSAWS数据集上进行实验,以斯皮尔曼秩相关系数作为评价指标,在各个数据集上分别取得了0.9101、0.9120、0.882和0.99的效果。

关键词: 轨迹, 注意力机制, 引导感知学习, 动作连贯性, 动作质量评估

CLC Number: