Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (8): 2408-2413.DOI: 10.11772/j.issn.1001-9081.2019010139

• Virtual reality and multimedia computing • Previous Articles     Next Articles

Weakly supervised action localization based on action template matching

SHI Xiangbin1,2, ZHOU Jincheng1, LIU Cuiwei2   

  1. 1. College of Information, Liaoning University, Shenyang Liaoning 110136, China;
    2. College of Computer Science, Shenyang Aerospace University, Shenyang Liaoning 110136, China
  • Received:2019-01-23 Revised:2019-03-11 Online:2019-04-15 Published:2019-08-10
  • Supported by:
    This work is partially surpported by the Doctor Startup Found Program of Liaoning Province (201601172).

基于动作模板匹配的弱监督动作定位

石祥滨1,2, 周金成1, 刘翠微2   

  1. 1. 辽宁大学 信息学院, 沈阳 110036;
    2. 沈阳航空航天大学 计算机学院, 沈阳 110136
  • 通讯作者: 周金成
  • 作者简介:石祥滨(1963-),男,辽宁沈阳人,教授,博士,CCF高级会员,主要研究方向:分布式虚拟现实、网络游戏、数据库;周金成(1993-),男,安徽池州人,硕士研究生,主要研究方向:视频分析;刘翠微(1989-),女,辽宁沈阳人,讲师,博士,CCF会员,主要研究方向:视频分析、模式识别。
  • 基金资助:
    辽宁省博士启动基金项目(201601172)。

Abstract: In order to solve the problem of action localization in video, a weakly supervised method based on template matching was proposed. Firstly, several candidate bounding boxes of the action subject position were given on each frame of the video, and then these candidate bounding boxes were connected in chronological order to form action proposals. Secondly, action templates were obtained from some frames of the training set video. Finally, the optimal model parameters were obtained after model training by using action proposals and action templates. In the experiments on UCF-sports dataset, the method has the accuracy of the action classification increased by 0.3 percentage points compared with TLSVM (Transfer Latent Support Vector Machine) method; when the overlapping threshold is 0.2, the method has the accuracy of action localization increased by 28.21 percentage points compared with CRANE method. Experimental results show that the proposed method can not only reduce the workload of dataset annotation, but also improve the accuracy of action classification and action localization.

Key words: action localization, action template, weakly supervised, action proposal, video

摘要: 为解决视频中的动作定位问题,提出一种基于模板匹配的弱监督动作定位方法。首先在视频的每一帧上给出若干个动作主体位置的候选框,按时间顺序连接这些候选框形成动作提名;然后利用训练集视频的部分帧得到动作模板;最后利用动作提名与动作模板训练模型,找到最优的模型参数。在UCF-sports数据集上进行实验,结果显示,与TLSVM方法相比,所提方法的动作分类准确率提升了0.3个百分点;当重叠度阈值取0.2时,与CRANE方法相比,所提方法的动作定位准确率提升了28.21个百分点。实验结果表明,所提方法不但能够减少数据集标注的工作量,而且动作分类和动作定位的准确率均得到提升。

关键词: 动作定位, 动作模板, 弱监督, 动作提名, 视频

CLC Number: