Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (9): 2605-2609. DOI: 10.11772/j.issn.1001-9081.2017.09.2605

Key frame extraction of motion video based on spatial-temporal feature locally preserving

SHI Nianfeng1, HOU Xiaojing1, ZHANG Ping2   

  1. School of Computer and Information Engineering, Luoyang Institute of Science and Technology, Luoyang Henan 471023, China;
  2. School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang Henan 471023, China
  • Received: 2017-04-25  Revised: 2017-06-10  Online: 2017-09-10  Published: 2017-09-13
  • Supported by:
    This work is partially supported by the Key Science and Technology Program of Henan Province (152102210329, 172102310635).

时空特征局部保持的运动视频关键帧提取

石念峰1, 侯小静1, 张平2   

  1. 洛阳理工学院 计算机与信息工程学院, 河南 洛阳 471023;
  2. 河南科技大学 数学与统计学院, 河南 洛阳 471023
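  • Corresponding author: HOU Xiaojing (侯小静), alisha@163.com
  • About the authors: SHI Nianfeng (石念峰, born 1976), male, from Luoyang, Henan; associate professor, Ph.D., CCF senior member; research interests: computer-supported cooperative work, pattern recognition. HOU Xiaojing (侯小静, born 1975), female, from Luoyang, Henan; lecturer, M.S.; research interests: pattern recognition. ZHANG Ping (张平, born 1976), male, from Mudanjiang, Heilongjiang; associate professor, Ph.D.; research interests: network security, pattern recognition.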
  • 基金资助:
    河南省科技攻关项目(152102210329, 172102310635)。

Abstract: To improve the motion expressiveness and compression rate of key frames extracted from motion video, a key frame extraction technique combining flexible pose estimation with spatial-temporal feature embedding was proposed. Firstly, a Spatial-Temporal feature embedded Flexible Mixture-of-Parts articulated human model (ST-FMP) was built by preserving the spatial-temporal features of body parts, and the N-best algorithm, constrained by the spatial-temporal local preservation of uncertain body parts, was used with ST-FMP to estimate the body configuration in each single frame. Then, the relative positions and motion directions of body parts were used to describe human motion, and the Laplacian score algorithm was applied for dimensionality reduction, yielding a discriminative motion feature vector that preserves local topological structure. Finally, the ISODATA (Iterative Self-Organizing Data Analysis Technique) algorithm was used to determine the key frames dynamically. In key frame extraction experiments on aerobics video, ST-FMP improved the recognition accuracy of uncertain body parts by about 15 percentage points over the articulated human model with Flexible Mixture-of-Parts (FMP), and the proposed approach achieved a key frame extraction accuracy of 81%, higher than both Key Frames Extraction based on prior knowledge (KFE) and key frame extraction based on motion blocks. The results also show that the approach is sensitive to the choice of motion features and to the human pose configuration, and that it can be used for sports video annotation.
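
The dimensionality-reduction step named in the abstract can be illustrated with a minimal sketch of the standard Laplacian score for feature ranking. This is not the authors' implementation: the abstract does not state how the neighbourhood graph is built, so the k-nearest-neighbour graph, the heat-kernel width `t`, the function name `laplacian_scores`, and the toy data are all assumptions made for illustration. Rows stand for frames and columns for candidate motion features; a lower score means the feature better preserves local neighbourhood structure.

```python
import numpy as np

def laplacian_scores(X, k=5, t=1.0):
    """Laplacian score of each column of X (lower = better locality preservation).

    X: (n_samples, n_features) array; k and t are assumed, illustrative defaults.
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    # Pairwise squared Euclidean distances between samples.
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)
    # Exclude each sample from its own neighbour list.
    masked = sq.copy()
    np.fill_diagonal(masked, np.inf)
    nbrs = np.argsort(masked, axis=1)[:, :k]
    rows = np.repeat(np.arange(n), k)
    cols = nbrs.ravel()
    # Heat-kernel weights on k-nearest-neighbour edges, then symmetrise.
    S = np.zeros((n, n))
    S[rows, cols] = np.exp(-sq[rows, cols] / t)
    S = np.maximum(S, S.T)
    d = S.sum(axis=1)            # degree of each sample
    L = np.diag(d) - S           # graph Laplacian
    # Remove the degree-weighted mean from every feature.
    F = X - (d @ X) / d.sum()
    num = np.sum(F * (L @ F), axis=0)           # f^T L f per feature
    den = np.sum(F * (d[:, None] * F), axis=0)  # f^T D f per feature
    return num / np.maximum(den, 1e-12)

# Toy data (hypothetical): feature 0 separates two groups of frames,
# feature 1 is unstructured noise.
rng = np.random.default_rng(0)
centers = np.repeat([0.0, 10.0], 20)
X_demo = np.column_stack([centers + rng.normal(0, 0.1, 40),
                          rng.normal(0, 1.0, 40)])
scores = laplacian_scores(X_demo, k=5, t=1.0)
ranking = np.argsort(scores)  # feature indices, best first
```

In such a pipeline one would keep the lowest-scoring features and pass the reduced vectors to the clustering step; how many features to keep is a tuning choice that the abstract does not specify.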

Key words: key frame extraction, motion video, pose estimation, articulated human model with Flexible Mixture-of-Parts (FMP), feature selection

摘要: 为提高运动视频关键帧的运动表达能力和压缩率,提出柔性姿态估计和时空特征嵌入结合的运动视频关键帧提取技术。首先,利用人体动作的时间连续性保持建立具有时间约束限制的柔性部件铰接人体(ST-FMP)模型,通过非确定性人体部位动作连续性约束,采用N-best算法估计单帧图像中的人体姿态参数;接着,采用人体部位的相对位置和运动方向描述人体运动特征,通过拉普拉斯分值法实施数据降维,获得局部拓扑结构表达能力强的判别性人体运动特征向量;最后,采用迭代自组织数据分析技术(ISODATA)算法动态地确定关键帧。在健美操动作视频关键帧提取实验中,ST-FMP模型将柔性混合铰接人体模型(FMP)的非确定性人体部位的识别准确率提高约15个百分点,取得了81%的关键帧提取准确率,优于KFE和运动块的关键帧算法。所提算法对人体运动特征和人体姿态敏感,适用于运动视频批注审阅。

关键词: 关键帧提取, 运动视频, 姿态估计, 柔性混合铰接人体模型, 特征选择
