• •    

CCML2017+367+基于用户兴趣语义的视频关键帧提取

俞璜悦,王晗,郭梦婷   

  1. 北京林业大学
  • 收稿日期:2017-06-26 发布日期:2017-06-26
  • 通讯作者: 王晗

CCML2017+367+Video Keyframe Extraction based on Users’ interests

  • Received:2017-06-26 Online:2017-06-26

摘要: 视频关键帧提取是视频摘要、检索等领域的热点研究问题。目前,视频关键信息提取技术主要集中于根据视频低层特征进行关键帧的提取,忽略了与用户兴趣相关的语义信息。对视频进行语义建模需收集大量已标注的视频训练样本,这个过程费时费力。为缓解这一问题,本文使用大量互联网图像数据构建基于用户兴趣的语义模型,这些图像数据内容丰富、同时涵盖大量事件信息。然而,从互联网获取的图像知识多样且常伴随图像噪声,使用蛮力迁移将大大影响视频最终提取效果。本文提出使用近义词联合权重模型衡量互联网中存在差异但语义相近的图像组,并利用这些图像组构建语义模型。在此框架下,通过联合权重学习获取语义权重,每一图像组在知识迁移中所起的作用由权重值决定。本文使用来自不同视频网站的多段视频对该方法进行验证,实验结果表明对用户感兴趣的内容进行联合权重语义建模能更加全面、准确地获取信息,从而有效指导视频关键帧提取。

关键词: 视频检索, 关键帧提取, 视频分析, 知识迁移

Abstract: Extracting keyframes is of great interest in video summary, organization, browsing and indexing. Current researches mainly focus on obtaining extractions by optimizing low-level?feature diversity or representativeness of the video frames ignoring the interests of the users. And it is time consuming and labor expensive to collect a large amount of required labelled videos to model different user-interest concepts for different videos. To alleviate the labelling process, we propose to learn models for user- interest concepts on different videos by leveraging abundant Web images which cover many roughly annotated concepts and often captured in a maximally informative way. However, knowledge from the Web is noisy and diverse, brute force knowledge transfer may hurt the keyframe extraction performance. To address this problem, we propose a novel joint group weighting learning framework to leverage different but related groups of knowledges learnt from the Web images to videos. Under this framework, weights of different groups are learnt in a joint optimization problem, and each weight represents how contributive the corresponding image group is to the knowledge transferred to the video. Experimental results on several challenging video datasets demonstrate that it is effective to use grouped knowledge gained from Web images for video keyframe extraction and provides more comprehensive results.

Key words: Video retrieval, Keyframe extraction, Video analysis, Knowledge transfer

中图分类号: