计算机应用 ›› 2013, Vol. 33 ›› Issue (08): 2293-2296.

• 多媒体处理技术 • 上一篇    下一篇

基于异构信息双向传播的网络视频分类方法

李谦1,杜友田1,薛姣2   

  1. 1. 智能网络与网络安全教育部重点实验室(西安交通大学),西安710049
    2. 西安交通大学智能网络与网络安全教育部重点实验室(西安交通大学),西安710049
  • 收稿日期:2013-02-20 修回日期:2013-03-30 出版日期:2013-08-01 发布日期:2013-09-11
  • 通讯作者: 杜友田
  • 作者简介:李谦(1987-),男,山西太原人,硕士研究生,主要研究方向:网络多媒体理解;
    杜友田(1980-),男,山东日照人,讲师,博士,主要研究方向:网络多媒体理解、在线社会网络、机器学习;
    薛姣(1989-),女,陕西汉中人,硕士研究生,主要研究方向:网络多媒体理解。
  • 基金资助:
    国家自然科学基金资助项目;国家“十二五”科技支撑计划项目

Web video classification based on bidirectional propagation of heterogeneous attributes

LI Qian,DU Youtian,XUE Jiao   

  1. Ministry of Education Key Laboratory for Intelligent Networks and Network Security (Xi'an Jiaotong University), Xi'an Shaanxi 710049, China
  • Received:2013-02-20 Revised:2013-03-30 Online:2013-09-11 Published:2013-08-01
  • Contact: DU Youtian

摘要: 针对以往大多数网络视频分类研究只将文本和视觉特征进行简单融合的问题,提出了基于异构信息双向传播的网络视频分类方法。首先基于K均值方法将视频关键帧聚类成多个簇,在帧层次上对视频数据进行建模;将每个簇中代表性关键帧的文本信息传播至该簇作为其文本解释,完成从文本至视觉模态的传播;对每个关键帧,将其对应簇的文本解释传播至该关键帧,完成从视觉至文本模态的传播;最后基于支持向量机(SVM)对网络视频进行分类。在信息的双重传播中两类异构数据得到了密切的融合。实验结果表明该方法有效地提高了网络视频分类的准确率。

关键词: 网络视频分类, 异构数据, 视觉模态, 文本模态, 双向传播

Abstract: Concerning that most Web video categorization researches just focus on the basic simple fusion of the information from text model and visual model, a Web video classification method based on the bidirectional propagation of heterogeneous attributes was proposed. Firstly, the method adopted K-means clustering to divide key frames into multiple clusters, and modeled videos at the level of frame. For each cluster, a part of key frames were randomly chosen to propagate their text information to the cluster. For each key frame, the text explanation of the corresponding cluster was transferred to this frame. Finally, the Web video was classified based on the extended text information from dual propagation by using Support Vector Machine (SVM) classifiers. The method integrates heterogeneous attributes well based on the dual propagation. The experimental results demonstrate the effectiveness of the method in the Web video classification.

Key words: Web video classification, heterogeneous data, visual modality, textual modality, bidirectional propagation

中图分类号: