计算机应用 ›› 2016, Vol. 36 ›› Issue (11): 3217-3221.DOI: 10.11772/j.issn.1001-9081.2016.11.3217

• 虚拟现实与数字媒体 • 上一篇    下一篇

对象级特征引导的显著性视觉注意方法

杨凡1,2, 蔡超1,2   

  1. 1. 华中科技大学 自动化学院, 武汉 430074;
    2. 多谱信息处理技术国家重点实验室(华中科技大学), 武汉 430074
  • 收稿日期:2016-03-18 修回日期:2016-06-26 出版日期:2016-11-10 发布日期:2016-11-12
  • 通讯作者: 蔡超
  • 作者简介:杨凡(1990-),男,山东淄博人,硕士研究生,主要研究方向:视觉注意、显著性目标检测、深度学习;蔡超(1971-),男,山东东明人,副教授,博士,主要研究方向:计算机视觉、目标识别、医学图像处理、任务规划。
  • 基金资助:
    华为创新基金资助项目(YJCB2010022IN)。

Significant visual attention method guided by object-level features

YANG Fan1,2, CAI Chao1,2   

  1. 1. School of Automation, Huazhong University of Science and Technology, Wuhan Hubei 430074, China;
    2. National Key Laboratory of Science and Technology on Multi-spectral Information Processing, (Huazhong University of Science and Technology), Wuhan Hubei 430074, China
  • Received:2016-03-18 Revised:2016-06-26 Online:2016-11-10 Published:2016-11-12
  • Supported by:
    This work is partially supported by the Huawei Innovation Fund (YJCB2010022IN).

摘要: 针对已有视觉注意模型在整合对象特征方面的不足,提出一种新的结合高层对象特征和低层像素特征的视觉注意方法。首先,利用已训练的卷积神经网(CNN)对多类目标的强大理解能力,获取待处理图像中对象的高层次特征图;然后结合实际的眼动跟踪数据,训练多个对象特征图的加权系数,给出对象级突出图;紧接着提取像素级突出图,并和对象级突出图融合获得显著图;最后,在OSIE和MIT数据集上验证了该方法,并与国际上流行的视觉注意方法进行对比,结果显示该算法在OSIE数据集上获得的AUC值相对更高。实验结果表明,所提方法能够更加充分地利用图像中对象信息,提高显著性预测的准确率。

关键词: 视觉注意, 自顶向下, 显著性, 对象信息, 卷积神经网

Abstract: Concerning the defects of fusing object information by existing visual attention models, a new visual attention method combining high-level object features and low-level pixel features was proposed. Firstly, high-level feature maps were obtained by using Convolutional Neural Network (CNN) which has strong understanding of multi-class targets. Then all object feature maps were combined by training the weights with eye fixation data. Then the saliency map was obtained by fusing pixel-level conspicuity map and object-level conspicuity map. Finally, the proposed method was compared with many popular visual attention methods on OSIE and MIT datasets. Compared with the contrast methods, the Area Under Curve (AUC) result of the proposed method is increased. Experimental results show that the proposed method can make full use of the object information in the image, and increases the saliency prediction accuracy.

Key words: visual attention, top-down, saliency, object information, Convolutional Neural Network (CNN)

中图分类号: