计算机应用 ›› 2015, Vol. 35 ›› Issue (9): 2624-2628.DOI: 10.11772/j.issn.1001-9081.2015.09.2624

• 虚拟现实与数字媒体 • 上一篇    下一篇

多对象图像数据集建立及显著性检测算法评估

郑斌1, 牛玉贞1,2, 柯玲玲1   

  1. 1. 福州大学 数学与计算机科学学院, 福州 350116;
    2. 福建省网络计算与智能信息处理重点实验室(福州大学), 福州 350116
  • 收稿日期:2015-04-30 修回日期:2015-06-29 出版日期:2015-09-10 发布日期:2015-09-17
  • 通讯作者: 牛玉贞(1982-),女,山东济南人,教授,博士,CCF会员,主要研究方向:图像和视频处理、计算机视觉,yuzhenniu@gmail.com
  • 作者简介:郑斌(1990-),男,福建厦门人,硕士研究生,主要研究方向:图像和视频处理、计算机视觉;柯玲玲(1991-),女,福建莆田人,硕士研究生,主要研究方向:图像和视频处理、计算机视觉。
  • 基金资助:
    国家自然科学基金资助项目(61300102);福建省自然科学基金(杰青)资助项目(2015J06014);福建省自然科学基金(面上)资助项目(2014J01233)。

New multi-object image dataset construction and evaluation of visual saliency analysis algorithm

ZHENG Bin1, NIU Yuzhen1,2, KE Lingling1   

  1. 1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350116, China;
    2. Fujian Key Laboratory of Network Computing and Intelligent Information Processing (Fuzhou University), Fuzhou Fujian 350116, China
  • Received:2015-04-30 Revised:2015-06-29 Online:2015-09-10 Published:2015-09-17

摘要: 图像视觉显著性检测算法在已有数据集上已经取得很好的结果,但是目前的多个数据集存在两个严重的问题:首先,数据集中的图像以只包含一个显著对象的图像为主;其次,在建立显著对象标注结果的过程中,忽略了用户对同一幅图像中包含的多个显著对象的不同认知。上述问题导致了在已有数据集上对显著性检测算法进行评估,不能体现算法在实际应用中的真实效果。为此,提出体现用户认知的多显著对象图像标注方法,首先设计并实现辅助软件,收集用户对各显著对象的重要程度的认知情况,包括显著区域与相应的重要程度;然后融合收集的多用户数据,绘制出以灰度图为表现形式的显著对象标注结果,并通过灰度值体现多用户对于每个显著对象的认知情况。基于改进的显著对象标注方法,建立了一个包含1000幅多显著对象图像的数据集,并为每幅图像提供了体现用户认知的显著对象标注结果。对10种具有代表性的显著性检测算法在已有数据集和建立的数据集上的性能进行了比较。实验结果表明,这些显著性检测算法在建立的数据集上的性能有大幅度的降低,例如受试者工作特征曲线下面积(ROC-AUC)评估参数的最大降幅超过了0.5,这证实了已有数据集存在的问题及建立新数据集的需求,同时指出显著性检测算法在处理包含多显著对象的复杂图像上存在的不足。

关键词: 视觉显著性检测, 多对象图像, 数据集, 用户认知, 算法评估

Abstract: Image visual saliency analysis algorithms have achieved satisfactory performance on existing datasets, but these datasets have two major problems. Firstly, most of the images contain only one salient object. Secondly, users' cognition of multiple salient objects in the same image was ignored during building salient objects' ground truth. The above problems result in that the performance of saliency analysis algorithms used in the real applications cannot be reflected by the evaluation on the existing datasets. So in this paper, a novel method of labeling the ground truth of salient objects was proposed. Firstly, a software to collect users' cognition of the important values of multiple salient objects in each image was designed and implemented. Then, according to the collected data from each user, the ground truth map represented as a gray scale image was created by manually labeling the regions covered by the salient objects. The pixel value of each region equals to the collected saliency in the first step. Based on the improved ground truth labeling method, a salient object dataset contains 1000 multi-object images was built. A ground truth map for each image was created to record users' cognition of the objects' saliencies. Then 10 state-of-the-art saliency analysis algorithms on existing datasets and the established dataset were compared. The experimental results show that these algorithms' performances are greatly reduced on the established dataset, such as the Area Under Curve of Receiver-Operating Characteristic (ROC-AUC) has a greatest decline of more than 0.5. The results prove the problems of existing datasets and the demand of building a new dataset, and point out the insufficiency of saliency analysis algorithms on complex images with multiple salient objects.

Key words: visual saliency analysis, multi-object image, dataset, users cognition, algorithms evaluation

中图分类号: