Journal of Computer Applications, 2021, Vol. 41, Issue (1): 208-214. DOI: 10.11772/j.issn.1001-9081.2020060968

Special issue: The 8th China Conference on Data Mining (CCDM 2020)


Visual saliency detection based on multi-level global information propagation model

WEN Jing, SONG Jianwei

  1. School of Computer and Information Technology, Shanxi University, Taiyuan Shanxi 030006, China
  • Received: 2020-05-31  Revised: 2020-07-15  Online: 2021-01-10  Published: 2020-11-12
  • Corresponding author: WEN Jing
  • About the authors: WEN Jing, born in 1982 in Jinzhong, Shanxi, is an associate professor with a Ph. D. degree and a member of CCF; her research interests include computer vision, image processing and pattern recognition. SONG Jianwei, born in 1994 in Taiyuan, Shanxi, is a master candidate; his research interests include computer vision and image processing.
  • Supported by:
    This work is partially supported by the Youth Program of National Natural Science Foundation of China (61703252) and the Shanxi Applied Basic Research Program (201701D121053).


Abstract: The idea of hierarchically processing the convolutional features of a neural network significantly improves the performance of salient object detection. However, when integrating hierarchical features, it remains an open problem how to obtain rich global information and how to effectively fuse the global information of the higher-level feature space with the low-level detail information. Therefore, a saliency detection algorithm based on a multi-level global information propagation model was proposed. In order to extract rich multi-scale global information, a Multi-scale Global Feature Aggregation Module (MGFAM) was introduced at the higher levels, and a feature fusion operation was performed on the global information extracted from multiple levels. In addition, in order to obtain the global information of the high-level feature space and the rich low-level detail information at the same time, the extracted discriminative high-level global semantic information was fused with the lower-level features by means of feature propagation. These operations extracted the high-level global semantic information to the greatest extent while avoiding the loss of this information as it was gradually propagated to the lower levels. Experimental results on four datasets, ECSSD, PASCAL-S, SOD and HKU-IS, show that compared with the advanced NLDF (Non-Local Deep Features for salient object detection) model, the proposed algorithm improves the F-measure (F) by 0.028, 0.05, 0.035 and 0.013 respectively, and reduces the Mean Absolute Error (MAE) by 0.023, 0.03, 0.023 and 0.007 respectively. The proposed algorithm also outperforms several classical image saliency detection methods in terms of precision, recall, F-measure and MAE.
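The abstract gives no implementation details, so the following is only a minimal PyTorch-style sketch of one plausible way to realize a multi-scale global feature aggregation module and the top-down propagation of its output to lower-level features. The class names, channel sizes and pooling scales are assumptions made for illustration, not the authors' actual configuration.

```python
# Hypothetical sketch (not the authors' code): a pyramid-pooling style module
# that aggregates multi-scale global context from a high-level feature map,
# plus a helper that propagates this context down to a lower-level feature map.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiScaleGlobalAggregation(nn.Module):
    """Pools the input at several scales, compresses each pooled map with a
    1x1 convolution, upsamples it back, and fuses everything with the input."""
    def __init__(self, in_ch, out_ch, scales=(1, 2, 3, 6)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.AdaptiveAvgPool2d(s),                  # coarse-to-global pooling
                nn.Conv2d(in_ch, out_ch, kernel_size=1),  # channel reduction
                nn.ReLU(inplace=True),
            )
            for s in scales
        ])
        self.fuse = nn.Sequential(
            nn.Conv2d(in_ch + out_ch * len(scales), out_ch, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        h, w = x.shape[2:]
        context = [
            F.interpolate(branch(x), size=(h, w), mode='bilinear', align_corners=False)
            for branch in self.branches
        ]
        return self.fuse(torch.cat([x] + context, dim=1))

def propagate_global(global_feat, low_feat, proj):
    """Upsample high-level global semantics to the resolution of a lower-level
    feature map and fuse the two, keeping global context and detail together."""
    up = F.interpolate(global_feat, size=low_feat.shape[2:], mode='bilinear',
                       align_corners=False)
    return proj(torch.cat([up, low_feat], dim=1))
```

In a full model, such a module would typically be applied to the top few encoder stages, the resulting global features fused, and the fused result passed stage by stage toward the lower layers, which is consistent with the multi-level propagation described in the abstract.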

Key words: saliency detection, global information, neural network, information propagation, multi-scale pooling
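For reference, the F-measure and MAE quoted in the comparison above are usually computed as follows in the salient object detection literature; the binarization threshold and the weight beta^2 = 0.3 are common conventions assumed here, not values stated in the paper.

```python
# Conventional evaluation metrics for saliency maps (assumed formulation,
# not taken from the paper): weighted F-measure and mean absolute error.
import numpy as np

def f_measure(pred, gt, beta2=0.3, threshold=0.5):
    """pred: saliency map in [0, 1]; gt: binary ground-truth mask."""
    binary = (pred >= threshold).astype(np.float64)
    tp = (binary * gt).sum()
    precision = tp / (binary.sum() + 1e-8)
    recall = tp / (gt.sum() + 1e-8)
    return (1 + beta2) * precision * recall / (beta2 * precision + recall + 1e-8)

def mae(pred, gt):
    """Mean Absolute Error between the saliency map and the ground truth."""
    return np.abs(pred.astype(np.float64) - gt.astype(np.float64)).mean()
```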

CLC Number: