基于RCF的跨层融合特征的边缘检测

doi:10.11772/j.issn.1001-9081.2019112057

计算机应用 ›› 2020, Vol. 40 ›› Issue (7): 2053-2058.DOI: 10.11772/j.issn.1001-9081.2019112057

• 虚拟现实与多媒体计算 • 上一篇下一篇

基于RCF的跨层融合特征的边缘检测

宋杰, 于裕, 骆起峰

安徽大学计算机科学与技术学院, 合肥 230039

收稿日期:2019-12-04 修回日期:2020-01-14 出版日期:2020-07-10 发布日期:2020-06-29
通讯作者: 于裕
作者简介:宋杰(1966-),男,安徽合肥人,副教授,博士,CCF会员,主要研究方向:智能计算、计算机体系结构;于裕(1995-),男,安徽阜阳人,硕士研究生,主要研究方向:计算机视觉、数据挖掘;骆起峰(1995-),男,安徽芜湖人,硕士研究生,主要研究方向:计算机视觉、智能计算。
基金资助:
国家自然科学基金资助项目（61974001）。

Cross-layer fusion feature based on richer convolutional features for edge detection

SONG Jie, YU Yu, LUO Qifeng

School of Computer Science and Technology, Anhui University, Hefei Anhui 230039, China

Received:2019-12-04 Revised:2020-01-14 Online:2020-07-10 Published:2020-06-29
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61974001).

摘要/Abstract

摘要： 针对当前基于深度学习的边缘检测技术产生的边缘线条杂乱且模糊等问题，提出了一种基于RCF的端到端的跨层融合多尺度特征的边缘检测（CFF）模型。该模型使用RCF作为基线，在主干网络中加入CBAM，采用具有平移不变性的下采样技术，并且去除了主干网络中的部分下采样操作，以保留图像的细节信息，同时使用扩张卷积技术增大模型感受野。此外，采用跨层融合特征图的方式，使得高低层特征能够充分融合。为了平衡各阶段损失和融合损失之间的关系，以及避免出现多尺度特征融合之后低层细节过度丢失的现象，对每个损失添加了一个权重。在伯克利分割数据集（BSDS500）和PASCAL VOL Context数据集上进行了训练，在测试时使用图像金字塔技术提高边缘图像的质量。实验结果表明，CFF模型提取的轮廓比基线网络更加清晰，能够解决边缘模糊问题。在BSDS500基准上进行的评估表明，该模型将最佳数据集规模（ODS）和最佳图像比例（OIS）指标分别提高到0.818和0.839。

关键词: 深度学习, 边缘检测, 注意力机制, 平移不变性, 跨层融合

Abstract: Aiming at the problems such as chaotic and fuzzy edge lines caused by current deep learning based edge detection technology, an end-to-end Cross-layer Fusion Feature for edge detection (CFF) model based on RCF (Richer Convolutional Features) was proposed. In this model, RCF was used as a baseline, the CBAM (Convolutional Block Attention Module) was added to the backbone network, translation-invariant downsampling technology was adopted, and some downsampling operations in the backbone network were removed in order to preserve the image details information, dilated convolution technique was used to increase the model receptive field at the same time. In addition, the method of cross-layer fusion of feature maps was adopted to enable high-level and low-level features to be fully fused together. In order to balance the relationship between the loss in each stage and the fusion loss, and to avoid the phenomenon of excessive loss of low-level details after multi-scale feature fusion, the weight parameters were added to the losses. The model was trained on Berkeley Segmentation Data Set (BSDS500) and PASCAL VOL Context dataset, and the image pyramid technology was used in testing to improve the quality of edge images. Experimental results show that the contour extracted by CFF model is clearer than that extracted by the baseline network and can solve the edge blurring problem. The evaluation performed on the BSDS500 benchmark shows that, the Optimal Dataset Scale (ODS) and the Optimal Image Scale (OIS) are improved to 0.818 and 0.839 respectively by this model.

Key words: deep learning, edge detection, attention mechanism, translation invariance, cross-layer fusion

中图分类号:

TP391.41

宋杰, 于裕, 骆起峰. 基于RCF的跨层融合特征的边缘检测[J]. 计算机应用, 2020, 40(7): 2053-2058.

SONG Jie, YU Yu, LUO Qifeng. Cross-layer fusion feature based on richer convolutional features for edge detection[J]. Journal of Computer Applications, 2020, 40(7): 2053-2058.

参考文献

[1] CHENG M,HOU Q,ZHANG S,et al. Intelligent visual media processing:when graphics meets vision[J]. Journal of Computer Science and Technology,2017,32(1):110-121.
[2] HU S,CHEN T,XU K,et al. Internet visual media processing:a survey with graphics and vision applications[J]. The Visual Computer,2013,29(5):393-405.
[3] FERRARI V,FEVRIER L,JURIE F,et al. Groups of adjacent contour segments for object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(1):36-51.
[4] ZITNICK C L,DOLLÁR P. Edge boxes:locating object proposals from edges[C]//Proceedings of the 13th European Conference on Computer Vision,LNCS 8693. Cham:Springer,2014:391-405.
[5] ARBELÁEZ P,PONT-TUSET J,BARRON J,et al. Multiscale combinatorial grouping[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2014:328-335.
[6] FELDMAN J A,FELDMAN G M,FALK G,et al. The Stanford hand-eye project[C]//Proceedings of the 1st International Joint Conference on Artificial Intelligence. San Francisco, CA:Morgan Kaufmann Publishers Inc.,1969:521-526,526a.
[7] CANNY J. A computational approach to edge detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,1986, PAMI-8(6):679-698.
[8] MARTIN D R,FOWLKES C C,MALIK J. Learning to detect natural image boundaries using local brightness,color,and texture cues[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2004,26(5):530-549.
[9] DOLLÁR P,ZITNICK C L. Fast edge detection using structured forests[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(8):1558-1570.
[10] GANIN Y,LEMPITSKY V. N⁴-fields:neural network nearest neighbor fields for image transforms[C]//Proceedings of the 12th Asian Conference on Computer Vision, LNCS 9004. Cham:Springer,2014:536-551.
[11] XIE S,TU Z. Holistically-nested edge detection[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway:IEEE,2015:1395-1403.
[12] LONG J,SHELHAMER E,DARRELL T. Fully convolutional networks for semantic segmentation[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:3431-3440.
[13] SIMONYAN K,ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[EB/OL].[2019-03-02]. https://arxiv.org/pdf/1409.1556.pdf.
[14] LIU Y,CHENG M,HU X,et al. Richer convolutional features for edge detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2017:5872-5881.
[15] WOO S,PARK J,LEE J Y,et al. CBAM:convolutional block attention module[C]//Proceedings of the 15th European Conference on Computer Vision,LNCS 11211. Cham:Springer,2018:3-19.
[16] ZHANG R. Making convolutional networks shift-invariant again[C]//Proceedings of the 36th International Conference on Machine Learning. New York:International Machine Learning Society, 2019:7324-7334.
[17] YU F,KOLTUN V. Multi-scale context aggregation by dilated convolutions[EB/OL].[2019-03-02]. https://arxiv.org/pdf/1511.07122.pdf.
[18] LIN T Y,DOLLÁR P,GIRSHICK R,et al. Feature pyramid networks for object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:936-944.
[19] MARTIN D,FOWLKES C,TAL D,et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics[C]//Proceedings of the 8th IEEE International Conference on Computer Vision. Piscataway:IEEE,2001:416-423.
[20] MOTTAGHI R,CHEN X,LIU X,et al. The role of context for object detection and semantic segmentation in the wild[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2014:891-898.
[21] HU J,SHEN L,SUN G. Squeeze-and-excitation networks[C]//. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:7132-7141.
[22] HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2016:770-778.
[23] ARBELÁEZ P,MAIRE M,FOWLKES C,et al. Contour detection and hierarchical image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(5):898-916.
[24] BERTASIUS G,SHI J,TORRESANI L. DeepEdge:a multi-scale bifurcated deep network for top-down contour detection[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:4380-4389.
[25] SHEN W,WANG X,WANG Y,et al. DeepContour:a deep convolutional feature learned by positive-sharing loss for contour detection[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:3982-3991.

基于RCF的跨层融合特征的边缘检测

Cross-layer fusion feature based on richer convolutional features for edge detection

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	谢德峰, 吉建民. 融入句法感知表示进行句法增强的语义解析[J]. 计算机应用, 2021, 41(9): 2489-2495.
[2]	代雨柔, 杨庆, 张凤荔, 周帆. 基于自监督学习的社交网络用户轨迹预测模型[J]. 计算机应用, 2021, 41(9): 2545-2551.
[3]	刘雅璇, 钟勇. 基于头实体注意力的实体关系联合抽取方法[J]. 计算机应用, 2021, 41(9): 2517-2522.
[4]	郑志强, 胡鑫, 翁智, 王雨禾, 程曦. 基于改进DenseNet的牛眼图像特征提取方法[J]. 计算机应用, 2021, 41(9): 2780-2784.
[5]	李康康, 张静. 基于注意力机制的多层次编码和解码的图像描述模型[J]. 计算机应用, 2021, 41(9): 2504-2509.
[6]	赵宏, 孔东一. 图像特征注意力与自适应注意力融合的图像内容中文描述[J]. 计算机应用, 2021, 41(9): 2496-2503.
[7]	徐江浪, 李林燕, 万新军, 胡伏原. 结合目标检测的室内场景识别方法[J]. 计算机应用, 2021, 41(9): 2720-2725.
[8]	陈成瑞, 孙宁, 何世彪, 廖勇. 面向C-V2X通信的基于深度学习的联合信道估计与均衡算法[J]. 计算机应用, 2021, 41(9): 2687-2693.
[9]	党伟超, 李涛, 白尚旺, 高改梅, 刘春霞. 基于自注意力长短期记忆网络的Web软件系统实时剩余寿命预测方法[J]. 计算机应用, 2021, 41(8): 2346-2351.
[10]	何正海, 线岩团, 王蒙, 余正涛. 融合句法指导与字符注意力机制的案情阅读理解方法[J]. 计算机应用, 2021, 41(8): 2427-2431.
[11]	曹玉红, 徐海, 刘荪傲, 王紫霄, 李宏亮. 基于深度学习的医学影像分割研究综述[J]. 计算机应用, 2021, 41(8): 2273-2287.
[12]	秦斌斌, 彭良康, 卢向明, 钱江波. 司机分心驾驶检测研究进展[J]. 计算机应用, 2021, 41(8): 2330-2337.
[13]	杜炎, 吕良福, 焦一辰. 基于模糊推理的模糊原型网络[J]. 计算机应用, 2021, 41(7): 1885-1890.
[14]	武维, 李泽平, 杨华蔚, 林川, 王忠德. 融合内容特征和时序信息的深度注意力视频流行度预测模型[J]. 计算机应用, 2021, 41(7): 1878-1884.
[15]	李扬志, 袁家政, 刘宏哲. 基于时空注意力图卷积网络模型的人体骨架动作识别算法[J]. 计算机应用, 2021, 41(7): 1915-1921.