Image super-resolution reconstruction method combining perceptual edge constraint and multi-scale fusion network

Journal of Computer Applications, 2020, Vol. 40, Issue (10): 3041-3047. DOI: 10.11772/j.issn.1001-9081.2020020185

• Virtual reality and multimedia computing •

OUYANG Ning^1,2, WEI Yu^2, LIN Leping^1,2

1. Key Laboratory of Cognitive Radio and Information Processing, Ministry of Education (Guilin University of Electronic Technology), Guilin, Guangxi 541004, China;
2. School of Information and Communication, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China

Received: 2020-02-24 Revised: 2020-04-02 Published: 2020-10-10 Released: 2020-04-24
Corresponding author: LIN Leping
About the authors: OUYANG Ning (born 1972), male, from Ningyuan, Hunan, professor, M.S.; research interests: digital image processing, intelligent information processing. WEI Yu (born 1995), male, from Yulin, Guangxi, M.S. candidate; research interests: pattern recognition, deep learning. LIN Leping (born 1980), female, from Guiping, Guangxi, associate professor, Ph.D.; research interests: machine learning, intelligent information processing, image signal processing.
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61661017, 61967005, U1501252), the General Program of the China Postdoctoral Science Foundation (2016M602923XB), the Natural Science Foundation of Guangxi (2017GXNSFBA198212), the Science and Technology Base and Talent Project of Guangxi (AD19110060), the Project of the Key Laboratory of Cognitive Radio and Information Processing of the Ministry of Education (CRKL190107, CRKL160104), and the Graduate Education Innovation Program of Guilin University of Electronic Technology (2019YCXS022).

Abstract

Image super-resolution reconstruction models need a large number of parameters to capture the statistical relationship between Low-Resolution (LR) and High-Resolution (HR) images, and network models optimized with L₁ or L₂ loss cannot effectively recover high-frequency image details. To address these problems, an image super-resolution reconstruction method combining a perceptual edge constraint with a multi-scale fusion network is proposed. Following a coarse-to-fine strategy, the method uses a two-stage network model. In the first stage, a Convolutional Neural Network (CNN) extracts image features and upsamples them to the HR size to obtain coarse features. In the second stage, multi-scale estimation lets a low-dimensional statistical model gradually approach the high-dimensional statistical model: the coarse features output by the first stage are taken as input to extract multi-scale image features, and an attention fusion module progressively fuses the features of different scales to refine the features extracted in the first stage. In addition, richer convolutional features for edge detection are introduced and used as a perceptual edge constraint to optimize the network, so that high-frequency image details are recovered better. Experimental results on benchmark datasets such as Set5, Set14 and BSDS100 show that, compared with existing CNN-based super-resolution reconstruction methods, the proposed method not only reconstructs sharper edges and textures but also improves Peak Signal-to-Noise Ratio (PSNR) and Structural SIMilarity (SSIM) at magnification factors of ×3 and ×4.

Key words: Convolutional Neural Network (CNN), multi-scale, attention fusion, perceptual edge constraint, super-resolution reconstruction
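The idea of constraining a network with an edge term in addition to a pixel loss can be illustrated with a minimal sketch. This is not the authors' implementation: a fixed Sobel filter stands in for the learned edge detector built on richer convolutional features, images are plain 2-D lists of grayscale values, and the names `sobel_edges`, `mean_abs_diff`, `edge_constrained_loss` and the weight `edge_weight` are hypothetical choices made for illustration.

```python
import math


def sobel_edges(img):
    """Gradient-magnitude map of a 2-D grayscale image (list of rows).

    Assumption: a Sobel filter is used here as a simple stand-in for a
    learned edge detector. Border pixels are left at 0.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            out[y][x] = math.hypot(gx, gy)
    return out


def mean_abs_diff(a, b):
    """Mean absolute difference (per-pixel L1 distance) between two maps."""
    n = len(a) * len(a[0])
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb)) / n


def edge_constrained_loss(sr, hr, edge_weight=0.1):
    """Pixel L1 loss plus a weighted L1 loss between edge maps.

    edge_weight is a hypothetical hyperparameter, not a value from the paper.
    """
    pixel_term = mean_abs_diff(sr, hr)
    edge_term = mean_abs_diff(sobel_edges(sr), sobel_edges(hr))
    return pixel_term + edge_weight * edge_term
```

With a sharp vertical step edge as the HR target and a smoothed ramp as the reconstruction, the edge term adds a penalty on top of the pixel term, which is the kind of pressure toward sharper edges that the abstract attributes to the perceptual edge constraint. In a real training setup the edge maps would come from a differentiable edge network rather than a fixed filter.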

CLC number: TP391.41

OUYANG Ning, WEI Yu, LIN Leping. Image super-resolution reconstruction method combining perceptual edge constraint and multi-scale fusion network[J]. Journal of Computer Applications, 2020, 40(10): 3041-3047.

