盲去模糊的多尺度编解码深度卷积网络

doi:10.11772/j.issn.1001-9081.2019020373

计算机应用 ›› 2019, Vol. 39 ›› Issue (9): 2552-2557.DOI: 10.11772/j.issn.1001-9081.2019020373

盲去模糊的多尺度编解码深度卷积网络

贾瑞明, 邱桢芝, 崔家礼, 王一丁

北方工业大学信息学院, 北京 100144

收稿日期:2019-03-07 修回日期:2019-04-19 发布日期:2019-05-14 出版日期:2019-09-10
通讯作者: 贾瑞明
作者简介:贾瑞明(1978-),男,山东青岛人,助理研究员,博士,主要研究方向:计算机视觉、深度学习、模式识别;邱桢芝(1994-),女,山西长治人,硕士研究生,主要研究方向:计算机视觉、深度学习;崔家礼(1975-),男,山东枣庄人,助理研究员,博士,主要研究方向:图像处理、模式识别;王一丁(1967-),男,辽宁沈阳人,教授,博士,主要研究方向:图像处理、图像分析与识别。
基金资助:
国家自然科学基金面上项目（61673021）。

Deep multi-scale encoder-decoder convolutional network for blind deblurring

JIA Ruiming, QIU Zhenzhi, CUI Jiali, WANG Yiding

School of Information Science and Technology, North China University of Technology, Beijing 100144, China

Received:2019-03-07 Revised:2019-04-19 Online:2019-05-14 Published:2019-09-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61673021).

摘要/Abstract

摘要：

针对拍摄场景中物体运动不一致所带来的非均匀模糊，为提高复杂运动场景中去模糊的效果，提出一种多尺度编解码深度卷积网络。该网络采用"从粗到细"的多尺度级联结构，在模糊核未知条件下，实现盲去模糊；其中，在该网络的编解码模块中，提出一种快速多尺度残差块，使用两个感受野不同的分支增强网络对多尺度特征的适应能力；此外，在编解码之间增加跳跃连接，丰富解码端信息。与2018年国际计算机视觉与模式识别会议（CVPR）上提出的多尺度循环网络相比，峰值信噪比（PSNR）高出0.06 dB；与2017年CVPR上提出的深度多尺度卷积网络相比，峰值信噪比和平均结构相似性（MSSIM）分别提高了1.4%和3.2%。实验结果表明，该网络能快速去除图像模糊，恢复出图像原有的边缘结构和纹理细节。

关键词: 盲去模糊, 多尺度结构, 跳跃连接, 编解码, 卷积神经网络

Abstract:

Aiming at the heterogeneous blur of images caused by inconsistent motion of objects in the shooting scene, a deep multi-scale encoder-decoder convolutional network was proposed to improve the deblurring effect in complex motion scenes. A multi-scale cascade structure named "from coarse to fine" was applied to this network, and blind deblurring was achieved with the blur kernel unknown. In the encoder-decoder module of the network, a fast multi-scale residual block was proposed, which used two branches with different receptive fields to enhance the adaptability of the network to multi-scale features. In addition, skip connections were added between the encoder and the decoder to enrich the information of the decoder. The Peak Signal-to-Noise Ratio (PSNR) value pf this network is 0.06 dB higher than that of the Scale-recurrent Network proposed on CVPR(Conference on Computer Vision and Pattern Recognition)2018; the PSNR and Mean Structural Similarity (MSSIM) values are increased by 1.4% and 3.2% respectively compared to those of the deep multi-scale convolution network proposed on CVPR2017. The experimental results show that the proposed network can deblur the image quickly and restore the edge structure and texture details of the image.

Key words: blind deblurring, multi-scale structure, skip connection, encoder-decoder, Convolutional Neural Network (CNN)

中图分类号:

TP391

贾瑞明, 邱桢芝, 崔家礼, 王一丁. 盲去模糊的多尺度编解码深度卷积网络[J]. 计算机应用, 2019, 39(9): 2552-2557.

JIA Ruiming, QIU Zhenzhi, CUI Jiali, WANG Yiding. Deep multi-scale encoder-decoder convolutional network for blind deblurring[J]. Journal of Computer Applications, 2019, 39(9): 2552-2557.

参考文献

[1] NAH S, KIM T H, LEE K M. Deep multi-scale convolutional neural network for dynamic scene deblurring[C]//CVPR 2017:Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2017, 1:257-265.
[2] PAN J, HU Z, SU Z, et al. Deblurring text images via l0-regularized intensity and gradient prior[C]//CVPR 2014:Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2014:2901-2908.
[3] LI L, PAN J, LAI W-S, et al. Learning a discriminative prior for blind image deblurring[C]//CVPR 2018:Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2018:6616-6625.
[4] ZHANG X, DONG H, HU Z, et al. Gated fusion network for joint image deblurring and super-resolution[EB/OL].[2019-01-05]. https://arxiv.org/pdf/1807.10806.pdf.
[5] SUN J, CAO W, XU Z, et al. Learning a convolutional neural network for non-uniform motion blur removal[C]//CVPR 2015:Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2015:769-777.
[6] KUPYN O, BUDZAN V, MYKHAILYCH M, et al. DeblurGAN:blind motion deblurring using conditional adversarial networks[EB/OL].[2019-01-05]. https://arxiv.org/pdf/1711.07064.pdf.
[7] TAO X, GAO H, WANG Y, et al. Scale-recurrent network for deep image deblurring[EB/OL].[2019-01-05]. https://arxiv.org/pdf/1802.01770.pdf.
[8] KRIZHEVSKY A, SUTSKEVER I, HINTON G, et al. ImageNet classification with deep convolution neural network[C]//NIPS'12:Proceedings of the 25th International Conference on Neural Information Processing Systems. North Miami Beach, FL, USA:Curran Associates, 2012, 1:1097-1105.
[9] LOFFE S, SZEGEDY C. Batch normalization:accelerating deep network training by reducing internal covariate shift[C]//ICML'15:Proceedings of the 32nd International Conference on Machine Learning.[S.l.]:JMLR.org, 2015:448-456.
[10] LI J, FANG F, MEI K, et al. Multi-scale residual network for image super-resolution[C]//Proceedings of the 2018 European Conference on Computer Vision, LNCS 11212. Berlin:Springer, 2018:527-542.
[11] KÖHLER R, HIRSCH M, MOHLER B, et al. Recording and playback of camera shake:benchmarking blind deconvolution with a real-world database[EB/OL].[2019-01-05]. http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=4F605BF966AB6B236B6591E377AC8243?doi=10.1.1.379.1398&rep=rep1&type=pdf.
[12] LAI W, HUANG J, AHUJA N, et al. Deep Laplacian pyramid networks for fast and accurate super-resolution[C]//CVPR 2017:Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017, 1:5835-5843.
[13] MAO X-J, SHEN C, YANG Y-B, et al. Image restoration using convolutional auto-encoders with symmetric skip connections[EB/OL].[2019-01-07]. https://arxiv.org/pdf/1606.08921.pdf.
[14] SU S, DELBRACIO M, WANG J, et al. Deep video deblurring for hand-held cameras[C]//CVPR 2017:Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017, 1:237-246.
[15] RONNEBERGER O, FISCHER P, BROX T. U-net:convolutional networks for biomedical image segmentation[C]//Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention, LNCS 9351. Berlin:Springer, 2015:234-241.
[16] KIM T H, LEE K M. Segmentation-free dynamic scene deblurring[C]//CVPR 2014:Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2014, 1:2766-2773.
[17] GONG D, YANG J, LIU L, et al. From motion blur to motion flow:a deep learning solution for removing heterogeneous motion blur[C]//CVPR 2017:Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017, 1:3806-3815.
[18] ZHANG J, PAN J, REN J, et al. Dynamic scene deblurring using spatially variant recurrent neural networks[C]//CVPR 2018:Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2018:2521-2529.
[19] 于春和,祁奇.离焦模糊图像复原技术综述[J].沈阳航空航天大学学报,2018,35(5):57-63.(YU C H, QI Q. A survey of defocusing image restoration techniques[J]. Journal of Shenyang Aerospace University, 2018, 35(5):57-63.)

盲去模糊的多尺度编解码深度卷积网络

Deep multi-scale encoder-decoder convolutional network for blind deblurring

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	宋中山, 梁家锐, 郑禄, 刘振宇, 帖军. 基于双向门控尺度特征融合的遥感场景分类[J]. 计算机应用, 2021, 41(9): 2726-2735.
[2]	李康康, 张静. 基于注意力机制的多层次编码和解码的图像描述模型[J]. 计算机应用, 2021, 41(9): 2504-2509.
[3]	张永斌, 常文欣, 孙连山, 张航. 基于字典的域名生成算法生成域名的检测方法[J]. 计算机应用, 2021, 41(9): 2609-2614.
[4]	赵宏, 孔东一. 图像特征注意力与自适应注意力融合的图像内容中文描述[J]. 计算机应用, 2021, 41(9): 2496-2503.
[5]	徐江浪, 李林燕, 万新军, 胡伏原. 结合目标检测的室内场景识别方法[J]. 计算机应用, 2021, 41(9): 2720-2725.
[6]	牟长宁, 王海鹏, 周丕宇, 侯鑫行. 基于图卷积神经网络的串联质谱从头测序[J]. 计算机应用, 2021, 41(9): 2773-2779.
[7]	王贺兵, 张春梅. 基于非对称卷积-压缩激发-次代残差网络的人脸关键点检测[J]. 计算机应用, 2021, 41(9): 2741-2747.
[8]	曹玉红, 徐海, 刘荪傲, 王紫霄, 李宏亮. 基于深度学习的医学影像分割研究综述[J]. 计算机应用, 2021, 41(8): 2273-2287.
[9]	秦斌斌, 彭良康, 卢向明, 钱江波. 司机分心驾驶检测研究进展[J]. 计算机应用, 2021, 41(8): 2330-2337.
[10]	黄程程, 董霄霄, 李钊. 基于二维Winograd算法的深流水线5×5卷积方法[J]. 计算机应用, 2021, 41(8): 2258-2264.
[11]	曾祥银, 郑伯川, 刘丹. 基于深度卷积神经网络和聚类的左右轨道线检测[J]. 计算机应用, 2021, 41(8): 2324-2329.
[12]	武光利, 李雷霆, 郭振洲, 王成祥. 基于改进的双向长短期记忆网络的视频摘要生成模型[J]. 计算机应用, 2021, 41(7): 1908-1914.
[13]	吴则举, 焦翠娟, 陈亮. 基于改进Faster R-CNN的轮胎缺陷检测方法[J]. 计算机应用, 2021, 41(7): 1939-1946.
[14]	杨粟, 欧阳智, 杜逆索. 基于相关度距离的无监督并行哈希图像检索[J]. 计算机应用, 2021, 41(7): 1902-1907.
[15]	高钦泉, 黄炳城, 刘文哲, 童同. 基于改进CenterNet的竹条表面缺陷检测方法[J]. 计算机应用, 2021, 41(7): 1933-1938.