基于生成对抗网络的文本图像联合超分辨率与去模糊方法

doi:10.11772/j.issn.1001-9081.2019071205

计算机应用 ›› 2020, Vol. 40 ›› Issue (3): 859-864.DOI: 10.11772/j.issn.1001-9081.2019071205

• 虚拟现实与多媒体计算 • 上一篇下一篇

基于生成对抗网络的文本图像联合超分辨率与去模糊方法

陈赛健, 朱远平

天津师范大学计算机与信息工程学院, 天津 300387

收稿日期:2019-07-15 修回日期:2019-09-03 出版日期:2020-03-10 发布日期:2020-03-23
通讯作者: 朱远平
作者简介:陈赛健(1994-),男,江苏启东人,硕士研究生,主要研究方向:图像处理、模式识别;朱远平(1978-),男,江西临川人,教授,博士,主要研究方向:图像处理、模式识别。
基金资助:
国家自然科学基金资助项目（61602345， 61703306）；天津自然科学基金资助项目（18JCYBJC85000， 16JCQNJC00600）。

Joint super-resolution and deblurring method based on generative adversarial network for text images

CHEN Saijian, ZHU Yuanping

College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China

Received:2019-07-15 Revised:2019-09-03 Online:2020-03-10 Published:2020-03-23
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61602345, 61703306), the Natural Science Foundation of Tianjin (18JCYBJC85000, 16JCQNJC00600).

摘要/Abstract

摘要： 针对现有的超分辨率方法难以从模糊的低分辨率图像中重建出清晰的高分辨率图像的问题，提出了一种基于生成式对抗网络（GAN）的文本图像联合超分辨率与去模糊方法。首先，本方法聚焦于严重模糊的低分辨率文本图像，由上采样模块和去模糊模块两部分组成生成器网络；然后，通过上采样模块对输入图像上采样，生成模糊的超分辨率图像；进一步利用去模糊模块重建出清晰的超分辨率图像；最后，为了更好地恢复文本图像，引入了一个联合训练损失，包含超分辨率像素损失与去模糊像素损失、语义层的特征匹配损失以及对抗损失。在合成图像和真实图像上的大量实验结果表明，与现有的先进算法——单类GAN （SCGAN）相比，峰值信噪比（PSNR）、结构相似度（SSIM）和光学字符识别（OCR）精度分别提高了1.52 dB、0.011 5和13.2个百分点。所提方法能更好地处理真实场景下的退化文本图像，同时计算成本较低。

关键词: 超分辨率, 去模糊, 生成对抗网络, 残差学习, 文本图像

Abstract: Aiming at the difficulty to reconstruct clear high-resolution images from blurred low-resolution images by the existing super-resolution methods, a joint text image joint super-resolution and deblurring method based on Generative Adversarial Network (GAN) was proposed. Firstly, the low-resolution text images with severe blur were focused, and the down-sampling module and the deblurring module were used to generate the generator network. Secondly, the input images were down-sampled by the down-sampling module to generate blurred super-resolution images. Thirdly, the deblurring module was used to reconstruct the clear super-resolution images. Finally, in order to recover the text images better, a joint training loss including super-resolution pixel loss, deblurring pixel loss, semantic layer feature matching loss and adversarial loss was introduced. Extensive experiments on synthetic and real-world images demonstrate that compared with the existing advanced method SCGAN (Single-Class GAN), the proposed method has the Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM) and OCR (Optical Character Recognition) accuracy improved by 1.52 dB, 0.011 5 and 13.2 percentage points respectively. The proposed method can better deal with degraded text images in real scenes with low computational cost.

Key words: super-resolution, deblurring, Generative Adversarial Network （GAN）, residual learning, text image

中图分类号:

TP391.41

陈赛健, 朱远平. 基于生成对抗网络的文本图像联合超分辨率与去模糊方法[J]. 计算机应用, 2020, 40(3): 859-864.

CHEN Saijian, ZHU Yuanping. Joint super-resolution and deblurring method based on generative adversarial network for text images[J]. Journal of Computer Applications, 2020, 40(3): 859-864.

参考文献

[1] LI J,LIANG X,WEI Y,et al. Perceptual generative adversarial networks for small object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:1951-1959.
[2] ZHANG H,YANG J,ZHANG Y,et al. Close the loop:joint blind image restoration and recognition with sparse representation prior[C]//Proceedings of the 2011 International Conference on Computer Vision. Piscataway:IEEE,2011:770-777.
[3] WANG X,YU K,WU S,et al. ESRGAN:enhanced super-resolution generative adversarial networks[C]//Proceedings of the 2018 European Conference on Computer Vision,LNCS 11133. Cham:Springer,2018:63-79.
[4] KUPYN O,BUDZAN V,MYKHAILYCH M,et al. DeblurGAN:blind motion deblurring using conditional adversarial networks[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:8183-8192.
[5] 杨玲, 刘怡光, 黄蓉刚, 等. 新的基于稀疏表示单张彩色超分辨率算法[J]. 计算机应用,2013,33(2):472-475.(YANG L,LIU Y G,HUANG R G,et al. New approach for super-resolution from a single color image based on sparse coding[J]. Journal of Computer Applications,2013,33(2):472-475.)
[6] HUANG J B,SINGH A,AHUJA N. Single image super-resolution from transformed self-exemplars[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:5197-5206.
[7] DONG C,LOY C C,HE K,et al. Learning a deep convolutional network for image super-resolution[C]//Proceedings of the 2014 European Conference on Computer Vision, LNCS 8692. Cham:Springer,2014:184-199.
[8] KIM J,LEE J K,LEE K M. Accurate image super-resolution using very deep convolutional networks[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:1646-1654.
[9] LIM B,SON S,KIM H,et al. Enhanced deep residual networks for single image super-resolution[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops. Piscataway:IEEE,2017:136-144.
[10] LEDIG C,THEIS L,HUSZÁR F,et al. Photo-realistic single image super-resolution using a generative adversarial network[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:4681-4690.
[11] HARMELING S,HIRSCH M,SCHÖLKOPF B. Space-variant single-image blind deconvolution for removing camera shake[C]//Proceedings of the 23rd International Conference on Neural Information Processing Systems. New York:Curran Associates Inc., 2010:829-837.
[12] 陈华华, 鲍宗袍. 强边缘导向的盲去模糊算法[J]. 中国图象图形学报,2017,22(8):1034-1044. (CHEN H H,BAO Z P. Strong edge-oriented blind deblurring algorithm[J]. Journal of Image and Graphics,2017,22(8):1034-1044.)
[13] TAO X,GAO H,SHEN X,et al. Scale-recurrent network for deep image deblurring[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:8174-8182.
[14] PARK H,LEE K M. Joint estimation of camera pose,depth,deblurring,and super-resolution from a blurred image sequence[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway:IEEE,2017:4623-4631.
[15] YAMAGUCHI T,FUKUDA H,FURUKAWA R,et al. Video deblurring and super-resolution technique for multiple moving objects[C]//Proceedings of the 10th Asian Conference on Computer Vision,LNCS 6495. Berlin:Springer,2010:127-140.
[16] XU X,SUN D,PAN J,et al. Learning to super-resolve blurry face and text images[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway:IEEE,2017:251-260.
[17] ZHANG X,WANG F,DONG H,et al. A deep encoder-decoder networks for joint deblurring and super-resolution[C]//Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway:IEEE,2018:1448-1452.
[18] PAN J,LIU Y,DONG J,et al. Physics-based generative adversarial models for image restoration and beyond[EB/OL].[2018-08-02]. https://arxiv.org/pdf/1808.00605.pdf.
[19] ZHANG X,DONG H,HU Z,et al. Gated fusion network for joint image deblurring and super-resolution[EB/OL].[2018-07-27]. https://arxiv.org/pdf/1807.10806.pdf.
[20] ISOLA P,ZHU J,ZHOU T,et al. Image-to-image translation with conditional adversarial networks[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:5967-5976.
[21] LIN M,CHEN Q,YAN S. Network in network[EB/OL].[2018-12-16]. https://arxiv.org/pdf/1312.4400.pdf.
[22] NAH S,KIM T H,LEE K M. Deep multi-scale convolutional neural network for dynamic scene deblurring[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:3883-3891.
[23] JOHNSON J,ALAHI A,LI F. Perceptual losses for real-time style transfer and super-resolution[C]//Proceedings of the 2016 European Conference on Computer Vision,LNCS 9906. Cham:Springer,2016:694-711.
[24] HRADIŠ M,KOTERA J,ZEMČÍK P,et al. Convolutional neural networks for direct text deblurring[C]//Proceedings of the 2015 British Machine Vision Conference. Durham:BMVA, 2015:No. 6.
[25] KINGMA D P,BA J L. Adam:a method for stochastic optimization[EB/OL].[2018-12-22]. https://arxiv.org/pdf/1412.6980.pdf.
[26] HE K,ZHANG X,REN S,et al. Delving deep into rectifiers:Surpassing human-level performance on ImageNet classification[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway:IEEE,2015:1026-1034.
[27] SAJJADI M S M,SCHÖLKOPF B,HIRSCH M. EnhanceNet:single image super-resolution through automated texture synthesis[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway:IEEE,2017:4491-4510.

基于生成对抗网络的文本图像联合超分辨率与去模糊方法

Joint super-resolution and deblurring method based on generative adversarial network for text images

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	卞凌志, 王直杰. 基于增强多维多粒度级联森林的信用评分模型[J]. 计算机应用, 2021, 41(9): 2539-2544.
[2]	管其杰, 张挺, 李德亚, 周绍景, 杜奕. 基于多分辨率生成对抗网络的空间数据不确定性重建方法[J]. 计算机应用, 2021, 41(8): 2306-2311.
[3]	孙潇, 徐金东. 基于级联生成对抗网络的遥感图像去雾方法[J]. 计算机应用, 2021, 41(8): 2440-2444.
[4]	汤桂花, 孙磊, 毛秀青, 戴乐育, 胡永进. 基于深度对齐网络的生成对抗网络伪造人脸检测[J]. 计算机应用, 2021, 41(7): 1922-1927.
[5]	牛康力, 谌雨章, 沈君凤, 曾张帆, 潘永才, 王绎冲. 基于深度学习的双通道夜视图像复原方法[J]. 计算机应用, 2021, 41(6): 1775-1784.
[6]	王先武, 张挺, 吉欣, 杜奕. 基于带梯度惩罚深度卷积生成对抗网络的页岩三维数字岩心重构方法[J]. 计算机应用, 2021, 41(6): 1805-1811.
[7]	李衍志, 范勇, 高琳. 基于形态流的石油钻井水流异常检测[J]. 计算机应用, 2021, 41(6): 1842-1848.
[8]	井贝贝, 郭嘉, 王丽清, 陈静, 丁洪伟. 结合降噪卷积神经网络和条件生成对抗网络的图像双重盲降噪算法[J]. 计算机应用, 2021, 41(6): 1767-1774.
[9]	黄梨, 卢龙. 基于长距离依赖编码与深度残差U-Net的缺血性卒中病灶分割[J]. 计算机应用, 2021, 41(6): 1820-1827.
[10]	孙鹤立, 孙玉柱, 张晓云. 基于生成对抗网络的事件描述生成[J]. 计算机应用, 2021, 41(5): 1256-1261.
[11]	梁敏, 王昊榕, 张瑶, 李杰. 基于加速残差网络的图像超分辨率重建方法[J]. 计算机应用, 2021, 41(5): 1438-1444.
[12]	郭茂祖, 杨倩楠, 赵玲玲. 基于条件Wassertein生成对抗网络的图像生成[J]. 计算机应用, 2021, 41(5): 1432-1437.
[13]	卞鹏程, 郑忠龙, 李明禄, 何依然, 王天翔, 张大伟, 陈丽媛. 基于注意力融合网络的视频超分辨率重建[J]. 计算机应用, 2021, 41(4): 1012-1019.
[14]	欧莉莉, 邵峰晶, 孙仁诚, 隋毅. 基于半监督方法的脑梗死图像识别[J]. 计算机应用, 2021, 41(4): 1221-1226.
[15]	段友祥, 张含笑, 孙歧峰, 孙友凯. 基于拉普拉斯金字塔生成对抗网络的图像超分辨率重建算法[J]. 计算机应用, 2021, 41(4): 1020-1026.