Image inpainting model of dual-discriminator generative adversarial network based on gated convolution and SENet

doi:10.11772/j.issn.1001-9081.2022060949

Abstract

To address the unrealistic details produced by existing models when repairing images with random irregular masks and complex semantic content, an image inpainting model based on a dual-discriminator generative adversarial network with gated convolution and SENet (Squeeze-and-Excitation Network) was proposed. Firstly, the damaged image and its mask were input into a coarse network composed of stacked gated convolutions, Squeeze-and-Excitation (SE) attention was added during upsampling, and an L1 reconstruction loss was applied to obtain a preliminary repair result. Secondly, the preliminary repair result was input into a refine network composed of several gated convolution blocks and SE attention blocks, where reconstruction loss, perceptual loss and adversarial loss were combined to improve important features and details; the intact area of the damaged image was then overlaid on the output of the refine network to obtain the completed repair result. Finally, a dual-discriminator network structure was used for training, so that both the output of the refine network and the completed result became more realistic. Experimental results on the CelebA dataset show that, for images with large-area irregular masks, the proposed model achieves a Peak Signal-to-Noise Ratio (PSNR) of 27.39 dB, which is 6.74% higher than partial convolution, and a Structural Similarity (SSIM) index of 0.9216, which is 2.95% higher than partial convolution. The results indicate that SE attention and the dual-discriminator structure help to improve the details of image inpainting.
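
The three building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the convolutions themselves are omitted, ReLU is assumed as the feature activation, the mask convention (1 = hole, 0 = intact) is an assumption, and all function names and weights are hypothetical. The gating and SE steps follow the standard formulations from the gated convolution and SENet literature.

```python
import math
from typing import List

Vector = List[float]  # one flattened feature map or image channel


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gated_activation(feature: Vector, gating: Vector) -> Vector:
    """Gated convolution output: activation(feature) * sigmoid(gating).

    `feature` and `gating` stand for the outputs of two parallel
    convolutions over the same input (the convolutions are omitted;
    ReLU as the activation is an assumption of this sketch).
    """
    return [max(f, 0.0) * sigmoid(g) for f, g in zip(feature, gating)]


def se_reweight(channels: List[Vector],
                w1: List[Vector], w2: List[Vector]) -> List[Vector]:
    """Squeeze-and-Excitation channel attention on flattened channels.

    Squeeze: global average per channel.
    Excitation: FC (w1) -> ReLU -> FC (w2) -> sigmoid.
    Scale: multiply every channel by its learned weight.
    """
    squeezed = [sum(c) / len(c) for c in channels]
    hidden = [max(sum(w * s for w, s in zip(row, squeezed)), 0.0)
              for row in w1]
    scales = [sigmoid(sum(w * h for w, h in zip(row, hidden)))
              for row in w2]
    return [[v * s for v in c] for c, s in zip(channels, scales)]


def composite(refined: Vector, damaged: Vector, mask: Vector) -> Vector:
    """Keep intact pixels of the damaged image, fill holes from the
    refine network output (assumed mask convention: 1 = hole, 0 = intact)."""
    return [m * r + (1.0 - m) * d
            for r, d, m in zip(refined, damaged, mask)]


# Only the middle pixel is a hole, so only it is taken from `refined`.
completed = composite([0.2, 0.4, 0.6], [0.9, 0.0, 0.1], [0.0, 1.0, 0.0])
print(completed)  # [0.9, 0.4, 0.1]
```

In the dual-discriminator setup described above, both the raw refine-network output and the composited result would be passed to a discriminator; that training loop is beyond the scope of this sketch.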

Key words: gated convolution, dual-discriminator, Generative Adversarial Network (GAN), image inpainting, Squeeze and Excitation (SE) attention

CLC Number:

TP391.41

Jibin FU, Yuli CAO. Image inpainting model of dual-discriminator generative adversarial network based on gated convolution and SENet[J]. Journal of Computer Applications, 2023, 43(S1): 212-216.

Model	PSNR/dB	SSIM	L1 loss
GL	23.21	0.8715	0.0360
Pix2pix	25.10	0.8788	0.0223
Pconv	25.66	0.8952	0.0236
Proposed model	27.39	0.9216	0.0149
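
As a quick arithmetic check, the relative gains quoted in the abstract follow from the Pconv row and the last row of the table above. The sketch below is illustrative only: the helper names are hypothetical, and the PSNR helper uses the standard definition with an assumed peak pixel value of 1.0, which this excerpt does not specify.

```python
import math


def relative_gain(new: float, baseline: float) -> float:
    """Relative improvement of `new` over `baseline`, in percent."""
    return (new - baseline) / baseline * 100.0


def psnr_from_mse(mse: float, peak: float = 1.0) -> float:
    """Standard PSNR in dB; `peak` is the maximum pixel value (assumed 1.0)."""
    return 10.0 * math.log10(peak ** 2 / mse)


# Values copied from the table: proposed model vs. Pconv.
psnr_gain = relative_gain(27.39, 25.66)
ssim_gain = relative_gain(0.9216, 0.8952)

print(f"PSNR gain: {psnr_gain:.2f}%")  # PSNR gain: 6.74%
print(f"SSIM gain: {ssim_gain:.2f}%")  # SSIM gain: 2.95%
```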

Model	PSNR/dB	SSIM	L1 loss
no_se&dc	27.11	0.9177	0.0153
no_se	27.20	0.9199	0.0149
Proposed model	27.39	0.9216	0.0149
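
The step-by-step differences between the rows of this table can be computed as below. The row labels are not defined in this excerpt; the sketch assumes that no_se&dc is the variant without SE attention and without the dual discriminator, and that no_se is the variant without SE attention only. Under that assumption, the first difference is attributable to the dual discriminator and the second to SE attention. The dictionary and helper names are hypothetical.

```python
from typing import Dict, Tuple

# (PSNR in dB, SSIM, L1 loss), copied from the table above.
rows: Dict[str, Tuple[float, float, float]] = {
    "no_se&dc": (27.11, 0.9177, 0.0153),
    "no_se": (27.20, 0.9199, 0.0149),
    "proposed": (27.39, 0.9216, 0.0149),
}


def delta(before: str, after: str) -> Tuple[float, ...]:
    """Per-metric change from variant `before` to variant `after`."""
    return tuple(round(b - a, 4) for a, b in zip(rows[before], rows[after]))


# Assumed reading: this step adds the dual discriminator.
print(delta("no_se&dc", "no_se"))   # (0.09, 0.0022, -0.0004)
# Assumed reading: this step adds SE attention.
print(delta("no_se", "proposed"))   # (0.19, 0.0017, 0.0)
```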
