Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (12): 3651-3657. DOI: 10.11772/j.issn.1001-9081.2020040522

• Virtual Reality and Multimedia Computing •

  • Corresponding author: WANG Meng, born in 1979, M.S., associate professor. His research interests include natural language understanding. E-mail: mwang007@gxust.edu.cn
  • About the authors: YANG Wenxia, born in 1978, Ph.D., associate professor. Her research interests include digital image processing and pattern recognition. ZHANG Liang, born in 1977, Ph.D., professor. His research interests include controllability and stabilizability of partial differential equations.

Semantic face image inpainting based on U-Net with dense blocks

YANG Wenxia1, WANG Meng2, ZHANG Liang1   

  1. School of Science, Wuhan University of Technology, Wuhan Hubei 430070, China;
    2. Tus College of Digit, Guangxi University of Science and Technology, Liuzhou Guangxi 545006, China
  • Received:2020-04-23 Revised:2020-06-23 Online:2020-12-10 Published:2020-07-30
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61573012), the China Scholarship Council (201906955038), the Science and Technology Program of Liuzhou (2018DH10503).



Abstract: When the missing regions of a face image are large, existing inpainting methods produce visual defects such as implausible image semantics and incoherent boundaries. To solve this problem, an end-to-end image inpainting model with a U-Net structure based on dense blocks was proposed to achieve semantic inpainting of face images with arbitrary masks. Firstly, following the idea of generative adversarial networks, the ordinary convolutional modules in the U-Net generator were replaced with dense blocks to capture the semantic information of the missing regions of the image and to ensure that the features of earlier layers were reused. Then, skip connections were adopted to reduce the information loss caused by down-sampling, so as to extract the semantics of the missing regions. Finally, the generator was trained with a joint loss function combining adversarial loss, content loss and local Total Variation (TV) loss, which ensured the visual consistency between the inpainted boundary and the surrounding real image, while the discriminator was trained with hinge loss. The proposed model was compared with Globally and Locally Consistent image completion (GLC), Deep Fusion (DF) and Gated Convolution (GC) on the CelebA-HQ face dataset. Experimental results show that the proposed model can effectively extract the semantic information of face images, and its inpainting results have boundaries with natural transitions and clear local details. Compared with the second-best method, GC, the proposed model increases the Structural SIMilarity (SSIM) index and Peak Signal-to-Noise Ratio (PSNR) by 5.68% and 7.87% respectively while decreasing the Fréchet Inception Distance (FID) by 7.86% for central masks; for random masks, it increases SSIM and PSNR by 7.06% and 4.80% respectively while decreasing FID by 6.85%.
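As a rough illustration of two of the loss terms named in the abstract, the sketch below implements a hinge loss for the discriminator and a total-variation penalty restricted to the masked (missing) region, in plain Python on toy 2D lists. The function names and data format are illustrative assumptions for exposition, not the paper's implementation; the adversarial and content terms of the generator's joint loss are omitted.

```python
# Minimal sketch (no deep-learning framework) of two losses from the abstract:
# hinge loss for the discriminator, and a local TV loss over the masked region.

def hinge_loss_discriminator(real_scores, fake_scores):
    """Hinge loss: mean(max(0, 1 - D(real))) + mean(max(0, 1 + D(fake)))."""
    loss_real = sum(max(0.0, 1.0 - s) for s in real_scores) / len(real_scores)
    loss_fake = sum(max(0.0, 1.0 + s) for s in fake_scores) / len(fake_scores)
    return loss_real + loss_fake

def local_tv_loss(image, mask):
    """Total variation summed only where mask == 1 (the inpainted region).

    image: 2D list of floats; mask: 2D list of 0/1 with the same shape.
    Penalizes horizontal and vertical intensity jumps, encouraging a
    smooth transition between the filled region and its surroundings.
    """
    h, w = len(image), len(image[0])
    tv = 0.0
    for i in range(h):
        for j in range(w):
            if not mask[i][j]:
                continue
            if i + 1 < h:  # vertical difference
                tv += abs(image[i + 1][j] - image[i][j])
            if j + 1 < w:  # horizontal difference
                tv += abs(image[i][j + 1] - image[i][j])
    return tv
```

In a full training loop, the generator's loss would combine the adversarial term with content and local TV terms via weighting coefficients, while the discriminator would minimize the hinge loss above.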

Key words: semantic image inpainting, generative adversarial network, dense block, loss function, local Total Variation (TV), encoder-decoder
