计算机应用 ›› 2018, Vol. 38 ›› Issue (4): 1141-1145.DOI: 10.11772/j.issn.1001-9081.2017092378

• 虚拟现实与多媒体计算 • 上一篇    下一篇

基于极深卷积神经网络的人脸超分辨率重建算法

孙毅堂, 宋慧慧, 张开华, 严飞   

  1. 江苏省大数据分析技术重点实验室(南京信息工程大学), 南京 210044
  • 收稿日期:2017-10-09 修回日期:2017-11-16 出版日期:2018-04-10 发布日期:2018-04-09
  • 通讯作者: 宋慧慧
  • 作者简介:孙毅堂(1992-),男,江苏苏州人,硕士研究生,主要研究方向:图像超分辨率重建;宋慧慧(1986-),女,山东聊城人,教授,博士,主要研究方向:遥感图像处理;张开华(1983-),男,山东日照人,教授,博士,CCF会员,主要研究方法:图像分割、目标跟踪;严飞(1983-),男,江苏南京人,讲师,博士,主要研究方向:图像处理。
  • 基金资助:
    国家自然科学基金资助项目(41501377,61605083);江苏省自然科学基金资助项目(BK20150906,BK20170040)。

Face super-resolution via very deep convolutional neural network

SUN Yitang, SONG Huihui, ZHANG Kaihua, YAN Fei   

  1. Jiangsu Key Laboratory of Big Data Analysis Technology(Najing University of Information Science and Technology), Nanjing Jiangsu 210044, China
  • Received:2017-10-09 Revised:2017-11-16 Online:2018-04-10 Published:2018-04-09
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (41501377, 61605083), the Natural Science Foundation of Jiangsu Province (BK20150906, BK20170040).

摘要: 针对多种放大倍数的人脸超分辨率重建问题,提出一种基于极深卷积神经网络的人脸超分辨率重建方法,并通过实验发现增加网络深度能够有效提升人脸重建的精度。首先,设计一个包含20个卷积层的网络从低分辨率图片和高分辨率图片之间学习一种端到端的映射关系,并通过在网络结构中将多个小的滤波器进行多次串联以扩大提取纹理信息的范围。其次,引入了残差学习的方法来解决随着深度的提升细节信息丢失的问题。另外,将不同放大因子的低分辨率人脸图片融合到一个训练集中训练,使得该卷积网络能够解决不同放大因子的人脸超分辨率重建问题。在CASPEAL测试集上的结果显示,该极深卷积神经网络的方法比基于双三次插值的人脸重建方法在峰值信噪比(PSNR)和结构相似度上有2.7 dB和2%的提升,和SRCNN的方法比较也有较大的提升,在精度和视觉改善方面都有较大提升。这显示了更深的网络结构能够在重建中取得更好的结果。

关键词: 超分辨率重建, 卷积神经网络, 机器学习, 深度学习, 残差学习

Abstract: For multiple scale factors of face super-resolution, a face super-resolution method based on very deep convolutional neural network was proposed; and through experiments, it was found that the increase of network depth can effectively improve the accuracy of face reconstruction. Firstly, a network that consists of 20 convolution layers were designed to learn an end-to-end mapping between the low-resolution images and the high-resolution images, and many small filters were cascaded to extract more textural information. Secondly, a residual-learning method was introduced to solve the problem of detail information loss caused by increasing depth. In addition, the low-resolution face images with multiple scale factors were merged to one training set to enable the network to achieve the face super resolution with multiple scale factors. The results on the CASPEAL test dataset show that the proposed method based on this very deep convolutional neural network has 2.7 dB increasement in Peak Signal-to-Noise Ratio (PSNR), and 2% increasement in structural similarity compared to the Bicubic based face reconstruction method. Compared with the SRCNN method, there is also a greater improvement. as well as a greater improvement in accuracy and visual improvement. It means that deeper network structures can achieve better results in reconstruction.

Key words: super-resolution reconstruction, convolutional neural network, machine learning, deep learning, residual learning

中图分类号: