Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (10): 3048-3053.DOI: 10.11772/j.issn.1001-9081.2020020281

• Virtual reality and multimedia computing • Previous Articles     Next Articles

Mixed-order channel attention network for single image super-resolution reconstruction

YAO Lu1, SONG Huihui2, ZHANG Kaihua1   

  1. 1. Jiangsu Key Laboratory of Big Data Analysis Technology(Nanjing University of Information Science and Technology), Nanjing Jiangsu 210044 China;
    2. Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing Jiangsu 210044, China
  • Received:2020-03-14 Revised:2020-06-15 Online:2020-10-10 Published:2020-06-24
  • Supported by:
    This work is partially supported by the National Major Project for New Generation of AI (2018AAA0100400), the National Natural Science Foundation of China (6187289, 61876088), the Natural Science Foundation of Jiangsu Province (BK20191397, BK20170040).

混合阶通道注意力网络的单图像超分辨率重建

姚鲁1, 宋慧慧2, 张开华1   

  1. 1. 江苏省大数据分析技术重点实验室(南京信息工程大学), 南京 210044;
    2. 南京信息工程大学 大气环境与装备技术协同创新中心, 南京 210044
  • 通讯作者: 宋慧慧
  • 作者简介:姚鲁(1993-),男,江苏连云港人,硕士研究生,主要研究方向:图像超分辨率重建、深度学习;宋慧慧(1986-),女,山东聊城人,教授,博士,CCF会员,主要研究方向:机器学习、遥感图像处理;张开华(1983-),男,山东日照人,教授,博士,CCF会员,主要研究方向:视频目标跟踪、显著性检测、视频目标分割。
  • 基金资助:
    国家新一代人工智能重大项目(2018AAA0100400);国家自然科学基金资助项目(61872189,61876088);江苏省自然科学基金资助项目(BK20191397,BK20170040)。

Abstract: For the current channel attention mechanism used for super-resolution reconstruction, there are problems that the attention prediction destroys the direct corresponding relationship between each channel and its weight and the mechanism only considers the first-order or second-order channel attention without comprehensive consideration of the advantage complementation. Therefore, a mixed-order channel attention network for image super-resolution reconstruction was proposed. First of all, by using the local cross-channel interaction strategy, increase and reduction in channel dimension used by the first-order and second-order channel attention models were changed into a fast one-dimensional convolution with kernel k, which not only makes the channel attention prediction more direct and accurate but makes the resulting model simpler than before. Besides, the improved first and second-order channel attention models above were adopted to comprehensively take the advantages of channel attentions of different orders, thus improving network discrimination. Experimental results on the benchmark datasets show that compared with the existing super-resolution algorithms, the proposed method has the best recovered texture details and high frequency information of the reconstructed images and the Perceptual Indictor (PI) on Set5 and BSD100 datasets are increased by 0.3 and 0.1 on average respectively. It shows that this network is more accurate in predicting channel attention and comprehensively uses channel attentions of different orders, so as to improve the performance.

Key words: channel attention mechanism, Generative Adversarial Network (GAN), image super-resolution, Convolutional Neural Network (CNN), deep learning

摘要: 目前用于图像超分辨率重建的通道注意力机制存在注意力预测破坏每个通道和其权重的直接对应关系以及仅仅只考虑一阶或二阶通道注意力而没有综合考虑优势互补的问题,因此提出一种混合阶通道注意力网络的单图像超分辨率重建算法。首先,该网络框架利用局部跨通道相互作用策略将之前一、二阶通道注意力模型采用的升降维改为核为k的一维卷积。这样不仅使得通道注意力预测更直接准确,而且得到的模型相比之前的通道注意力模型更简单;同时,采用改进一、二阶通道注意力模型以综合利用不同阶通道注意力的优势,提高网络判别能力。在基准数据集上的实验结果表明,和现有的超分辨率算法相比,所提算法重建图像的纹理细节和高频信息能得到更好的恢复,且在Set5和BSD100数据集上感知指数(PI)分别平均提高0.3和0.1。这表明此网络能更准确地预测通道注意力并综合利用了不同阶通道注意力,一定程度上提升了性能。

关键词: 通道注意力机制, 生成对抗网络, 图像超分辨率重建, 卷积神经网络, 深度学习

CLC Number: