Infrared image data augmentation based on generative adversarial network

doi:10.11772/j.issn.1001-9081.2019122253

Journal of Computer Applications, 2020, Vol. 40, Issue 7: 2084-2088. DOI: 10.11772/j.issn.1001-9081.2019122253

• Virtual reality and multimedia computing •

CHEN Foji^1,2,3,4, ZHU Feng^1,2,3,4, WU Qingxiao^1,2,3,4, HAO Yingming^1,2,3,4, WANG Ende^1,2,3,4

1. Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China;
2. Institutes for Robotics and Intelligent Manufacturing Innovation, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China;
3. University of Chinese Academy of Sciences, Beijing 100049, China;
4. Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China

Received: 2020-01-09 Revised: 2020-03-01 Online: 2020-03-29 Published: 2020-07-10
Corresponding author: CHEN Foji
About the authors: CHEN Foji (born 1994), male, from Xinzhou, Shanxi, master's degree; research interests: image generation, machine learning, robot vision. ZHU Feng (born 1962), male, from Shenyang, Liaoning, research fellow, doctoral supervisor, Ph.D.; research interests: robot vision. WU Qingxiao (born 1978), male, from Shenyang, Liaoning, research fellow, Ph.D.; research interests: robot vision, machine vision. HAO Yingming (born 1966), female, from Shenyang, Liaoning, research fellow, Ph.D.; research interests: visual localization, image optimization, visual inspection, infrared image simulation, 3D imaging, data processing. WANG Ende (born 1980), male, from Shenyang, Liaoning, research fellow, Ph.D.; research interests: image target recognition, detection and tracking, weak signal detection.
Supported by:
This work is partially supported by the National Natural Science Foundation of China (U1713216).

Abstract

Abstract: The strong performance of deep learning in visual tasks depends largely on massive amounts of data and on increased computing power, but many practical projects cannot provide enough data to complete the task. To address the problem that infrared images are scarce and hard to acquire in some situations, a method that generates infrared images from color images was proposed to obtain more infrared image data. Firstly, existing color and infrared image data were used to construct a paired dataset. Secondly, the generator and the discriminator of a Generative Adversarial Network (GAN) model were built from a convolutional neural network and a transposed convolutional neural network. Thirdly, the GAN model was trained on the paired dataset until the generator and the discriminator reached a Nash equilibrium. Finally, the trained generator was used to transform color images from the color domain to the infrared domain. The experimental results were evaluated with quantitative metrics. The results show that the proposed method can generate high-quality infrared images. In addition, compared with a loss function without a regularization term, adding the L1 or the L2 regularization constraint to the loss function reduced the FID (Fréchet Inception Distance) score by 23.95 and 20.89 on average, respectively. As an unsupervised data augmentation method, the proposed method can also be applied to other visual tasks that lack training data, such as target recognition and target detection, and to tasks with imbalanced data.

Key words: infrared image generation, Generative Adversarial Network (GAN), image transformation, data augmentation, quality evaluation of generated images
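
The abstract reports that adding an L1 or L2 regularization constraint to the loss function lowers the FID score, but it does not give the exact form of that loss. The following is a minimal sketch of one common way such a term is combined with an adversarial loss, assuming the constraint is a pixel-wise distance between the generated infrared image and its paired real infrared image. The function names, the non-saturating adversarial term, and the weight of 100.0 are hypothetical choices for illustration and are not taken from the paper.

```python
import math


def binary_cross_entropy(prob, target, eps=1e-12):
    """Binary cross-entropy for one probability against a 0/1 target."""
    p = min(max(prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


def reconstruction_penalty(generated, reference, kind):
    """Mean L1 or mean squared (L2) distance between two flat pixel lists."""
    if not reference or len(generated) != len(reference):
        raise ValueError("images must be non-empty and of equal length")
    if kind == "l1":
        total = sum(abs(g - r) for g, r in zip(generated, reference))
    elif kind == "l2":
        total = sum((g - r) ** 2 for g, r in zip(generated, reference))
    else:
        raise ValueError(f"unknown penalty kind: {kind!r}")
    return total / len(reference)


def generator_loss(d_prob_fake, generated, reference, kind=None, weight=100.0):
    """Adversarial term plus an optional weighted L1/L2 term.

    d_prob_fake: discriminator's probability that the generated image is real.
    kind: None (adversarial term only), "l1", or "l2".
    weight: hypothetical trade-off coefficient, not a value from the paper.
    """
    # The generator wants the discriminator to label its output as real (1).
    adversarial = binary_cross_entropy(d_prob_fake, 1.0)
    if kind is None:
        return adversarial
    return adversarial + weight * reconstruction_penalty(generated, reference, kind)


# Toy 4-pixel "images" with intensities in [0, 1].
fake_ir = [0.2, 0.4, 0.6, 0.8]
real_ir = [0.0, 0.5, 0.5, 1.0]
for kind in (None, "l1", "l2"):
    print(kind, round(generator_loss(0.5, fake_ir, real_ir, kind), 4))
```

In this sketch the extra term penalizes the generator for any pixel-level deviation from the paired target, in addition to failing to fool the discriminator. It can only be computed when a paired dataset is available, which matches the paired color/infrared data described in the abstract.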

CLC number:

TP391.41

CHEN Foji, ZHU Feng, WU Qingxiao, HAO Yingming, WANG Ende. Infrared image data augmentation based on generative adversarial network[J]. Journal of Computer Applications, 2020, 40(7): 2084-2088.

