基于条件生成式对抗网络的数据增强方法

doi:10.11772/j.issn.1001-9081.2018051008

计算机应用 ›› 2018, Vol. 38 ›› Issue (11): 3305-3311.DOI: 10.11772/j.issn.1001-9081.2018051008

• 应用前沿、交叉与综合 • 上一篇下一篇

基于条件生成式对抗网络的数据增强方法

陈文兵, 管正雄, 陈允杰

南京信息工程大学数学与统计学院, 南京 210044

收稿日期:2018-05-14 修回日期:2018-06-26 发布日期:2018-11-10 出版日期:2018-11-10
通讯作者: 管正雄
作者简介:陈文兵(1964-),男,安徽东至人,副教授,硕士,主要研究方向:计算数学、模式识别、图像处理;管正雄(1993-),男,安徽芜湖人,硕士研究生,主要研究方向:模式识别、图像处理;陈允杰(1980-),男,江苏南京人,教授,博士,主要研究方向:计算数学、模式识别、图像处理。
基金资助:
国家自然科学基金资助项目（61672291）；北极阁基金资助项目（BJG201504）。

Data augmentation method based on conditional generative adversarial net model

CHEN Wenbing, GUAN Zhengxiong, CHEN Yunjie

School of Mathematics and Statistics, Nanjing University of Information Science & Technology, Nanjing Jiangsu 210044, China

Received:2018-05-14 Revised:2018-06-26 Online:2018-11-10 Published:2018-11-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61672291), the Beijige Foundation (BJG201504).

摘要/Abstract

摘要： 深度卷积神经网络（CNN）在大规模带有标签的数据集训练下，训练后模型能够取得高的识别率或好的分类效果，而利用较小规模数据集训练CNN模型则通常出现过拟合现象。针对这一问题，提出了一种集成高斯混合模型（GMM）及条件生成式对抗网络（CGAN）的数据增强方法并记作GMM-CGAN。首先，通过围绕核心区域随机滑动采样的方法增加数据集样本数量；其次，假定噪声随机向量服从GMM描述的分布，将它作为CGAN生成器的初始输入，图像标签作为CGAN条件，训练CGAN以及GMM模型的参数；最后，利用已训练CGAN生成符合样本真实分布的新数据集。对包含12种雾型386个样本的天气形势图基准集利用GMM-CGAN方法进行数据增强，增强后的数据集样本数多达38600个，将该数据集训练的CNN模型与仅使用仿射变换增强的数据集及CGAN方法增强的数据集训练的CNN模型相比，实验结果表明，前者的平均分类正确率相较于后两个模型分别提高了18.2%及14.1%，达到89.1%。

关键词: 图像分类, 深度卷积神经网络, 高斯混合模型, 有条件对抗神经网络, 数据增强算法

Abstract: Deep Convolutional Neural Network (CNN) is trained by large-scale labelled datasets. After training, the model can achieve high recognition rate or good classification effect. However, the training of CNN models with smaller-scale datasets usually occurs overfitting. In order to solve this problem, a novel data augmentation method called GMM-CGAN was proposed, which was integrated Gaussian Mixture Model (GMM) and CGAN (Conditional Generative Adversarial Net). Firstly, sample number was increased by randomly sliding sampling around the core region. Secondly, the random noise vector was supposed to submit to the distribution of GMM model, then it was used as the initial input to the CGAN generator and the image label was used as the CGAN condition to train the parameters of the CGAN and GMM models. Finally, the trained CGAN was used to generate a new dataset that matched the real distribution of the samples. The dataset was divided into 12 classes of 386 items. After implementing GMM-CGAN on the dataset, the total number of the new dataset was 38600. The experimental results show that compared with CNN's training datasets augmented by Affine transformation or CGAN, the average classification accuracy of the proposed method is 89.1%, which is improved by 18.2% and 14.1%, respectively.

Key words: image classification, deep Convolution Neural Network (CNN), Gaussian Mixture Model (GMM), Conditional Generative Adversarial Net (CGAN), data augmentation algorithm

中图分类号:

TP391.41

陈文兵, 管正雄, 陈允杰. 基于条件生成式对抗网络的数据增强方法[J]. 计算机应用, 2018, 38(11): 3305-3311.

CHEN Wenbing, GUAN Zhengxiong, CHEN Yunjie. Data augmentation method based on conditional generative adversarial net model[J]. Journal of Computer Applications, 2018, 38(11): 3305-3311.

参考文献

[1] LECUN Y, BOSER B, DENKER J S, et al. Back propagation applied to handwritten zip code recognition[J]. Neural Computation, 1989, 1(4):541-551.
[2] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada:Curran Associates Inc., 2012:1097-1105.
[3] SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[J/OL]. arXiv Preprint, 2014, 2014:arXiv:1409.1556(2014-09-04)[2015-04-10]. http://arxiv.org/abs/1409.1556.
[4] SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2015:1-9.
[5] HE K, ZHANG X, REN S. Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:770-778.
[6] 刘雨桐, 李志清, 杨晓玲. 改进卷积神经网络在遥感图像分类中的应用[J]. 计算机应用, 2018, 38(4):949-954.(LIU Y T, LI Z Q, YANG X L. Application of improved convolution neural network in remote sensing image classification[J]. Journal of Computer Applications, 2018, 38(4):949-954.)
[7] 安旭骁, 邓洪敏, 史兴宇. 基于迷你卷积神经网络的停车场空车位检测方法[J]. 计算机应用, 2018, 38(4):935-938.(AN X X, DENG H M, SHI X Y. Parking lot space detection method based on mini convolutional neural network[J]. Journal of Computer Applications, 2018, 38(4):935-938.)
[8] PEREZ L, WANG J. The Effectiveness of data augmentation in image classification using deep learning[J/OL]. arXiv Preprint, 2017, 2017:arXiv:1712.04621[2017-12-13]. http://arxiv.org/abs/1712.04621.
[9] BJERRUM E J. SMILES enumeration as data augmentation for neural network modeling of molecules[J/OL]. arXiv Preprint, 2017, 2017:arXiv:1703.07076(2017-03-21)[2017-05-17]. http://arxiv.org/abs/1703.07076.
[10] GOODFELLOW I J, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Cambridge:MIT Press, 2014:2672-2680.
[11] MIRZA M, OSINDERO S. Conditional generative adversarial nets[J/OL]. arXiv Preprint, 2014, 2014:arXiv:1411.1784[2014-11-06]. http://arxiv.org/abs/1411.1784.
[12] KINGMA D P, WELLING M. Auto-encoding variational Bayes[J/OL]. arXiv Preprint, 2013, 2013:arXiv:1312.6114(2013-12-20)[2014-05-01]. http://arxiv.org/abs/1312.6114.
[13] ROSCA M, LAKSHMINARAYANAN B, WARDEFARLEY D, et al. Variational approaches for auto-encoding generative adversarial networks[J/OL]. arXiv Preprint, 2017, 2017:arXiv:1706.04987(2017-05-15)[2017-10-21]. http://arxiv.org/abs/1706.04987.
[14] LARSEN A B L, LAROCHELLE H, WINTHER O. Autoencoding beyond pixels using a learned similarity metric[C]//Proceedings of the 33rd International Conference on International Conference on Machine Learning. New York:JMLR.org, 2016:1558-1566.
[15] GURUMURTHY S, SARVADEVABHATLA R K, BABU R V. DeLiGAN:Generative adversarial networks for diverse and limited data[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017:4941-4949.
[16] 王坤峰, 苟超, 段艳杰,等. 生成式对抗网络GAN的研究进展与展望[J].自动化学报, 2017, 43(3):321-332.(WANG K F, GOU C, DUAN Y J,et al. Generative adversarial networks:the state of the art and beyond[J]. Acta Automatica Sinica, 2017, 43(3):321-332.)

基于条件生成式对抗网络的数据增强方法

Data augmentation method based on conditional generative adversarial net model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	王东炜, 刘柏辰, 韩志, 王艳美, 唐延东. 基于低秩分解和向量量化的深度网络压缩方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 1987-1994.
[2]	翟飞宇, 马汉达. 基于DenseNet的经典-量子混合分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1905-1910.
[3]	肖斌, 杨模, 汪敏, 秦光源, 李欢. 独立性视角下的相频融合领域泛化方法[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1002-1009.
[4]	王星, 刘贵娟, 陈志豪. 高斯混合模型与文本图卷积网络结合的虚假评论识别算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 360-368.
[5]	张睿, 潘俊铭, 白晓露, 胡静, 张荣国, 张鹏云. 面向深度分类模型超参数自优化的代理模型[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3021-3031.
[6]	谢莉, 舒卫平, 耿俊杰, 王琼, 杨海麟. 结合加权原型和自适应张量子空间的小样本宫颈细胞分类[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3200-3208.
[7]	周雯, 谌雨章, 温志远, 王诗琦. 基于位置编码重叠切块嵌入和多尺度通道交互注意力的鱼类图像分类[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3209-3216.
[8]	陈彤, 位纪伟, 何仕远, 宋井宽, 杨阳. 基于自适应攻击强度的对抗训练方法[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 94-100.
[9]	王彬, 向甜, 吕艺东, 王晓帆. 基于NSGA‑Ⅱ的自适应多尺度特征通道分组优化算法[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1401-1408.
[10]	李振亮, 李波. 基于矩阵分解的卷积神经网络改进方法[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 685-691.
[11]	文凯, 薛晓, 季娟. 面向复杂图像分类的共享转换矩阵胶囊网络[J]. 《计算机应用》唯一官方网站, 2023, 43(11): 3411-3417.
[12]	申志军, 穆丽娜, 高静, 史远航, 刘志强. 细粒度图像分类综述[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 51-60.
[13]	魏佳璇, 杜世康, 于志轩, 张瑞生. 图像分类中的白盒对抗攻击技术综述[J]. 《计算机应用》唯一官方网站, 2022, 42(9): 2732-2741.
[14]	刘万军, 王佳铭, 曲海成, 董利兵, 曹欣宇. 基于频谱空间域特征注意的音乐流派分类算法[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2072-2077.
[15]	任炜, 白鹤翔. 基于全局与局部标签关系的多标签图像分类方法[J]. 《计算机应用》唯一官方网站, 2022, 42(5): 1383-1390.