计算机应用 ›› 2018, Vol. 38 ›› Issue (6): 1554-1561.DOI: 10.11772/j.issn.1001-9081.2017122872

• 人工智能 • 上一篇    下一篇

基于深度生成式对抗网络的蓝藻语义分割

杨朔, 陈丽芳, 石瑀, 毛一鸣   

  1. 江南大学 数字媒体学院, 江苏 无锡 214122
  • 收稿日期:2017-12-08 修回日期:2018-01-25 出版日期:2018-06-10 发布日期:2018-06-13
  • 通讯作者: 陈丽芳
  • 作者简介:杨朔(1993-),男,山东桓台人,硕士研究生,主要研究方向:深度神经网络、模式识别;陈丽芳(1973-),女,福建莆田人,副教授,博士,主要研究方向:图像处理、数字媒体;石瑀(1993-),男,山西太原人,硕士研究生,主要研究方向:自然语言处理、深度神经网络;毛一鸣(1994-),男,江苏扬州人,硕士研究生,主要研究方向:数字图像处理、深度学习。
  • 基金资助:
    国家科技支撑计划项目(2015BAH54F01)。

Semantic segmentation of blue-green algae based on deep generative adversarial net

YANG Shuo, CHEN Lifang, SHI Yu, MAO Yiming   

  1. School of Digital Media, Jiangnan University, Wuxi Jiangsu 214122, China
  • Received:2017-12-08 Revised:2018-01-25 Online:2018-06-10 Published:2018-06-13
  • Supported by:
    This work is partially supported by the National Key Technology R&D Program (2015BAH54F01).

摘要: 针对传统图像分割算法分割蓝藻图像准确率不足的问题,提出了一种基于深度神经网络(DNN)和生成式对抗网络(GAN)思想的网络结构,称为深度生成式对抗网络(DGAN)。首先,在传统全卷积神经网络(FCN)的基础上构建了一个12层的FCN作为生成网络(G),用于学习分布规律,生成蓝藻图像的分割结果(Fake);然后,设计了一个5层的卷积神经网络(CNN)作为判别网络(D),用于区分生成网络生成的分割结果(Fake)和手工标注的真实分割结果(Label),G试图生成Fake并蒙骗D,D试图找出Fake并惩罚G;最后,通过两个网络的对抗式训练,G生成的Fake可以蒙骗D,从而获得了更好的分割结果。在3075张蓝藻图像集上的训练和测试结果表明,DGAN在精确率、召回率及F1分数等指标上均大幅领先基于迭代的阈值分割算法;相比FCNNet (SHELHAMER E,LONG J,DARRELL T.Fully convolutional networks for semantic segmentation.IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(4):640-651)、Deeplab (CHEN L C,PAPANDREOU G,KOKKINOS I,et al.Semantic image segmentation with deep convolutional nets and fully connected CRFs.Computer Science,2014(4):357-361)等其他基于DNN的方法也提升了超过4个百分点,取得了更精准的分割结果。分割速度上,DGAN的0.63 s略慢于FCNNet的0.46 s,但远快于Deeplab的1.31 s。DGAN均衡的分割准确率和分割速度为基于图像的蓝藻语义分割提供了可行的技术方案。

关键词: 生成式对抗网络, 深度神经网络, 全卷积神经网络, 蓝藻, 语义分割

Abstract: Concerning the problem of insufficient accuracy of the traditional image segmentation algorithm in segmentation of blue-green alga images, a new network structure named Deep Generative Adversarial Net (DGAN) based on Deep Neural Network (DNN) and Generative Adversarial Net (GAN) was proposed. Firstly, based on Fully Convolutional neural Network (FCN), a 12-layer FCN was constructed as the Generater (G), which was used to study the distribution of data and generate the segmentation result of blue-green alga images (Fake). Secondly, a 5-layer Convolutional Neural Network (CNN) was constructed as the Discriminator (D), which was used to distinguish the segmentation result generated by the generated network (Fake) and the true segmentation result with manual annotation (Label), G tried to generate Fake and deceive D, D tried to find out Fake and punish G. Finally, through the adversarial training of two networks, a better segmentation result was obtained because Fake generated by G could cheat D. The training and test results on image sets with 3075 blue-green alga images show that, the proposed DGAN is far ahead of the iterative threshold segmentation algorithm in precision, recall and F1 score, which are increased by more than 4 percentage points than other DNN algorithms such as FCNNet (SHELHAMER E, LONG J, DARRELL T. Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651) and Deeplab (CHEN L C, PAPANDREOU G, KOKKINOS I, et al. Semantic image segmentation with deep convolutional nets and fully connected CRFs. Computer Science, 2014(4):357-361). The proposed DGAN has obtained more accurate segmentation results. In the aspect of segmentation speed, the DGAN needs 0.63 s per image, which is slightly slower than the traditional FCNNet with 0.46 s, but much faster than Deeplab with 1.31 s. The balanced segmentation accuracy and speed of DGAN can provide a feasible technical scheme for image-based semantic segmentation of blue-green algae.

Key words: Generative Adversarial Net (GAN), Deep Neural Network (DNN), Fully Convolutional neural Network (FCN), blue-green algae, semantic segmentation

中图分类号: