Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (8): 2271-2275.DOI: 10.11772/j.issn.1001-9081.2018122555

• Artificial intelligence • Previous Articles     Next Articles

Two-input stream deep deconvolution neural network for interpolation and recognition

ZHANG Qiang1, YANG Jian1,2, FU Lizhen1   

  1. 1. School of Software, North University of China, Taiyuan Shanxi 030051, China;
    2. Key Laboratory of Electromagnetic Wave Information of Ministry of Education(Fudan University), Shanghai 200433, China
  • Received:2018-12-27 Revised:2019-04-09 Online:2019-08-10 Published:2019-04-18
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61602427).

双输入流深度反卷积的插值神经网络

张强1, 杨剑1,2, 富丽贞1   

  1. 1. 中北大学 软件学院, 太原 030051;
    2. 电磁波信息科学教育部重点实验室(复旦大学), 上海 200433
  • 通讯作者: 杨剑
  • 作者简介:张强(1993-),男,山西忻州人,硕士研究生,主要研究方向:计算机视觉、图像处理;杨剑(1979-),男,山西临汾人,讲师,博士,CCF会员,主要研究方向:计算机视觉、无线通信;富丽贞(1984-),女,山西大同人,讲师,博士,CCF会员,主要研究方向:人工智能、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61602427)。

Abstract: It is impractical to have a large size of training dataset in real work for neural network training, so a two-input stream generative neural network which can generate a new image with the given parameters was proposed, hence to augment the training dataset. The framework of the proposed neural network consists of a two-input steam convolution network and a deconvolution network. The two-input steam network has two convolution networks to extract features, and the deconvolution network is connected to the end. Two images with different angle were input into the convolution network to get high-level description, then an interpolation target image with a new perspectives was generated by using the deconvolution network with the above high-level description and set parameters. The experiment results on ShapeNetCore show that on the same dataset, the recognition rate of the proposed network has increased by 20% than the common network framework. This method can enlarge the size of the training dataset and is useful for multi-angle recognition.

Key words: deep learning, artificial intelligence, generative neural network, deconvolution, two-input stream

摘要: 在实际工作中深度学习方法通常不具备大量的训练样本,因此提出了双输入流深度反卷积生成神经网络的构架,依据给定的条件产生新的目标图像,从而扩充训练样本集。该神经网络的整体架构由双输入的卷积网络和一个反卷积网络输出构成,其中双输入卷积网络接收目标物体不同视角的两张图片并提取抽象特征,而反卷积网络则利用抽象特征和设定的参数产生新的插值目标图像。在ShapeNetCore数据集上的实验结果显示,在相同数量的训练样本空间中,与未扩展数据集的卷积网络相比,双输入流深度反卷积生成神经网络的识别率提高了20%左右。结果表明,双输入流深度反卷积生成神经网络无需输入目标物类别,可生成新参数条件下的目标图像,扩充训练样本空间,从而提高识别率,可用于少样本的目标物多角度识别。

关键词: 深度学习, 人工智能, 生成神经网络, 反卷积, 双输入流

CLC Number: