Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (10): 2930-2936. DOI: 10.11772/j.issn.1001-9081.2019040594

• Artificial Intelligence •

Multi-scale grape image recognition method based on convolutional neural network

QIU Jinyi1,2, LUO Jun1,2, LI Xiu3, JIA Wei1, NI Fuchuan1,2, FENG Hui1

  1. College of Informatics, Huazhong Agricultural University, Wuhan Hubei 430070, China;
    2. Hubei Engineering Technology Research Center of Agricultural Big Data, Wuhan Hubei 430070, China;
    3. College of Engineering, Huazhong Agricultural University, Wuhan Hubei 430070, China
  • Received: 2019-04-10 Revised: 2019-06-22 Published: 2019-10-10 Online: 2019-08-21
  • Corresponding author: LUO Jun
  • About the authors: QIU Jinyi (1995-), female, from Tianjin, M.S. candidate, CCF member; research interests: computer vision and image processing. LUO Jun (1981-), male, from Wuhan, Hubei, Ph.D., associate professor, CCF member; research interests: machine learning and big data. LI Xiu (1995-), female, from Jinan, Shandong, M.S. candidate, CCF member; research interests: computer vision and image processing. JIA Wei (1994-), male, from Deyang, Sichuan, M.S. candidate; research interests: computer vision and deep learning. NI Fuchuan (1974-), male, from Huanggang, Hubei, Ph.D., lecturer; research interests: machine learning and big data. FENG Hui (1987-), female, from Xishui, Hubei, Ph.D., lecturer; research interests: computer vision and plant phenotype detection.
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (21800305), the National Key R&D Program of China (2018YFC1604000), and the Fundamental Research Funds for the Central Universities (2662017PY059).

Abstract: Quality inspection of grape varieties requires identifying multiple categories of grapes, but grape images contain varied scenes, such as changes in depth of field and multiple bunches, and a single preprocessing method is too limited to give good recognition results. The research objects were natural-scene images of 15 grape categories collected in greenhouses, from which the corresponding image dataset Vitis-15 was built. To address the large intra-class differences and small inter-class differences in grape images, a multi-scale grape image recognition method based on Convolutional Neural Network (CNN) was proposed. Firstly, the data in the Vitis-15 dataset were preprocessed by three methods: data augmentation by image rotation, multi-scale image generation by central cropping, and data fusion of the two. Then, transfer learning and convolutional neural networks were used for classification and recognition: the Inception V3 network model pre-trained on ImageNet was selected for transfer learning, and three types of models (AlexNet, ResNet and Inception V3) were used as convolutional neural networks. Finally, MS-EAlexNet, a classification model for multi-scale image data fusion suited to Vitis-15, was proposed. Experimental results show that, with the same learning rate and on the same test set, the data fusion method reaches a test accuracy of 99.92% on MS-EAlexNet, nearly 1 percentage point higher than the augmentation and multi-scale image methods, and that the proposed method is relatively efficient in classifying small-sample datasets.

Key words: image recognition, natural scene, transfer learning, Convolutional Neural Network (CNN), multi-scale image, data fusion
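
The three preprocessing routes named in the abstract (rotation-based augmentation, multi-scale images by central cropping, and the fusion of both) can be illustrated with a minimal NumPy sketch. The abstract does not state the rotation angles, the crop ratios, or any resizing step, so the quarter-turn rotations, the ratios 0.8 and 0.6, and all function names below are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np


def rotate_augment(image, quarter_turns=(1, 2, 3)):
    """Return rotated copies of an H x W x C image.

    Assumption: rotations are multiples of 90 degrees; the abstract
    does not say which angles were used.
    """
    return [np.rot90(image, k=k, axes=(0, 1)) for k in quarter_turns]


def center_crop(image, ratio):
    """Keep the central region covering `ratio` of each side length."""
    h, w = image.shape[:2]
    ch = max(1, int(round(h * ratio)))
    cw = max(1, int(round(w * ratio)))
    top = (h - ch) // 2
    left = (w - cw) // 2
    return image[top:top + ch, left:left + cw]


def multi_scale(image, ratios=(0.8, 0.6)):
    """Build several views of one image by central cropping.

    Assumption: the ratios are placeholders chosen for illustration.
    """
    return [center_crop(image, r) for r in ratios]


def fuse(image):
    """Data fusion: the original plus both kinds of derived images."""
    return [image] + rotate_augment(image) + multi_scale(image)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-in for one photo; Vitis-15 itself is not used here.
    img = rng.integers(0, 256, size=(100, 80, 3), dtype=np.uint8)
    for view in fuse(img):
        print(view.shape)
```

In practice each view would still need to be resized to the input size of the chosen network (for example AlexNet or Inception V3) before training; that step is omitted here because the abstract gives no details about it.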

CLC Number: