Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (5): 1295-1300.DOI: 10.11772/j.issn.1001-9081.2019111883

• Artificial intelligence • Previous Articles     Next Articles

Multi-branch neural network model based weakly supervised fine-grained image classification method

BIAN Xiaoyong1,2,3, JIANG Peiling1,2,3, ZHAO Min4,5, DING Sheng1,2,3, ZHANG Xiaolong1,2,3   

  1. 1.School of Computer Science and Technology, Wuhan University of Science and Technology, WuhanHubei 430065, China
    2.Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, WuhanHubei 430065, China
    3.Key Laboratory of Hubei Province for Intelligent Information Processing and Real-time Industrial System;(Wuhan University of Science and Technology), WuhanHubei 430065, China
    4.School of Information Science and Engineering, Wuhan University of Science and Technology, WuhanHubei 430081, China
    5.Engineering Research Center for Metallurgical Automation and Measurement Technology, Ministry of Education;(Wuhan University of Science and Technology), WuhanHubei 430081, China
  • Received:2019-11-05 Revised:2019-12-17 Online:2020-05-10 Published:2020-05-15
  • Contact: BIAN Xiaoyong, born in 1976, Ph. D., associate professor. His research interests include remote sensing scene classification, feature learning.
  • About author:BIAN Xiaoyong, born in 1976, Ph. D., associate professor. His research interests include remote sensing scene classification, feature learning.JIANG Peiling, born in 1993, M. S. candidate. His research interests include fine-grained image classification, deep learning.ZHAO Min, born in 1978, M. S., lecturer. His research interests include fault diagnosis.DING Sheng, born in 1975, Ph. D., associate professor. His research interests include object detection, deep learning.ZHANG Xiaolong, born in 1963, Ph. D., professor. His research interests include artificial intelligence, machine learning, data mining, bioinformatics.
  • Supported by:

    This work is partially supported by the National Natural Science Foundation of China (61572381, 61501337, 61972299), the Natural Science Foundation of Hubei Province (2018CFB575), the Open Fund of Engineering Research Center for Metallurgical Automation and Measurement Technology of Ministry of Education (MADT201707).

基于多分支神经网络模型的弱监督细粒度图像分类方法

边小勇1,2,3, 江沛龄1,2,3, 赵敏4,5, 丁胜1,2,3, 张晓龙1,2,3   

  1. 1.武汉科技大学 计算机科学与技术学院,武汉 430065
    2.武汉科技大学 大数据科学与工程研究院,武汉 430065
    3.智能信息处理与实时工业系统湖北省重点实验室(武汉科技大学),武汉 430065
    4.武汉科技大学 信息科学与工程学院,武汉 430081
    5.冶金自动化与检测技术教育部工程研究中心(武汉科技大学),武汉 430081
  • 通讯作者: 边小勇(1976—)
  • 作者简介:边小勇(1976—),男,江西吉安人,副教授,博士,主要研究方向:遥感场景分类、特征学习; 江沛龄(1993—),男,湖北武汉人,硕士研究生,主要研究方向:细粒度图像分类、深度学习; 赵敏(1978—),男,湖北咸宁人,讲师,硕士,主要研究方向:故障诊断; 丁胜(1975—),男,湖北武汉人,副教授,博士,主要研究方向:目标检测、深度学习; 张晓龙(1963—),男,江西吉安人,教授,博士,主要研究方向:人工智能、机器学习、数据挖掘、生物信息处理。
  • 基金资助:

    国家自然科学基金资助项目(61572381,61501337,61972299);湖北省自然科学基金资助项目(2018CFB575);冶金自动化与检测技术教育部工程研究中心开放基金资助项目(MADT201707)。

Abstract:

Concerning the problem that the local feature and rotation invariant feature cannot be jointly paid attention to in traditional attention-based neural networks, a multi-branch neural network model based weakly supervised fine-grained image classification method was proposed. Firstly, the lightweight Class Activation Map (CAM) network was utilized to localize the local region with potential semantic information, and the residual network ResNet-50 with deformable convolution and Oriented Response Network (ORN) with rotation invariant coding were designed. Secondly, the pre-trained model was employed to initialize the feature networks respectively, and the original image and the above regions were input to fine-tune the model. Finally, the three intra-branch losses and between-branch losses were combined to optimize the entire network, and the classification and prediction were performed on the test set. The proposed method achieves the classification accuracies of 87.7% and 90.8% on CUB-200-2011 dataset and FGVC_Aircraft dataset respectively, which are increased by 1.2 percentage points, and 0.9 percentage points respectively compared with those of the Multi-Attention Convolutional Neural Network (MA-CNN) method. On Aircraft_2 dataset, the proposed method reaches 91.8% classification accuracy, which is 4.1 percentage points higher than that of ResNet-50. The experimental results show that the proposed method improves the accuracy of weakly supervised fine-grained image classification effectively.

Key words: fine-grained image classification, deep learning, weakly supervised, deformable convolution, Class Activation Map (CAM), Oriented Response Network (ORN)

摘要:

针对传统基于注意力机制的神经网络不能联合关注局部特征和旋转不变特征的问题,提出一种基于多分支神经网络模型的弱监督细粒度图像分类方法。首先,用轻量级类激活图(CAM)网络定位有潜在语义信息的局部区域,设计可变形卷积的残差网络ResNet-50和旋转不变编码的方向响应网络(ORN);其次,利用预训练模型分别初始化特征网络,并输入原图和以上局部区域分别对模型进行微调;最后,组合三个分支内损失和分支间损失优化整个网络,对测试集进行分类预测。所提方法在CUB-200-2011和FGVC_Aircraft数据集上的分类准确率分别达到87.7%和90.8%,与多注意力卷积神经网络(MA-CNN)方法相比,分别提高了1.2个百分点和0.9个百分点;在Aircraft_2数据集上的分类准确率达到91.8%,比ResNet-50网络提高了4.1个百分点。实验结果表明,所提方法有效提高了弱监督细粒度图像分类的准确率。

关键词: 细粒度图像分类, 深度学习, 弱监督, 可变形卷积, 类激活图, 方向响应网络

CLC Number: