Fine-grained image classification method based on multi-feature combination

doi:10.11772/j.issn.1001-9081.2017122920

Journal of Computer Applications, 2018, Vol. 38, Issue (7): 1853-1856. DOI: 10.11772/j.issn.1001-9081.2017122920

ZOU Chengming^1,2, LUO Ying^1,2, XU Xiaolong^1,2

1. Hubei Key Laboratory of Transportation Internet of Things (Wuhan University of Technology), Wuhan Hubei 430070, China;
2. College of Computer Science and Technology, Wuhan University of Technology, Wuhan Hubei 430070, China

Received: 2017-12-13 Revised: 2018-02-08 Online: 2018-07-12 Published: 2018-07-10
Supported by:
This work is partially supported by the Fundamental Research Funds for the Central Universities (2017-zy-084).

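Corresponding author: LUO Ying
About the authors: ZOU Chengming (born 1975), male, born in Xuwen, Guangdong, professor, Ph.D., CCF member. His main research interests include computer vision, embedded systems, and software theory and methods. LUO Ying (born 1993), female, born in Yiyang, Hunan, master's candidate. Her main research interest is graphics and image processing. XU Xiaolong (born 1995), male, born in Suzhou, Anhui, master's candidate. His main research interest is graphics and image processing.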

Abstract

Abstract: Because a single feature representation can limit the accuracy of fine-grained image classification, a multi-feature combination representation method based on Convolutional Neural Network (CNN) and Scale Invariant Feature Transform (SIFT) was proposed. Features were extracted at three levels: the entire target, its key parts, and its key points. Firstly, two CNN models were trained on the target-entirety regions and the head-only regions of the fine-grained image library respectively, and were used to extract the target-entirety and head-only CNN features. Secondly, SIFT key points were extracted from all target-entirety regions in the image library, and a codebook was generated by K-means clustering. Then, the SIFT descriptors of each target-entirety region were encoded into a feature vector by the Vector of Locally Aggregated Descriptors (VLAD) with reference to the codebook. Finally, the multiple features were combined into the final feature representation, and a Support Vector Machine (SVM) was used to classify the fine-grained images. The method was evaluated on the CUB-200-2011 database and compared with single-feature representation methods. The experimental results show that the proposed method improves classification accuracy by 13.31% over the single CNN feature representation, which demonstrates the positive effect of multi-feature combination on fine-grained image classification.
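
The VLAD encoding and feature combination steps described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation. It assumes the codebook has already been produced by K-means, that local descriptors are given as lists of floats, and that the combination is a simple concatenation; the final L2 normalization and the function names `nearest_center`, `vlad_encode`, and `combine_features` are also assumptions made for illustration.

```python
import math


def nearest_center(descriptor, codebook):
    """Return the index of the codebook center closest to the descriptor."""
    best_index, best_dist = 0, float("inf")
    for index, center in enumerate(codebook):
        # Squared Euclidean distance is enough for choosing the nearest center.
        dist = sum((d - c) ** 2 for d, c in zip(descriptor, center))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def vlad_encode(descriptors, codebook):
    """Encode local descriptors as a VLAD vector of length k * d."""
    k, d = len(codebook), len(codebook[0])
    residuals = [[0.0] * d for _ in range(k)]
    for descriptor in descriptors:
        i = nearest_center(descriptor, codebook)
        # Accumulate the residual between the descriptor and its nearest center.
        for j in range(d):
            residuals[i][j] += descriptor[j] - codebook[i][j]
    flat = [value for row in residuals for value in row]
    # Assumed step: L2-normalize the flattened vector.
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


def combine_features(whole_cnn, head_cnn, vlad):
    """Concatenate the three feature vectors (assumed combination rule)."""
    return list(whole_cnn) + list(head_cnn) + list(vlad)


# Toy example: a 2-center codebook and three 2-D descriptors.
codebook = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [9.0, 10.0]]
vlad = vlad_encode(descriptors, codebook)
combined = combine_features([0.5, 0.2], [0.1, 0.9], vlad)
```

In this sketch the combined vector would be the input to an SVM classifier; real SIFT descriptors are 128-dimensional, so a codebook of k centers yields a VLAD vector of length 128k.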

Key words: Convolutional Neural Network (CNN), Scale Invariant Feature Transform (SIFT), K-means clustering, Vector of Locally Aggregated Descriptors (VLAD), fine-grained image classification

ZOU Chengming, LUO Ying, XU Xiaolong. Fine-grained image classification method based on multi-feature combination[J]. Journal of Computer Applications, 2018, 38(7): 1853-1856.
