基于优化视觉词袋模型的图像分类方法

doi:10.11772/j.issn.1001-9081.2017.08.2244

计算机应用 ›› 2017, Vol. 37 ›› Issue (8): 2244-2247.DOI: 10.11772/j.issn.1001-9081.2017.08.2244

基于优化视觉词袋模型的图像分类方法

张永, 杨浩

兰州理工大学计算机与通信学院, 兰州 730050

收稿日期:2016-12-13 修回日期:2017-03-11 发布日期:2017-08-12 出版日期:2017-08-10
通讯作者: 杨浩
作者简介:张永(1963-),男,甘肃兰州人,教授,主要研究方向:智能信息处理、数据挖掘;杨浩(1991-),男,甘肃陇南人,硕士研究生,主要研究方向:图像分类、机器学习。

Image classification method based on optimized bag-of-visual words model

ZHANG Yong, YANG Hao

School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China

Received:2016-12-13 Revised:2017-03-11 Online:2017-08-12 Published:2017-08-10

摘要/Abstract

摘要： 针对视觉词袋（BOV）模型中过大的视觉词典会导致图像分类时间代价过大的问题，提出一种加权最大相关最小相似（W-MR-MS）视觉词典优化准则。首先，提取图像的尺度不变特征转换（SIFT）特征，并用K-Means算法对特征聚类生成原始视觉词典；然后，分别计算视觉单词与图像类别间的相关性，以及各视觉单词间的语义相似性，引入一个加权系数权衡两者对图像分类的重要程度；最后，基于权衡结果，删除视觉词典中与图像类别相关性弱、与视觉单词间语义相似性大的视觉单词，从而达到优化视觉词典的目的。实验结果表明，在视觉词典规模相同的情况下，所提方法的图像分类精度比传统基于K-Means算法的图像分类精度提高了5.30%；当图像分类精度相同的情况下，所提方法的时间代价比传统K-Means算法下的时间代价降低了32.18%，因此，所提方法具有较高的分类效率，适用于图像分类。

关键词: 图像分类, 视觉词袋模型, 特征提取, 视觉词典

Abstract: Concerning the problem that too large visual dictionary may increase the time cost of image classification in the Bag-Of-Visual words (BOV) model, a Weighted-Maximal Relevance-Minimal Semantic similarity (W-MR-MS) criterion was proposed to optimize visual dictionary. Firstly, the Scale Invariant Feature Transform (SIFT) features of images were extracted, and the K-Means algorithm was used to generate an original visual dictionary. Secondly, the correlation between visual words and image categories and semantic similarity among visual words were calculated, and a weighted parameter was introduced to measure the importance of the correlation and the semantic similarity in image classification. Finally, based on the weighing result, the visual word which correlation with image categories was weak and semantic similarity among visual words was high was removed, which achieved the purpose of optimizing the visual dictionary. The experimental results show that the classification precision of the proposed method is 5.30% higher than that of the traditional K-Means algorithm under the same visual dictionary scale; the time cost of the proposed method is reduced by 32.18% compared with the traditional K-Means algorithm under the same classification precision. Therefore, the proposed method has high classification efficiency and it is suitable for image classification.

Key words: image classification, Bag-Of-Visual words (BOV) model, feature extraction, visual dictionary

中图分类号:

TP181

张永, 杨浩. 基于优化视觉词袋模型的图像分类方法[J]. 计算机应用, 2017, 37(8): 2244-2247.

ZHANG Yong, YANG Hao. Image classification method based on optimized bag-of-visual words model[J]. Journal of Computer Applications, 2017, 37(8): 2244-2247.

参考文献

[1] SIVIC J, ZISSERMAN A. Video Google:a text retrieval approach to object matching in videos[C]//ICCV 2003:Proceedings of the 2003 Ninth IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2003:1470-1477.
[2] 王朔琛,汪西莉,马君亮.基于均值漂移的半监督支持向量机图像分类[J].计算机应用,2014,34(8):2399-2403.(WANG S C, WANG X L, MA J L. Semi-supervised support vector machine for image classification based on mean shift[J]. Journal of Computer Applications, 2014, 34(8):2399-2403.)
[3] 邵忻.基于跨领域主动学习的图像分类方法[J].计算机应用,2014,34(4):1169-1171.(SHAO X. Cross-domain active learning algorithm for image classification[J]. Journal of Computer Applications, 2014, 34(4):1169-1171.)
[4] TIMOFTE R, GOOL L V. Adaptive and weighted collaborative representations for image classification[J]. Pattern Recognition Letters, 2014, 43(1):127-135.
[5] ALQASRAWI Y, NEAGU D, COWLING P I. Fusing integrated visual vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification[J]. Signal Image & Video Processing, 2013, 7(4):759-775.
[6] LU Y, XIE F, LIU T, et al. No reference quality assessment for multiply-distorted images based on an improved bag-of-words model[J]. IEEE Signal Processing Letters, 2015, 22(10):1811-1815.
[7] QU Y, WU S, LIU H, et al. Evaluation of local features and classifiers in BOW model for image classification[J]. Multimedia Tools and Applications, 2014, 70(2):605-624.
[8] YANG X, ZHANG T, XU C. A new discriminative coding method for image classification[J]. Multimedia Systems, 2015, 21(2):133-145.
[9] GAO S, TSANG W H, MA Y. Learning category-specific dictionary and shared dictionary for fine-grained image categorization[J]. IEEE Transactions on Image Processing, 2014, 23(2):623-634.
[10] KIM S, KWEON I S, LEE C W. Visual categorization robust to large intra-class variations using entropy-guided codebook[C]//ICRA 2007:Proceedings of the 2007 IEEE International Conference on Robotics & Automation. Piscataway, NJ:IEEE, 2007:3793-3798.
[11] EPSHTEIN B, ULLMAN S. Feature hierarchies for object classification[C]//ICCV 2005:Proceedings of the 2005 Tenth IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2005:220-227.
[12] LU Z, WANG L, WEN J R. Image classification by visual bag-of-words refinement and reduction[J]. Neurocomputing, 2016, 173:373-384.
[13] KELBERT M, SUHOV Y. Information Theory and Coding by Example[M]. Oxford:Cambridge University Press, 2013:18-86.
[14] LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(60):91-110.
[15] TUIA D, VOLPI M, DALLA MURA M, et al. Automatic feature learning for spatio-spectral image classification with sparse SVM[J]. IEEE Transactions on Geoscience & Remote Sensing, 2014, 52(10):6062-6074.

基于优化视觉词袋模型的图像分类方法

Image classification method based on optimized bag-of-visual words model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	杨鑫, 陈雪妮, 吴春江, 周世杰. 结合变种残差模型和Transformer的城市公路短时交通流预测[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2947-2951.
[2]	付帅, 郭小英, 白茹意, 闫涛, 陈斌. 改进的CloFormer模型与有序回归相结合的年龄评估方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2372-2380.
[3]	陈彤, 杨丰玉, 熊宇, 严荭, 邱福星. 基于多尺度频率通道注意力融合的声纹库构建方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2407-2413.
[4]	龙伍丹, 彭博, 胡节, 申颖, 丁丹妮. 基于加强特征提取的道路病害检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2264-2270.
[5]	刘瑞华, 郝子赫, 邹洋杨. 基于多层级精细特征融合的步态识别算法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2250-2257.
[6]	王东炜, 刘柏辰, 韩志, 王艳美, 唐延东. 基于低秩分解和向量量化的深度网络压缩方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 1987-1994.
[7]	翟飞宇, 马汉达. 基于DenseNet的经典-量子混合分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1905-1910.
[8]	吴郅昊, 迟子秋, 肖婷, 王喆. 基于元学习自适应的小样本语音合成[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1629-1635.
[9]	肖斌, 杨模, 汪敏, 秦光源, 李欢. 独立性视角下的相频融合领域泛化方法[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1002-1009.
[10]	崔晨辉, 蔺素珍, 李大威, 禄晓飞, 武杰. 基于孪生网络和Transformer的红外弱小目标跟踪方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 563-571.
[11]	刘涛, 鞠事宏, 高一萌. 基于改进YOLOv8n的无人机视角下小目标检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(11): 3603-3609.
[12]	范艺扬, 张洋, 曾尚, 曾渝, 付茂栗. 基于分解和频域特征提取的多变量长时间序列预测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(11): 3442-3448.
[13]	赵培, 乔焰, 胡荣耀, 袁新宇, 李敏悦, 张本初. 基于多域特征提取的多变量时间序列异常检测[J]. 《计算机应用》唯一官方网站, 2024, 44(11): 3419-3426.
[14]	谢莉, 舒卫平, 耿俊杰, 王琼, 杨海麟. 结合加权原型和自适应张量子空间的小样本宫颈细胞分类[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3200-3208.
[15]	周雯, 谌雨章, 温志远, 王诗琦. 基于位置编码重叠切块嵌入和多尺度通道交互注意力的鱼类图像分类[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3209-3216.