• •    

基于多尺度特征融合Hessian稀疏编码的图像分类

刘盛清,孙季丰,余家林,宋治国   

  1. 广州华南理工大学电子与信息学院逸夫科学馆425实验室
  • 收稿日期:2017-06-05 修回日期:2017-08-05 发布日期:2017-08-05
  • 通讯作者: 刘盛清

Multi-Scale Feature Fusion Based Hessian Sparse Coding for Image Classification

  • Received:2017-06-05 Revised:2017-08-05 Online:2017-08-05
  • Contact: Sheng-Qing LIU

摘要: 摘 要: 针对传统稀疏编码图像分类算法提取单一类型特征,忽略图像的空间结构信息,特征编码时无法充分利用特征拓扑结构信息的问题,提出了基于多尺度特征融合Hessian稀疏编码的图像分类算法(Hessian Sparse Coding,HSC)。首先对图像进行空间金字塔多尺度划分;然后在各个子空间层将方向梯度直方图和尺度不变特征转换进行有效的融合;为了充分利用特征的拓扑结构信息,在传统稀疏编码目标函数中引入二阶Hessian能量函数作为正则项;最后利用支持向量机进行分类。在Scene15数据集上,HSC的准确率比局部约束线性编码高了3%-5%,比支持区别性字典学习等对比方法高了1%-3%。在Caltech101上的耗时实验中,HSC的用时比多核学习少40%左右。实验结果表明,HSC可以有效地提高图像分类准确率,算法的效率也优于对比算法。

关键词: 图像分类, 特征融合, 空间金字塔, 稀疏编码, 支持向量机

Abstract: Abstract: Concern the problem that the traditional sparse coding image classification algorithms extracted the single type feature, ignored the spatial structure information of the image and lacked the topological structure information of the feature. In order to solve the problems, the multi-scale feature fusion based Hessian sparse coding algorithm was proposed. Firstly, the image was divided into sub-regions with multi-scale spatial pyramid; Then, the Histogram of Oriented Gradient and Scale-invariant feature transform were effectively merged in each subspace layer; In order to make full use of the characteristic topology information, the second order Hessian energy function was introduced to the traditional sparse coding target function as a regular term; Finally, the support vector machine was used to classify. In the Scene15 experiment, the accuracy of HSC was 3%-5% higher than that of Locality-constrained Linear Coding, and it was 1%-3% higher than that of support discrimination dictionary learning. In the tine-consuming experiment on Caltech101, HSC is about 40% less than the Multiple Kernel Learning Sparse Coding. The experimental results show that the HSC algorithm can effectively improve the classification accuracy, and the efficiency is better than the contrast algorithm.

Key words: Keywords: image classification, feature fusion, spatial pyramid matching, sparse coding, support vector machine

中图分类号: