计算机应用 ›› 2016, Vol. 36 ›› Issue (10): 2822-2825.DOI: 10.11772/j.issn.1001-9081.2016.10.2822

• 虚拟现实与数字媒体 • 上一篇    下一篇

基于深度特征分析的双线性图像相似度匹配算法

李鸣1,2, 张鸿1,2   

  1. 1. 武汉科技大学 计算机科学与技术学院, 武汉 430065;
    2. 智能信息处理与实时工业系统湖北省重点实验室(武汉科技大学), 武汉 430065
  • 收稿日期:2016-03-23 修回日期:2016-06-18 发布日期:2016-10-10
  • 通讯作者: 张鸿,E-mail:zhanghong_wust@163.com
  • 作者简介:李鸣(1992—),男,湖北武汉人,硕士研究生,主要研究方向:图像检索、深度学习;张鸿(1979—),女,湖北襄阳人,教授,博士,CCF会员,主要研究方向:基于内容的多媒体检索、数据挖掘、机器学习。
  • 基金资助:
    国家自然科学基金资助项目(61373109,61003127)。

Bilinear image similarity matching algorithm based on deep feature analysis

LI Ming1,2, ZHANG Hong1,2   

  1. 1. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan Hubei 430065, China;
    2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial Systems (Wuhan University of Science and Technology), Wuhan Hubei 430065, China
  • Received:2016-03-23 Revised:2016-06-18 Published:2016-10-10
  • Supported by:
    BackgroundThis work is supported by the National Natural Science Foundation of China (61373109,61003127).

摘要: 基于内容的图像检索一直面临"语义鸿沟"的难题,特征选择对语义学习结果有着直接的影响;而传统距离度量方法往往从单一角度进行相似性计算,不能很好地表示出图像之间的相似度。为了解决以上问题,提出基于深度特征分析的双线性图像相似度匹配的方法。首先,将图像数据集在卷积神经网络模型上进行微调训练,然后利用训练好的卷积神经网络对图像进行特征提取,获得全连接层输出的特征之后,通过双线性相似性度量方法得到图像间相似度的大小,通过对相似度的大小排序,返回最相似的图像实例。在Caltech101和Caltech256数据集上的对比实验显示,所提算法的平均查准率、TopK查准率和查全率均优于对比算法,验证了所提算法的有效性。

关键词: 深度神经网络, 双线性相似度, 图像检索, 语义鸿沟, 平均查准率

Abstract: Content-based image retrieval has being faced the problem of "semantic gap", feature selection has a direct influence on semantic learning results; while traditional distance metric often calculates the similarity from a single perspective, which cannot well express the similarity between images. To resolve the above problem, a bilinear image similarity matching algorithm based on deep feature analysis was proposed. First, the image dataset was fine-tuning trained on the Convolutional Neural Network (CNN) model, then the image features were extracted by using the trained CNN. After getting the output features of the full connection layer, the image similarity was calculated by the bilinear similarity matching algorithm, and the most similar image instance was returned after sorting the similarity. Experimental results on Caltech101 and Caltech 256 datasets show that compared with the contrast algorithms, the proposed algorithm can get higher mean average precision, TopK precision and recall, which demonstrates the effectiveness of the proposed algorithm.

Key words: deep neural network, bilinear image similarity matching, image retrieval, semantic gap, mean average precision

中图分类号: