Journal of Computer Applications, 2016, Vol. 36, Issue (2): 531-534. DOI: 10.11772/j.issn.1001-9081.2016.02.0531

• Virtual reality and digital media •

Image retrieval algorithm based on convolutional neural network and manifold ranking

LIU Bing1,2, ZHANG Hong1,2

  1. School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan Hubei 430065, China;
  2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan Hubei 430065, China
  • Received: 2015-07-24 Revised: 2015-09-12 Online: 2016-02-10 Published: 2016-02-03
  • Corresponding author: ZHANG Hong (born 1979), female, from Xiangyang, Hubei; professor, Ph.D., CCF member. Research interests: cross-media retrieval, machine learning, data mining.
  • About the author: LIU Bing (born 1992), male, from Guangshui, Hubei; master's student. Research interests: deep learning, image processing.
  • Supported by:
    National Natural Science Foundation of China (61003127, 61373109).

Abstract: In Content-Based Image Retrieval (CBIR), low-level visual features are inconsistent with the high-level semantics that users perceive in images, and traditional distance measures struggle to reflect the true similarity between images. To address these problems, an image retrieval algorithm based on Convolutional Neural Network (CNN) and manifold ranking was proposed. Firstly, images were fed into the CNN, and after supervised learning through the multi-layer network, image features were extracted from the fully connected layers; secondly, the image features were normalized, and the Efficient Manifold Ranking (EMR) algorithm was used to rank the results returned for the query image; finally, the most similar images were returned according to the ranking. On the Corel dataset, the mean Average Precision (mAP) of deep image features was 53.74% higher than that of scene descriptor features, and the mAP of manifold ranking was 18.34% higher than that of the cosine distance measure. The experimental results show that the proposed algorithm can effectively improve the accuracy of image retrieval.

Key words: image retrieval, deep learning, Convolutional Neural Network (CNN), feature extraction, manifold ranking
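The normalize-then-rank stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and not EMR itself: it swaps in the classic iterative manifold ranking update f ← αSf + (1 − α)y on a dense Gaussian affinity graph, which is only practical for small collections. The function names, the toy 2-D vectors standing in for CNN fully connected layer features, and the values of `alpha`, `sigma`, and `iterations` are all assumptions made for illustration.

```python
import math


def l2_normalize(vec):
    """Scale a feature vector to unit Euclidean length."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def manifold_rank(features, query_index, alpha=0.9, sigma=0.3, iterations=100):
    """Classic manifold ranking (not EMR): iterate f <- alpha*S*f + (1-alpha)*y.

    alpha, sigma, and iterations are illustrative values, not from the paper.
    """
    n = len(features)
    # Gaussian affinity matrix W with a zero diagonal.
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d2 = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
            w[i][j] = w[j][i] = math.exp(-d2 / (2 * sigma ** 2))
    # Symmetric normalization: S = D^{-1/2} W D^{-1/2}.
    deg = [sum(row) for row in w]
    s = [
        [w[i][j] / math.sqrt(deg[i] * deg[j]) if w[i][j] > 0 else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    # y marks the query; scores spread from it along the graph.
    y = [1.0 if i == query_index else 0.0 for i in range(n)]
    f = list(y)
    for _ in range(iterations):
        f = [
            alpha * sum(s[i][j] * f[j] for j in range(n)) + (1 - alpha) * y[i]
            for i in range(n)
        ]
    return f


# Toy stand-ins for deep features: two loose groups of vectors.
raw_features = [
    [1.0, 0.0], [0.95, 0.1], [0.9, 0.2],   # group containing the query
    [0.0, 1.0], [0.1, 0.95], [0.2, 0.9],   # unrelated group
]
features = [l2_normalize(v) for v in raw_features]
scores = manifold_rank(features, query_index=0)
# Indices sorted from most to least similar to the query.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(ranking)
```

The dense graph here costs quadratic memory and time in the number of images, which is the cost an efficient variant such as EMR is meant to avoid on larger collections.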

CLC number: