基于相关度距离的无监督并行哈希图像检索

doi:10.11772/j.issn.1001-9081.2020091472

计算机应用 ›› 2021, Vol. 41 ›› Issue (7): 1902-1907.DOI: 10.11772/j.issn.1001-9081.2020091472

所属专题：人工智能

基于相关度距离的无监督并行哈希图像检索

杨粟^1,2, 欧阳智¹, 杜逆索^1,2

1. 贵州省公共大数据重点实验室(贵州大学), 贵阳 550025;
2. 贵州大学计算机科学与技术学院, 贵阳 550025

收稿日期:2020-09-21 修回日期:2021-01-12 发布日期:2021-01-26 出版日期:2021-07-10
通讯作者: 欧阳智
作者简介:杨粟(1996-),女(侗族),贵州铜仁人,硕士研究生,主要研究方向:图像检索;欧阳智(1987-),男,四川安岳人,副教授,博士,主要研究方向:机器学习、大数据治理;杜逆索(1986-),男,贵州六盘水人,副教授,博士,CCF会员,主要研究方向:仿真模拟、数据科学。
基金资助:
贵州省科学技术厅重大科技计划项目（黔科合重大专项字[2018]3002）。

Unsupervised parallel hash image retrieval based on correlation distance

YANG Su^1,2, OUYANG Zhi¹, DU Nisuo^1,2

1. Guizhou Provincial Key Laboratory of Public Big Data(Guizhou University), Guiyang Guizhou 550025, China;
2. College of Computer Science and Technology, Guizhou University, Guiyang Guizhou 550025, China

Received:2020-09-21 Revised:2021-01-12 Online:2021-01-26 Published:2021-07-10
Supported by:
This work is partially supported by Major Scientific and Technological Program of Department of Science and Technology of Guizhou Province ([2018]3002).

摘要/Abstract

摘要： 针对传统无监督哈希图像检索模型中存在图像数据之间的语义信息学习不足，以及哈希编码长度每换一次模型就需重新训练的问题，提出一种用于大规模图像数据集检索的无监督搜索框架——基于相关度距离的无监督并行哈希图像检索模型。首先，使用卷积神经网络（CNN）学习图像的高维特征连续变量；然后，使用相关度距离衡量特征变量构建伪标签矩阵，并将哈希函数与深度学习相结合；最后，在哈希码生成时使用并行方式逐步逼近原始视觉特征，达到一次训练生成多长度哈希码的目的。实验结果表明，该模型在FLICKR25K数据集上对16 bit、32 bit、48 bit和64 bit的4种不同哈希码的平均精度均值（mAP）分别为0.726、0.736、0.738和0.738，与SSDH模型相比分别提升了9.4、8.2、6.2、7.3个百分点；而在训练时间方面，该模型与SSDH模型相比减少6.6 h。所提模型在大规模图像检索时能够有效缩短训练时间、提升检索精度。

关键词: 图像检索, 卷积神经网络, 哈希算法, 无监督, 相关度距离

Abstract: To address the problems of insufficient learning of semantic information between image data and the need to retrain the model every time when the hash code length is changed in traditional unsupervised hash image retrieval model, an unsupervised search framework for large-scale image dataset retrieval, the unsupervised parallel hash image retrieval model based on correlation distance, was proposed. First, the Convolutional Neural Network (CNN) was used to learn the high-dimensional feature continuous variables of the image. Second, the pseudo-label matrix was constructed by using the correlation distance measure feature variables, and the hash function was combined with deep learning. Finally, the parallel method was used to gradually approximate the original visual characteristics during the hash code generation, realizing the purpose of generating the multi-length hash codes in one training. Experimental results show that the mean Average Precisions (mAPs) of the proposed model for four of 16 bit, 32 bit, 48 bit and 64 bits hash codes on FLICKR25K dataset are 0.726, 0.736, 0.738, 0.738,respectively, which are 9.4, 8.2, 6.2, 7.3 percentage points higher than those of Semantic Structure-based Unsupervised Deep Hashing (SSDH) model, respectively; and compared with SSDH model, the training time of the proposed model is reduced by 6.6 hours. It can be seen that the proposed model can effectively shorten the training time and improve the retrieval accuracy in large-scale image retrieval.

Key words: image retrieval, Convolutional Neural Network (CNN), hash algorithm, unsupervised, correlation distance

中图分类号:

TP181

杨粟, 欧阳智, 杜逆索. 基于相关度距离的无监督并行哈希图像检索[J]. 计算机应用, 2021, 41(7): 1902-1907.

YANG Su, OUYANG Zhi, DU Nisuo. Unsupervised parallel hash image retrieval based on correlation distance[J]. Journal of Computer Applications, 2021, 41(7): 1902-1907.

参考文献

[1] BOWYER K,FLYNN P. A 20th anniversary survey:introduction to "content-based image retrieval at the end of the early years"[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22(12):1348-1348.
[2] ANDONI A, INDYK P. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions[J]. Communications of the ACM,2008,51(1):117-122.
[3] OTAIR M. Approximate k-nearest neighbor based spatial clustering using K-D tree[J]. International Journal of Database Management Systems,2013,5(1):97-108.
[4] WANG J,ZHANG T,SONG J,et al. A survey on learning to hash[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2018,40(4):769-790.
[5] LIU W,WANG J,JI R,et al. Supervised hashing with kernels[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2012:2074-2081.
[6] SHAO J,WU F,OUYANG C,et al. Sparse spectral hashing[J]. Pattern Recognition Letters,2012,33(3):271-277.
[7] ZHU L,SHEN J,XIE L,et al. Unsupervised visual hashing with semantic assistant for content-based image retrieval[J]. IEEE Transactions on Knowledge and Data Engineering,2017,29(2):472-486.
[8] GU Y, WANG S, ZHANG H, et al. Clustering-driven unsupervised deep hashing for image retrieval[J]. Neurocomputing,2019,368:114-123.
[9] DENG C, YANG E, LIU T, et al. Unsupervised semanticpreserving adversarial hashing for image search[J]. IEEE Transactions on Image Processing,2019,28(8):4032-4044.
[10] 王伯伟, 聂秀山, 马林元, 等. 基于语义相似度的无监督图像哈希方法[J]. 南京大学学报(自然科学版),2019,55(1):41-48. (WANG B W,NIE X S,MA L Y,et al. Unsupervised image hash method based on semantic similarity[J]. Journal of Nanjing University(Natural Science),2019,55(1):41-48.)
[11] HEO J P, LEE Y, HE J, et al. Spherical hashing[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2012:2957-2964.
[12] XIA R,PAN Y,LAI H,et al. Supervised hashing for image retrieval via image representation learning[C]//Proceedings of the 28th AAAI Conference on Artificial Intelligence. Palo Alto,CA:AAAI Press,2014:2156-2162.
[13] GIONIS A,INDYK P,MOTWANI R. Similarity search in high dimensions via hashing[C]//Proceedings of 25th International Conference on Very Large Data Bases. San Francisco:Morgan Kaufmann Publishers Inc.,1999:518-529.
[14] SHEN F,SHEN C,LIU W,et al. Supervised discrete hashing[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:37-45.
[15] 林计文, 刘华文. 基于伪成对标签的深度无监督哈希学习[J]. 模式识别与人工智能,2020,33(3):258-267.(LIN J W,LIU H W. Deep unsupervised hashing with pseudo pairwise labels[J]. Pattern Recognition and Artificial Intelligence,2020,33(3):258-267.)
[16] SONG J,HE T,GAO L,et al. Binary generative adversarial networks for image retrieval[C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence. Palo Alto, CA:AAAI Press,2018:394-401.
[17] GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al. Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Cambridge:MIT Press,2014:2672-2680.
[18] 王妙, 景军锋. 基于哈希编码和卷积神经网络的图像检索方法[J]. 计算机工程与应用,2019,55(23):194-199.(WANG M, JING J F. Image retrieval based on Hash coding and convolutional neural network[J]. Computer Engineering and Applications, 2019,55(23):194-199.)
[19] 魏永超. 基于相关系数与相关距离的证据合成方法[J]. 计算技术与自动化,2017,36(1):32-35.(WEI Y C. Evidence combination method based on correlation coefficient and correlation distance[J]. Computing Technology and Automation, 2017,36(1):32-35)
[20] CAO Y,LIU B,LONG M,et al. HashGAN:deep learning to hash with pair conditional Wasserstein GAN[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:1287-1296.
[21] GAO L,ZHU X,SONG J,et al. Beyond product quantization:deep progressive quantization for image retrieval[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. Palo Alto,CA:AAAI Press,2019:723-729.
[22] SONG J,ZHU X,GAO L,et al. Deep recurrent quantization for generating sequential binary codes[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. Palo Alto,CA:AAAI Press,2019:912-918.
[23] SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[EB/OL].[2020-04-10]. https://arxiv.org/pdf/1409.1556.pdf.
[24] LIU Z,WU J,FU L,et al. Improved kiwifruit detection using pretrained VGG16 with RGB and NIR information fusion[J]. IEEE Access,2020,8:2327-2336.
[25] LOU G,SHI H. Face image recognition based on convolutional neural network[J]. China Communications, 2020, 17(2):117-124.
[26] GONG Y,LAZEBNIK S. Iterative quantization:a procrustean approach to learning binary codes[C]//Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2011:817-824.
[27] WEISS Y,TORRALBA A,FERGUS R. Spectral hashing[C]//Proceedings of the 21st International Conference on Neural Information Processing Systems. Red Hook, NY:Curran Associates Inc.,2008:1753-1760.
[28] JIN Z,LI C,LIN Y,et al. Density sensitive hashing[J]. IEEE Transactions on Cybernetics,2014,44(8):1362-1371.
[29] DAI B,GUO R,KUMAR S,et al. Stochastic generative hashing[C]//Proceedings of the 34th International Conference on Machine Learning. New York:JMLR. org,2017:913-922.
[30] LIN K,LU J,CHEN C S,et al. Learning compact binary descriptors with unsupervised deep neural networks[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:1183-1192.
[31] YANG E,DENG C,LIU T,et al. Semantic structure-based unsupervised deep hashing[C]//Proceedings of the 27th International Joint Conference on Artificial Intelligence. Palo Alto,CA:AAAI Press,2018:1064-1070.

基于相关度距离的无监督并行哈希图像检索

Unsupervised parallel hash image retrieval based on correlation distance

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	李云, 王富铕, 井佩光, 王粟, 肖澳. 基于不确定度感知的帧关联短视频事件检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2903-2910.
[2]	黄于欣, 徐佳龙, 余正涛, 侯书楷, 周家啟. 基于生成提示的无监督文本情感转换方法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2667-2673.
[3]	秦璟, 秦志光, 李发礼, 彭悦恒. 基于概率稀疏自注意力神经网络的重性抑郁疾患诊断[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2970-2974.
[4]	贾洁茹, 杨建超, 张硕蕊, 闫涛, 陈斌. 基于自蒸馏视觉Transformer的无监督行人重识别[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2893-2902.
[5]	张春雪, 仇丽青, 孙承爱, 荆彩霞. 基于两阶段动态兴趣识别的购买行为预测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2365-2371.
[6]	陈虹, 齐兵, 金海波, 武聪, 张立昂. 融合1D-CNN与BiGRU的类不平衡流量异常检测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2493-2499.
[7]	赵宇博, 张丽萍, 闫盛, 侯敏, 高茂. 基于改进分段卷积神经网络和知识蒸馏的学科知识实体间关系抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2421-2429.
[8]	王东炜, 刘柏辰, 韩志, 王艳美, 唐延东. 基于低秩分解和向量量化的深度网络压缩方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 1987-1994.
[9]	高阳峄, 雷涛, 杜晓刚, 李岁永, 王营博, 闵重丹. 基于像素距离图和四维动态卷积网络的密集人群计数与定位方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2233-2242.
[10]	沈君凤, 周星辰, 汤灿. 基于改进的提示学习方法的双通道情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1796-1806.
[11]	黄梦源, 常侃, 凌铭阳, 韦新杰, 覃团发. 基于层间引导的低光照图像渐进增强算法[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1911-1919.
[12]	李健京, 李贯峰, 秦飞舟, 李卫军. 基于不确定知识图谱嵌入的多关系近似推理模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1751-1759.
[13]	姚迅, 秦忠正, 杨捷. 生成式标签对抗的文本分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1781-1785.
[14]	高文烁, 陈晓云. 基于节点结构的点云分类网络[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1471-1478.
[15]	席治远, 唐超, 童安炀, 王文剑. 基于双路时空网络的驾驶员行为识别[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1511-1519.