计算机应用 ›› 2014, Vol. 34 ›› Issue (12): 3481-3486.

• 数据技术 • 上一篇    下一篇

基于精确欧氏局部敏感哈希的协同过滤推荐算法

李红梅,郝文宁,陈刚   

  1. 解放军理工大学 指挥信息系统学院,南京 210007
  • 收稿日期:2014-05-19 修回日期:2014-07-04 出版日期:2014-12-01 发布日期:2014-12-31
  • 通讯作者: 李红梅
  • 作者简介:李红梅(1990-),女,河北衡水人,硕士研究生,主要研究方向:中文信息检索、数据挖掘;郝文宁(1971-),男,山西运城人,教授,博士,主要研究方向:海量高维数据归约、作战效能评估;陈刚(1974-),男,重庆人,副教授,硕士,主要研究方向:作战指挥训练模拟。

Collaborative filtering recommendation algorithm based on exact Euclidean locality-sensitive hashing

LI Hongmei,HE Wenning,CHEN Gang   

  1. College of Command and Information System, PLA University of Science and Technology, Nanjing Jiangsu 210007, China
  • Received:2014-05-19 Revised:2014-07-04 Online:2014-12-01 Published:2014-12-31
  • Contact: LI Hongmei

摘要:

针对推荐系统中用户评分数据的海量高维与稀疏性,以及直接利用传统相似性度量方法来获取近邻的计算量大、结果不准等对推荐质量的影响,提出基于精确欧氏局部敏感哈希(E2LSH)的协同过滤推荐算法。首先利用精确欧氏局部敏感哈希算法对用户评分数据进行降维处理并构建索引,以快速获取目标用户的近邻用户;然后利用加权策略来预测用户评分,进而完成协同过滤推荐。实验结果表明,该算法能有效解决用户数据的海量高维与稀疏性问题,且运行效率高,具有较好的推荐质量。

Abstract:

In recommendation systems, recommendation results are affected by the matter that rating data is characterized by large volume, high dimensionality, extreme sparsity, and the limitation of traditional similarity measuring methods in finding the nearest neighbors, including huge calculation and inaccurate results. Aiming at the poor recommendation quality, this paper presented a new collaborative filtering recommendation algorithm based on Exact Euclidean Locality-Sensitive Hashing (E2LSH). Firstly, E2LSH algorithm was utilized to lower dimensionality and construct index for large rating data. Based on the index, the nearest neighbor users of target user could be obtained with great efficiency. Then, a weighted strategy was applied to predict the user ratings to perform collaborative filtering recommendation. The experimental results on typical dataset show that the proposed method can overcome the bottleneck of high dimensionality and sparsity to some degree, with high running efficiency and good recommendation performance.

中图分类号: