计算机应用 ›› 2015, Vol. 35 ›› Issue (9): 2569-2573.DOI: 10.11772/j.issn.1001-9081.2015.09.2569

• 数据技术 • 上一篇    下一篇

基于主题模型的个性化图书推荐算法

郑祥云, 陈志刚, 黄瑞, 李博   

  1. 中南大学 软件学院, 长沙 410075
  • 收稿日期:2015-04-23 修回日期:2015-06-16 出版日期:2015-09-10 发布日期:2015-09-17
  • 通讯作者: 陈志刚(1964-),男,湖南益阳人,教授,博士生导师,博士,CCF会员,主要研究方向:无线网络、分布式计算,czg@csu.edu.cn
  • 作者简介:郑祥云(1992-),男,湖南永州人,硕士研究生,主要研究方向:数据挖掘;黄瑞(1989-),男,安徽安庆人,博士研究生,主要研究方向:社交网络;李博(1988-),男,河北衡水人,硕士研究生,主要研究方向:数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61379057,61309001,61272149,61103202);中南大学中央高校基本科研业务费专项资金资助项目(2015zzts228)。

Personalized book recommendation algorithm based on topic model

ZHENG Xiangyun, CHEN Zhigang, HUANG Rui, LI Bo   

  1. School of Software, Central South University, Changsha Hunan 410075, China
  • Received:2015-04-23 Revised:2015-06-16 Online:2015-09-10 Published:2015-09-17

摘要: 针对传统推荐算法精准度不高的问题,在潜在狄利克雷分布(LDA)主题挖掘模型的基础上提出了一种新的适用于图书推荐(BR)的数据挖掘模型——BR_LDA模型。通过对目标借阅者的历史借阅数据与其他图书数据进行内容相似度分析,得到与目标借阅者历史借阅图书内容相似度较高的其他图书。通过对目标借阅者的历史借阅数据及其他借阅者的历史借阅数据进行相似性分析,得到最近邻借阅者的历史借阅数据。通过求解图书被推荐的概率,最终得到目标借阅者潜在感兴趣的图书。特别地,当推荐数量为4000时,BR_LDA模型比基于多特征方法和关联规则方法精准度分别提高了6.2%、4.5%;当推荐数量为500时,BR_LDA模型比协同过滤的近邻方法和矩阵分解方法分别提高了2.1%、0.5%。实验表明本模型能够更准确地向目标借阅者推荐历史感兴趣类别的新图书及潜在感兴趣的新类别的图书。

关键词: 图书推荐, 图书管理系统, 数据挖掘, 推荐算法

Abstract: Concerning the problem of high time complexity of traditional recommendation algorithms, a new recommendation model based on Latent Dirichlet Allocation (LDA) model was proposed. It was a data mining model applied to Book Recommendation (BR) in library management systems, named Book Recommendation_Latent Dirichlet Allocation (BR_LDA) model. Through the content similarity analysis of historical borrowing data of the target borrowers with other books, other books which had high content similarities with historical borrowing books of the target borrowers were gotten. Through the similarity analyses performed on the target borrowers' historical borrowing data and historical data from other borrowers, historical borrowing data of the nearest neighbors were gotten. Books which the target borrowers were interested in could be finally gotten by calculating the probabilities of the recommended books. In particular, when the number of recommended books is 4000, the precision of BR_LDA model is 6.2% higher than multi-feature method and 4.5% higher than association rule method; when the recommended list has 500 items, the precision of BR_LDA model is 2.1% higher than collaborative filtering based on the nearest neighbors and 0.5% higher than collaborative filtering based on matrix decomposition. The experimental results show that this model can efficiently mine data of books, reasonably recommend new books which belong to historical interested categories and new books in potential interested categories to the target borrowers.

Key words: Book Recommendation (BR), library management system, data mining, recommendation algorithm

中图分类号: