基于堆栈降噪自编码器改进的混合推荐算法

doi:10.11772/j.issn.1001-9081.2017123060

计算机应用 ›› 2018, Vol. 38 ›› Issue (7): 1866-1871.DOI: 10.11772/j.issn.1001-9081.2017123060

基于堆栈降噪自编码器改进的混合推荐算法

杨帅¹, 王鹃²

1. 国家多媒体软件工程技术研究中心(武汉大学计算机学院), 武汉 430072;
2. 空天信息安全与可信计算教育部重点实验室(武汉大学计算机学院), 武汉 430072

收稿日期:2017-12-29 修回日期:2018-02-26 发布日期:2018-07-12 出版日期:2018-07-10
通讯作者: 杨帅
作者简介:杨帅(1993-),男,山东枣庄人,硕士研究生,主要研究方向:通信与信息系统、模式识别;王鹃(1980-),女,湖北武汉人,副教授,博士,主要研究方向:系统与网络安全、访问控制、可信计算、云计算、SDN安全。
基金资助:
国家自然科学基金资助项目（61402342）。

Improved hybrid recommendation algorithm based on stacked denoising autoencoder

YANG Shuai¹, WANG Juan²

1. National Engineering Research Center for Multimedia Software(School of Computer Science, Wuhan University), Wuhan Hubei 430072, China;
2. Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education(School of Computer Science, Wuhan University), Wuhan Hubei 430072, China

Received:2017-12-29 Revised:2018-02-26 Online:2018-07-12 Published:2018-07-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61402342).

摘要/Abstract

摘要： 针对传统协同过滤算法仅利用评分信息作为推荐依据，没有利用用户评论和标签信息，无法准确反映用户对项目特征的偏好，推荐精确度低且容易过拟合等问题，提出一种基于堆栈降噪自编码（SDAE）改进的混合推荐（SDHR）算法。首先利用深度学习模型SDAE从用户自由文本标签中抽取项目的显式特征信息；然后，改进隐因子模型（LFM）算法，使用显式项目特征信息替换LFM中的抽象特征，进行矩阵分解训练；最后通过用户-项目偏好矩阵为用户提供推荐。在公开数据集MovieLens上的实验测试，与三组推荐模型（基于标签权重及协同过滤、基于SDAE和极限学习机、基于循环神经网络）比较，该算法推荐精确度分别提高了45.2%、38.4%和16.1%。实验结果表明，所提算法可以充分利用项目自由文本标签信息提高推荐性能。

Abstract: Concerning the problem that traditional collaborative filtering algorithm just utilizes users' ratings on items when generating recommendation, without considering users' labels or comments, which can not reflect users' real preference on different items and the prediction accuracy is not high and easily overfits, a Stacked Denoising AutoEncoder (SDAE)-based improved Hybrid Recommendation (SDHR) algorithm was proposed. Firstly, SDAE was used to extract items' explicit features from users' free-text labels. Then, Latent Factor Model (LFM) algorithm was improved, the LFM's abstract item features were replaced with extracted explicit ones to train matrix decomposition model. Finally, the user-item preference matrix was used to generate recommendations. Experimental tests on the dataset MovieLens showed that the accuracy of the proposed algorithm was improved by 38.4%, 16.1% and 45.2% respectively compared to the three recommendation models (including the model based on label-based weights with collaborative filtering, the model based on SDAE and extreme learning machine, and the model based on recurrent neural networks). The experimental results show that the proposed algorithm can make full use of items' free-text label information to improve recommendation performance.

Key words: recommendation system, collaborative filtering, deep learning, Stacked Denoising AutoEncoder (SDAE), Latent Factor Model (LFM)

中图分类号:

TP181

杨帅, 王鹃. 基于堆栈降噪自编码器改进的混合推荐算法[J]. 计算机应用, 2018, 38(7): 1866-1871.

YANG Shuai, WANG Juan. Improved hybrid recommendation algorithm based on stacked denoising autoencoder[J]. Journal of Computer Applications, 2018, 38(7): 1866-1871.

参考文献

[1] RICCI F, ROKACH L, SHAPIRA B, et al. Recommender Systems Handbook[M]. Berlin:Springer, 2015:127-131.
[2] TUZHILIN A. Towards the next generation of recommender systems:a survey of the state-of-the-art and possible extensions[J]. IEEE Transactions of Knowledge and Data Engineering, 2005, 17(6):734-749.
[3] ABHISHEK K, KULKARNI S, KUMAR V N, et al. A review on personalized information recommendation system using collaborative filtering[J]. International Journal of Computer Science and Information Technologies, 2011, 2(3):1272-1278.
[4] HU L, CAO J, XU G, et al. Personalized recommendation via cross-domain triadic factorization[C]//Proceedings of the 22nd International World Wide Web Conference. New York:ACM, 2013:595-606.
[5] SEVIL S G, KUCUKTUNC O, DUYGULU P, et al. Automatic tag expansion using visual similarity for photo sharing websites[J]. Multimedia Tools & Applications, 2010, 49(1):81-99.
[6] WANG C, BLEI D M. Collaborative topic modeling for recommending scientific articles[C]//Proceedings of the 2011 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2011:448-456.
[7] 赵宇翔,范哲,朱庆华.用户生成内容(UGC)概念解析及研究进展[J].中国图书馆学报,2012,38(5):68-81.(ZHAO Y X, FAN Z, ZHU Q H. Conceptualization and research progress on user-generated content[J]. Journal of Library Science in China, 2012, 38(5):68-81.)
[8] 张敏,丁弼原,马为之,等.基于深度学习加强的混合推荐方法[J].清华大学学报(自然科学版),2017,57(10):1014-1021.(ZHANG M, DING B Y, MA W Z, et al. Hybrid recommendation approach enhanced by deep learning[J]. Journal of Tsinghua University (Science and Technology), 2017, 57(10):1014-1021.)
[9] WANG H, WANG N, YEUNG D Y. Collaborative deep learning for recommender systems[C]//Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2014:1235-1244.
[10] ALMAHAIRI A, KASTNER K, CHO K, et al. Learning distributed representations from reviews for collaborative filtering[C]//Proceedings of the 9th ACM Conference on Recommender Systems. New York:ACM, 2015:147-154.
[11] GOLDER S A, HUBERMAN B A. The structure of collaborative tagging systems[J]. Journal of Information Science, 2006, 32(2):198-208.
[12] ADRIAN B, SAUERMANN L, ROTH T. ConTag:a semantic tag recommendation system[C]//I-SEMANTICS 2007:Proceedings of the 3rd International Semantic Technology Conference. New York:ACM, 2007:297-304.
[13] WANG H, SHI X, YEUNG Y. Relational stacked denoising autoencoder for tag recommendation[C]//Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence. Menlo Park:AAAI Press, 2015:3052-3058.
[14] VINCENT P, LAROCHELLE H, LAJOIE I, et al. Stacked de-noising autoencoders:learning useful representations in a deep network with a local denoising criterion[J]. Journal of Machine Learning Research, 2010, 11(12):3371-3408.
[15] HINTON G, OSINDERO S, TEH Y. A fast learning algorithm for deep belief nets[J]. Neural Computation, 2006, 18(7):1527-1554.
[16] ZHANG W, WANG J, FENG W. Combining latent factor model with location features for event-based group recommendation[C]//Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2013:910-918.
[17] YUAN K, LING Q, YIN W. On the convergence of decentralized gradient descent[J]. SIAM Journal on Optimization, 2016, 26(3):1835-1854.
[18] REFAEILZADEH P, TANG L, LIU H. Cross-validation[M]//Encyclopedia of Database Systems. Berlin:Springer, 2009:532-538.
[19] 郭彩云,王会进.改进的基于标签的协同过滤算法[J].计算机工程与应用,2016,52(8):56-61.(GUO C Y, WANG H J. Improved collaborative filtering algorithm based on tags[J]. Computer Engineering and Applications, 2016, 52(8):56-61.)
[20] 潘昊,王新伟.基于SDAE及极限学习机模型的协同过滤应用研究[J].计算机应用研究,2017,34(8):2332-2335.(PAN H, WANG X W. Study on collaborative filtering recommendation algorithm based on extreme learning machine stacked denoising autoencodes[J]. Application Research of Computers, 2017, 34(8):2332-2335.)

基于堆栈降噪自编码器改进的混合推荐算法

Improved hybrid recommendation algorithm based on stacked denoising autoencoder

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	黄云川, 江永全, 黄骏涛, 杨燕. 基于元图同构网络的分子毒性预测[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2964-2969.
[2]	秦璟, 秦志光, 李发礼, 彭悦恒. 基于概率稀疏自注意力神经网络的重性抑郁疾患诊断[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2970-2974.
[3]	王熙源, 张战成, 徐少康, 张宝成, 罗晓清, 胡伏原. 面向手术导航3D/2D配准的无监督跨域迁移网络[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2911-2918.
[4]	杨兴耀, 陈羽, 于炯, 张祖莲, 陈嘉颖, 王东晓. 结合自我特征和对比学习的推荐模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2704-2710.
[5]	李顺勇, 李师毅, 胥瑞, 赵兴旺. 基于自注意力融合的不完整多视图聚类算法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2696-2703.
[6]	潘烨新, 杨哲. 基于多级特征双向融合的小目标检测优化模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2871-2877.
[7]	唐廷杰, 黄佳进, 秦进. 基于图辅助学习的会话推荐[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2711-2718.
[8]	唐廷杰, 黄佳进, 秦进, 陆辉. 基于图共现增强多层感知机的会话推荐[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2357-2364.
[9]	刘禹含, 吉根林, 张红苹. 基于骨架图与混合注意力的视频行人异常检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2551-2557.
[10]	顾焰杰, 张英俊, 刘晓倩, 周围, 孙威. 基于时空多图融合的交通流量预测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2618-2625.
[11]	石乾宏, 杨燕, 江永全, 欧阳小草, 范武波, 陈强, 姜涛, 李媛. 面向空气质量预测的多粒度突变拟合网络[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2643-2650.
[12]	赵亦群, 张志禹, 董雪. 基于密集残差物理信息神经网络的各向异性旅行时计算方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2310-2318.
[13]	徐松, 张文博, 王一帆. 基于时空信息的轻量视频显著性目标检测网络[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2192-2199.
[14]	孙逊, 冯睿锋, 陈彦如. 基于深度与实例分割融合的单目3D目标检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2208-2215.
[15]	吴筝, 程志友, 汪真天, 汪传建, 王胜, 许辉. 基于深度学习的患者麻醉复苏过程中的头部运动幅度分类方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2258-2263.