基于堆栈降噪自编码器改进的混合推荐算法

doi:10.11772/j.issn.1001-9081.2017123060

计算机应用 ›› 2018, Vol. 38 ›› Issue (7): 1866-1871.DOI: 10.11772/j.issn.1001-9081.2017123060

基于堆栈降噪自编码器改进的混合推荐算法

杨帅¹, 王鹃²

1. 国家多媒体软件工程技术研究中心(武汉大学计算机学院), 武汉 430072;
2. 空天信息安全与可信计算教育部重点实验室(武汉大学计算机学院), 武汉 430072

收稿日期:2017-12-29 修回日期:2018-02-26 出版日期:2018-07-10 发布日期:2018-07-12
通讯作者: 杨帅
作者简介:杨帅(1993-),男,山东枣庄人,硕士研究生,主要研究方向:通信与信息系统、模式识别;王鹃(1980-),女,湖北武汉人,副教授,博士,主要研究方向:系统与网络安全、访问控制、可信计算、云计算、SDN安全。
基金资助:
国家自然科学基金资助项目（61402342）。

Improved hybrid recommendation algorithm based on stacked denoising autoencoder

YANG Shuai¹, WANG Juan²

1. National Engineering Research Center for Multimedia Software(School of Computer Science, Wuhan University), Wuhan Hubei 430072, China;
2. Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education(School of Computer Science, Wuhan University), Wuhan Hubei 430072, China

Received:2017-12-29 Revised:2018-02-26 Online:2018-07-10 Published:2018-07-12
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61402342).

摘要/Abstract

摘要： 针对传统协同过滤算法仅利用评分信息作为推荐依据，没有利用用户评论和标签信息，无法准确反映用户对项目特征的偏好，推荐精确度低且容易过拟合等问题，提出一种基于堆栈降噪自编码（SDAE）改进的混合推荐（SDHR）算法。首先利用深度学习模型SDAE从用户自由文本标签中抽取项目的显式特征信息；然后，改进隐因子模型（LFM）算法，使用显式项目特征信息替换LFM中的抽象特征，进行矩阵分解训练；最后通过用户-项目偏好矩阵为用户提供推荐。在公开数据集MovieLens上的实验测试，与三组推荐模型（基于标签权重及协同过滤、基于SDAE和极限学习机、基于循环神经网络）比较，该算法推荐精确度分别提高了45.2%、38.4%和16.1%。实验结果表明，所提算法可以充分利用项目自由文本标签信息提高推荐性能。

Abstract: Concerning the problem that traditional collaborative filtering algorithm just utilizes users' ratings on items when generating recommendation, without considering users' labels or comments, which can not reflect users' real preference on different items and the prediction accuracy is not high and easily overfits, a Stacked Denoising AutoEncoder (SDAE)-based improved Hybrid Recommendation (SDHR) algorithm was proposed. Firstly, SDAE was used to extract items' explicit features from users' free-text labels. Then, Latent Factor Model (LFM) algorithm was improved, the LFM's abstract item features were replaced with extracted explicit ones to train matrix decomposition model. Finally, the user-item preference matrix was used to generate recommendations. Experimental tests on the dataset MovieLens showed that the accuracy of the proposed algorithm was improved by 38.4%, 16.1% and 45.2% respectively compared to the three recommendation models (including the model based on label-based weights with collaborative filtering, the model based on SDAE and extreme learning machine, and the model based on recurrent neural networks). The experimental results show that the proposed algorithm can make full use of items' free-text label information to improve recommendation performance.

Key words: recommendation system, collaborative filtering, deep learning, Stacked Denoising AutoEncoder (SDAE), Latent Factor Model (LFM)

中图分类号:

TP181

杨帅, 王鹃. 基于堆栈降噪自编码器改进的混合推荐算法[J]. 计算机应用, 2018, 38(7): 1866-1871.

YANG Shuai, WANG Juan. Improved hybrid recommendation algorithm based on stacked denoising autoencoder[J]. Journal of Computer Applications, 2018, 38(7): 1866-1871.

参考文献

[1] RICCI F, ROKACH L, SHAPIRA B, et al. Recommender Systems Handbook[M]. Berlin:Springer, 2015:127-131.
[2] TUZHILIN A. Towards the next generation of recommender systems:a survey of the state-of-the-art and possible extensions[J]. IEEE Transactions of Knowledge and Data Engineering, 2005, 17(6):734-749.
[3] ABHISHEK K, KULKARNI S, KUMAR V N, et al. A review on personalized information recommendation system using collaborative filtering[J]. International Journal of Computer Science and Information Technologies, 2011, 2(3):1272-1278.
[4] HU L, CAO J, XU G, et al. Personalized recommendation via cross-domain triadic factorization[C]//Proceedings of the 22nd International World Wide Web Conference. New York:ACM, 2013:595-606.
[5] SEVIL S G, KUCUKTUNC O, DUYGULU P, et al. Automatic tag expansion using visual similarity for photo sharing websites[J]. Multimedia Tools & Applications, 2010, 49(1):81-99.
[6] WANG C, BLEI D M. Collaborative topic modeling for recommending scientific articles[C]//Proceedings of the 2011 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2011:448-456.
[7] 赵宇翔,范哲,朱庆华.用户生成内容(UGC)概念解析及研究进展[J].中国图书馆学报,2012,38(5):68-81.(ZHAO Y X, FAN Z, ZHU Q H. Conceptualization and research progress on user-generated content[J]. Journal of Library Science in China, 2012, 38(5):68-81.)
[8] 张敏,丁弼原,马为之,等.基于深度学习加强的混合推荐方法[J].清华大学学报(自然科学版),2017,57(10):1014-1021.(ZHANG M, DING B Y, MA W Z, et al. Hybrid recommendation approach enhanced by deep learning[J]. Journal of Tsinghua University (Science and Technology), 2017, 57(10):1014-1021.)
[9] WANG H, WANG N, YEUNG D Y. Collaborative deep learning for recommender systems[C]//Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2014:1235-1244.
[10] ALMAHAIRI A, KASTNER K, CHO K, et al. Learning distributed representations from reviews for collaborative filtering[C]//Proceedings of the 9th ACM Conference on Recommender Systems. New York:ACM, 2015:147-154.
[11] GOLDER S A, HUBERMAN B A. The structure of collaborative tagging systems[J]. Journal of Information Science, 2006, 32(2):198-208.
[12] ADRIAN B, SAUERMANN L, ROTH T. ConTag:a semantic tag recommendation system[C]//I-SEMANTICS 2007:Proceedings of the 3rd International Semantic Technology Conference. New York:ACM, 2007:297-304.
[13] WANG H, SHI X, YEUNG Y. Relational stacked denoising autoencoder for tag recommendation[C]//Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence. Menlo Park:AAAI Press, 2015:3052-3058.
[14] VINCENT P, LAROCHELLE H, LAJOIE I, et al. Stacked de-noising autoencoders:learning useful representations in a deep network with a local denoising criterion[J]. Journal of Machine Learning Research, 2010, 11(12):3371-3408.
[15] HINTON G, OSINDERO S, TEH Y. A fast learning algorithm for deep belief nets[J]. Neural Computation, 2006, 18(7):1527-1554.
[16] ZHANG W, WANG J, FENG W. Combining latent factor model with location features for event-based group recommendation[C]//Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York:ACM, 2013:910-918.
[17] YUAN K, LING Q, YIN W. On the convergence of decentralized gradient descent[J]. SIAM Journal on Optimization, 2016, 26(3):1835-1854.
[18] REFAEILZADEH P, TANG L, LIU H. Cross-validation[M]//Encyclopedia of Database Systems. Berlin:Springer, 2009:532-538.
[19] 郭彩云,王会进.改进的基于标签的协同过滤算法[J].计算机工程与应用,2016,52(8):56-61.(GUO C Y, WANG H J. Improved collaborative filtering algorithm based on tags[J]. Computer Engineering and Applications, 2016, 52(8):56-61.)
[20] 潘昊,王新伟.基于SDAE及极限学习机模型的协同过滤应用研究[J].计算机应用研究,2017,34(8):2332-2335.(PAN H, WANG X W. Study on collaborative filtering recommendation algorithm based on extreme learning machine stacked denoising autoencodes[J]. Application Research of Computers, 2017, 34(8):2332-2335.)

基于堆栈降噪自编码器改进的混合推荐算法

Improved hybrid recommendation algorithm based on stacked denoising autoencoder

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	赵宏, 孔东一. 图像特征注意力与自适应注意力融合的图像内容中文描述[J]. 计算机应用, 2021, 41(9): 2496-2503.
[2]	徐江浪, 李林燕, 万新军, 胡伏原. 结合目标检测的室内场景识别方法[J]. 计算机应用, 2021, 41(9): 2720-2725.
[3]	陈成瑞, 孙宁, 何世彪, 廖勇. 面向C-V2X通信的基于深度学习的联合信道估计与均衡算法[J]. 计算机应用, 2021, 41(9): 2687-2693.
[4]	谢德峰, 吉建民. 融入句法感知表示进行句法增强的语义解析[J]. 计算机应用, 2021, 41(9): 2489-2495.
[5]	代雨柔, 杨庆, 张凤荔, 周帆. 基于自监督学习的社交网络用户轨迹预测模型[J]. 计算机应用, 2021, 41(9): 2545-2551.
[6]	郑志强, 胡鑫, 翁智, 王雨禾, 程曦. 基于改进DenseNet的牛眼图像特征提取方法[J]. 计算机应用, 2021, 41(9): 2780-2784.
[7]	包玄, 陈红梅, 肖清. 融入时间的兴趣点协同推荐算法[J]. 计算机应用, 2021, 41(8): 2406-2411.
[8]	何正海, 线岩团, 王蒙, 余正涛. 融合句法指导与字符注意力机制的案情阅读理解方法[J]. 计算机应用, 2021, 41(8): 2427-2431.
[9]	曹玉红, 徐海, 刘荪傲, 王紫霄, 李宏亮. 基于深度学习的医学影像分割研究综述[J]. 计算机应用, 2021, 41(8): 2273-2287.
[10]	秦斌斌, 彭良康, 卢向明, 钱江波. 司机分心驾驶检测研究进展[J]. 计算机应用, 2021, 41(8): 2330-2337.
[11]	高钦泉, 黄炳城, 刘文哲, 童同. 基于改进CenterNet的竹条表面缺陷检测方法[J]. 计算机应用, 2021, 41(7): 1933-1938.
[12]	李亚芳, 梁烨, 冯韦玮, 祖宝开, 康玉健. 基于社区优化的深度网络嵌入方法[J]. 计算机应用, 2021, 41(7): 1956-1963.
[13]	刘欢, 李晓戈, 胡立坤, 胡飞雄, 王鹏华. 基于知识图谱驱动的图神经网络推荐模型[J]. 计算机应用, 2021, 41(7): 1865-1870.
[14]	杜炎, 吕良福, 焦一辰. 基于模糊推理的模糊原型网络[J]. 计算机应用, 2021, 41(7): 1885-1890.
[15]	侯笑晗, 金国栋, 谭力宁, 薛远亮. 基于自适应和最优特征的合成孔径雷达舰船检测方法[J]. 计算机应用, 2021, 41(7): 2150-2155.