基于自适应提升的概率矩阵分解算法

doi:10.11772/j.issn.1001-9081.2015.12.3497

计算机应用 ›› 2015, Vol. 35 ›› Issue (12): 3497-3501.DOI: 10.11772/j.issn.1001-9081.2015.12.3497

基于自适应提升的概率矩阵分解算法

彭行雄^1,2, 肖如良^1,2, 张桂刚³

1. 福建师范大学软件学院, 福州 350117;
2. 大数据分析与应用福建省高校工程研究中心, 福州 350117;
3. 中国科学院自动化研究所, 北京 100190

收稿日期:2015-06-02 修回日期:2015-08-25 发布日期:2015-12-10 出版日期:2015-12-10
通讯作者: 肖如良(1966-),男,湖南娄底人,教授,博士,主要研究方向:大数据云服务
作者简介:彭行雄(1991-),男,湖北孝感人,硕士研究生,主要研究方向:机器学习;张桂刚(1978-),男,湖南邵阳人,副研究员,博士,主要研究方向:云计算、海量数据处理。
基金资助:
教育部规划基金项目(11YJA860028);福建省科技计划重大项目(2011H6006)。

Probabilistic matrix factorization algorithm based on AdaBoost

PENG Xingxiong^1,2, XIAO Ruliang^1,2, ZHANG Guigang³

1. Faculty of Software, Fujian Normal University, Fuzhou Fujian 350117, China;
2. Fujian Provincial University Engineering Research Center of Big Data Analysis and Application, Fuzhou Fujian 350117, China;
3. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Received:2015-06-02 Revised:2015-08-25 Online:2015-12-10 Published:2015-12-10

摘要/Abstract

摘要： 针对推荐系统中概率矩阵分解模型(PMF)泛化能力(对新用户和物品的推荐性能)较差、预测准确性不高的问题,提出一种新的基于自适应提升的概率矩阵分解算法(AdaBoostPMF)。该算法首先为每个样本分配样本权重;然后根据PMF中的每一轮随机梯度下降法学习用户和物品特征向量,并计算总体预测误差均值和标准差。从全局的角度利用AdaBoost思想自适应调整样本权重,使算法更注重学习预测误差较大的样本;最后对预测误差分配样本权重,让用户和物品特征向量找到更合适的优化方向。相比传统的PMF算法,AdaBoostPMF算法能够将预测精度平均提高约2.5%。实验结果表明,该算法通过加权预测误差较大的样本,能够较好地拟合用户特征向量和物品特征向量,提高预测精度,可以有效地应用于研究个性化推荐。

Abstract: Concerning the poor generalization ability (the recommended performance for new users and items) and low predictive accuracy of Probabilistic Matrix Factorization (PMF) in recommendation system, a new algorithm of Probabilistic Matrix Factorization algorithm based on AdaBoost (AdaBoostPMF) was proposed. Firstly, the initial weight for each sample was assigned. Secondly, the feature vectors of users and items were learned by each round of PMF stochastic gradient descent method and the global mean and standard deviation of the prediction error were calculated. The sample weights were adaptively adjusted by using AdaBoost from the a global perspective, which made the proposed algorithm pay more attention to training those samples with the larger prediction error than others. Finally, the sample weights were assigned to predictive error, which found the more appropriate optimum direction for feature vectors of users and items. Compared with traditional PMF algorithm, the proposed AdaBoostPMF algorithm could significantly improve the prediction precision by about 2.5% on average. The experimental results show that, the proposed algorithm can better fit the user feature vector and the item feature vector and improve the prediction accuracy by weighting the samples with larger prediction error.The proposed algorithm can be effectively applied to the personalized recommendation.

Key words: recommendation system, Probabilistic Matrix Factorization (PMF), AdaBoost, model blending, rating prediction

中图分类号:

TP181
TP393

彭行雄, 肖如良, 张桂刚. 基于自适应提升的概率矩阵分解算法[J]. 计算机应用, 2015, 35(12): 3497-3501.

PENG Xingxiong, XIAO Ruliang, ZHANG Guigang. Probabilistic matrix factorization algorithm based on AdaBoost[J]. Journal of Computer Applications, 2015, 35(12): 3497-3501.

参考文献

[1] KAPOOR K, SUBBIAN K, SRIVASTAVA J, et al.Just in time recommendations:modeling the dynamics of boredom in activity streams[C]//WSDM'15:Proceedings of the 8th ACM International Conference on Web Search and Data Mining. New York:ACM, 2015:233-242.
[2] GU W, DONG S, ZENG Z. Increasing recommended effectiveness with Markov chains and purchase intervals[J]. Neural Computing and Applications, 2014, 25(5):1153-1162.
[3] BAUER J, NANOPOULOS A. Recommender systems based on quantitative implicit customer feedback[J]. Decision Support Systems, 2014, 68:77-88.
[4] SALAKHUTDINOV R, MNIH A. Probabilistic matrix factorization[C]//Proceedings of the 2008 Annual Conference on Neural Information Processing Systems 20. Cambridge:MIT Press, 2008:1257-1264.
[5] SALAKHUTDINOV R, MNIH A. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo[C]//ICML'08:Proceedings of the 25th International Conference on Machine Learning. New York:ACM, 2008:880-887.
[6] ZHOU T, SHAN H, BANERJEE A, et al.Kernelized probabilistic matrix factorization:exploiting graphs and side information[C]//Proceedings of the 12th SIAM International Conference on Data Mining. Philadelphia:SIAM, 2012:403-414.
[7] ZHOU M. Application and research of boosting theory in the recommendation algorithm[D]. Chengdu:University of Electronic Science and Technology, 2012:50-65. (周密.Boosting理论在推荐算法中的应用与研究[D].成都:电子科技大学:2012,50-65.)
[8] LI P, XIAO R, DENG X, et al.A novel appocach to matrix factorization recommender system using gravitational impacts[J]. Journal of Chinese Computer Systems, 2015, 36(4):696-700.(李鹏澎,肖如良,邓新国,等.一种融合引力影响的新的矩阵分解推荐方法[J].小型微型计算机系统,2015,36(4):696-700.)
[9] FREUND Y, SCHAPIRE R E. A decision-theoretic generalization of on-line learning and an application to boosting[J]. Journal of Computer and System Sciences, 1997, 55(1):119-139.
[10] FREUND Y, IYER R, SCHAPIRE R E, et al.An efficient boosting algorithm for combining preferences[J]. Journal of Machine Learning Research, 2003, 4(6):933-969.
[11] SCHAPIRE R E, FREUND Y, BARTLETT P, et al.Boosting the margin:a new explanation for the effectiveness of voting methods[J]. The Annals of Statistics, 1998, 26(5):1651-1686.
[12] MIWA S, HIRAI T, SUMI K. Robust face detection using one-class estimation and real AdaBoost[J]. Electronics and Communications in Japan, 2014, 97(7):39-47.
[13] WARMUTH M K, GLOCER K A, VISHWANATHAN S V N. Entropy regularized LPBoost[C]//Proceedings of the 19th International Conference on Algorithmic Learning Theory, LNCS 5254. Berlin:Springer, 2008:256-271.

[1]	唐廷杰, 黄佳进, 秦进. 基于图辅助学习的会话推荐[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2711-2718.
[2]	唐廷杰, 黄佳进, 秦进, 陆辉. 基于图共现增强多层感知机的会话推荐[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2357-2364.
[3]	汪炅, 唐韬韬, 贾彩燕. 无负采样的正样本增强图对比学习推荐方法PAGCL[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1485-1492.
[4]	荆智文, 张屿佳, 孙伯廷, 郭浩. 二阶段孪生图卷积神经网络推荐算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 469-476.
[5]	曾蠡, 杨婧如, 黄罡, 景翔, 罗超然. 超图应用方法综述：问题、进展与挑战[J]. 《计算机应用》唯一官方网站, 2024, 44(11): 3315-3326.
[6]	周北京, 王海荣, 王怡梦, 张丽丝, 马赫. 图谱嵌入传播的推荐方法[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3252-3259.
[7]	刘源, 董永权, 贾瑞, 杨昊霖. 面向个性化课程推荐的分层分期注意力网络模型[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2358-2363.
[8]	叶坤佩, 熊熙, 丁哲. 基于领域融合和时间权重的招工推荐模型[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2133-2139.
[9]	孙浩, 曹健, 李海生, 毛典辉. 基于改进胶囊网络的会话型推荐模型[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1043-1049.
[10]	孙轩宇, 史艳翠. 融合项目影响力的图神经网络会话推荐模型[J]. 《计算机应用》唯一官方网站, 2023, 43(12): 3689-3696.
[11]	魏楚元, 王梦珂, 户传豪, 张桄齐. 增强推荐系统可解释性的深度评论注意力神经网络模型[J]. 《计算机应用》唯一官方网站, 2023, 43(11): 3443-3448.
[12]	赵学健, 李豪, 唐浩天. 基于用户兴趣概念格约简的推荐评分预测算法[J]. 《计算机应用》唯一官方网站, 2023, 43(11): 3340-3345.
[13]	姚华勇, 叶东毅, 陈昭炯. 考虑多粒度反馈的多轮对话强化学习推荐算法[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 15-21.
[14]	周嘉凡, 杜岳峰, 宋宝燕, 李晓光, 赵阿珠, 肖绪界. 基于元路径注意力机制的MOOC视频推荐方法[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1808-1813.
[15]	王利娥, 李小聪, 刘红翼. 融合知识图谱和差分隐私的新闻推荐方法[J]. 《计算机应用》唯一官方网站, 2022, 42(5): 1339-1346.

基于自适应提升的概率矩阵分解算法

Probabilistic matrix factorization algorithm based on AdaBoost

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics