计算机应用 ›› 2017, Vol. 37 ›› Issue (11): 3188-3193.DOI: 10.11772/j.issn.1001-9081.2017.11.3188

• 第十六届中国机器学习会议(CCML 2017) • 上一篇    下一篇

基于Hadoop的IPTV隐式评分模型

顾军华1, 官磊1, 张建1, 高星1, 张素琪2   

  1. 1. 河北工业大学 计算机科学与软件学院, 天津 300401;
    2. 天津商业大学 信息工程学院, 天津 300134
  • 收稿日期:2017-05-16 修回日期:2017-07-05 出版日期:2017-11-10 发布日期:2017-11-11
  • 通讯作者: 顾军华
  • 作者简介:顾军华(1966-),男,河北赵县人,教授,博士,CCF会员,主要研究方向:数据挖掘、智能信息处理、信息采集与集成、智能计算与优化、软件工程;官磊(1992-),男,河南信阳人,硕士研究生,主要研究方向:智能信息处理;张建(1993-),男,河北涿州人,硕士研究生,主要研究方向:数据挖掘;高星(1992-),女,河北赵县人,硕士研究生,主要研究方向:商务智能、软计算;张素琪(1980-),女,河北隆尧人,讲师,博士,CCF会员,主要研究方向:数据挖掘。
  • 基金资助:
    天津市自然科学基金资助项目(15JCQNJC00600,14JCYBJC15900)。

IPTV implicit scoring model based on Hadoop

GU Junhua1, GUAN Lei1, ZHANG Jian1, GAO Xing1, ZHANG Suqi2   

  1. 1. School of Computer Science and Software, Hebei University of Technology, Tianjin 300401, China;
    2. School of Information Engineering, Tianjin University of Commerce, Tianjin 300134, China
  • Received:2017-05-16 Revised:2017-07-05 Online:2017-11-10 Published:2017-11-11
  • Supported by:
    This work is partially supported by the Natural Science Foundation of Tianjin (15JCQNJC00600, 14JCYBJC15900).

摘要: 根据网路协定电视(IPTV)用户收视行为数据中的隐式特性,提出一种新型的隐式评分模型。首先,介绍了IPTV用户收视行为数据的主要特点,提出一种新的用户收视比值、用户兴趣偏置因子以及视频类型影响因子相结合的多特征混合隐式评分模型;然后,提出基于收视时长和收视比值的收视行为筛选策略;最后,设计并实现了基于Hadoop的分布式模型架构。实验结果表明,所提模型有效提高了IPTV系统中推荐结果的质量,同时提升了时间效率,对于大规模数据有良好的可扩展性。

关键词: 隐式反馈, 分布式模型, 兴趣模型, 网路协定电视

Abstract: According to the implicit characteristics of IPTV (Internet Protocol Television) user viewing behavior data, a novel implicit rating model was proposed. Firstly, the main features of IPTV user viewing behavior data were introduced, and a new mixed feature implicit scoring model was proposed, which combined with viewing ratio, user interest bias factor and video type influence factor. Secondly, the strategy of viewing behavior based on viewing time and viewing ratio was proposed. Finally, a distributed model architecture based on Hadoop was designed and implemented. The experimental results show that the proposed novel model effectively improves the quality of the recommended results in the IPTV system, improves the time efficiency, and has good scalability for large amounts of data.

Key words: implicit feedback, distributed model, interest model, Internet Protocol Television (IPTV)

中图分类号: