计算机应用 ›› 2014, Vol. 34 ›› Issue (12): 3502-3506.

• 数据技术 • 上一篇    下一篇

基于用户反馈与主题关联度的网页排序算法改进

王冲,曹姗姗   

  1. 桂林电子科技大学 计算机科学与工程学院,广西 桂林 541004
  • 收稿日期:2014-07-08 修回日期:2014-08-26 出版日期:2014-12-01 发布日期:2014-12-31
  • 通讯作者: 曹姗姗
  • 作者简介:王冲(1972-),男,四川宣汉人,副教授,硕士,主要研究方向:信息检索、多媒体教育;曹姗姗(1988-),女,河南平顶山人,硕士,主要研究方向:信息检索。
  • 基金资助:

    2014年桂林电子科技大学重点教改项目;2014广西可信软件重点实验室项目基金资助项目;2015广西教育教学改革A类项目

Improved PageRank algorithm based on user feedback and topic relevance

WANG Chong,CAO Shanshan   

  1. College of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin Guangxi 541004, China
  • Received:2014-07-08 Revised:2014-08-26 Online:2014-12-01 Published:2014-12-31
  • Contact: CAO Shanshan

摘要:

针对传统PageRank算法存在主题漂移、忽略用户兴趣及偏向旧网页的问题,提出一种基于用户反馈与主题关联度的网页排序改进算法。该算法为了更好满足用户的检索需求,利用用户对链接的点击量、链接结构及网页浏览时间来构成用户反馈因子,同时结合网页内容的主题关联度因子,共同对网页PR值进行适当修正与合理分配。为了改善网页排序的效果,算法通过添加时间相关因子,对新网页作出一定补偿,使得新网页一定程度上浮,旧网页下沉。实验结果表明,所提算法在相同实验环境下,相对于传统PageRank算法,提升了用户搜索满意度平均值约2.1%,达到了优化网页排序效果的预期研究目标。

Abstract:

Concerning the problems that exist in traditional PageRank algorithm, such as topic drifting, neglecting user browsing interests and stressing on old Web pages, an improved PageRank algorithm was proposed. To satisfy user requirements better, factors of users' clicks to links, link structure, browser time on pages, topic relevance decided by contents and existing time of pages were taken into consideration. The experimental results show that compared with the traditional PageRank algorithm, the average value of users' degree of satisfaction has been promoted approximately by 2.1% with the proposed algorithm, and ranking results has been optimized in a certain extent.

中图分类号: