计算机应用 ›› 2012, Vol. 32 ›› Issue (12): 3331-3334.DOI: 10.3724/SP.J.1087.2012.03331

• 人工智能 • 上一篇    下一篇

一种两层加权融合的排序算法

胡小生,钟勇   

  1. 佛山科学技术学院 电子与信息工程学院, 广东 佛山 528000
  • 收稿日期:2012-06-28 修回日期:2012-08-13 发布日期:2012-12-29 出版日期:2012-12-01
  • 通讯作者: 胡小生
  • 作者简介:胡小生(1978-),男,湖北黄冈人,讲师,高级工程师,主要研究方向:信息检索、机器学习;〓钟勇(1970-),男,江西吉安人,教授,博士,主要研究方向:信息安全、信息检索、云计算。
  • 基金资助:
    广东省科技计划项目(2010B020315028);佛山市科技发展专项资金项目

Two-tier weighting aggregation ranking algorithm

HU Xiao-sheng,ZHONG Yong   

  1. College of Electronic and Information Engineering, Foshan University, Foshan Guangdong 528000, China
  • Received:2012-06-28 Revised:2012-08-13 Online:2012-12-29 Published:2012-12-01
  • Contact: HU Xiao-sheng

摘要: 当前排序学习算法在学习时将样本集中的所有查询及其相关文档等同对待,忽略了查询之间以及其相关文档之间的差异性,影响了排序模型的性能。对查询之间的差异进行分析,同时考虑文档排序位置造成的资料被检视概率不同的差异特性,提出了一种两层加权融合的排序方法。该方法为每一个查询及其相关文档建立一个子排序模型,在此过程中,对文档赋予非对称权重,然后通过建立新的损失函数作为优化目标,利用损失函数调节不同查询产生损失之间的权重,最终实现多查询相关排序模型的加权融合。在标准数据集LETOR OHSUMED上的实验结果表明,所提方法在排序性能上有较大提升。

关键词: 信息检索, 排序学习, 非对称加权, 融合

Abstract: In ranking for document retrieval, queries often vary greatly from one another. However, most of the existing ranking methods do not consider significant differences between queries. Correctly ranking documents on the top of the result list is crucial, and one must conduct training in a way that such ranked results are accurate. A two-tier weighting aggregation ranking method was proposed. This method consisted of two steps, training of base rankers and query-level ranker aggregation. First, base rankers were established based on each query, assigning asymmetric weights to its relevant documents, then, query-level ranker aggregation used a supervised approach to learn query-dependent weights when these base rankers were combined. The experimental results on the benchmark data set LETRO ONHSUMED show that the ranking performance has been significantly improved.

Key words: information retrieval, learing to rank, asymmetric weighting, aggregation

中图分类号: