Journal of Computer Applications ›› 2011, Vol. 31 ›› Issue (11): 3108-3111. DOI: 10.3724/SP.J.1087.2011.03108
• Artificial intelligence •
HE Hai-jiang,LONG Yue-jin
何海江,龙跃进
Abstract: An iterative co-ranking algorithm is proposed to extend learning to rank from the supervised setting to the semi-supervised setting. The approach employs two listwise rankers, each of which produces a document permutation for every unlabeled query. A likelihood listwise loss measures the difference score between the two learners for a given query. Unlabeled queries with a significant difference score are added to the training set for the next iteration, and the ideal document permutation given to each listwise ranker is the one produced by the other learner. Experimental results on the public dataset LETOR show that the proposed method improves the ranking performance of a supervised listwise ranking algorithm. The effect of the labeling ratio is also discussed.
Key words: document retrieval, semi-supervised, rank learning, likelihood loss, co-training
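The query-selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The likelihood loss here is the standard negative log Plackett-Luce likelihood of a permutation; the symmetric form of `difference_score`, the `top_k` cutoff, and all function names are assumptions, since the abstract does not specify how the difference score is computed or thresholded.

```python
import math


def likelihood_loss(scores, permutation):
    """Negative log Plackett-Luce likelihood of `permutation` under `scores`.

    permutation[i] is the index of the document placed at rank i.
    """
    loss = 0.0
    for i in range(len(permutation)):
        remaining = [scores[j] for j in permutation[i:]]
        m = max(remaining)  # shift by the max for a stable log-sum-exp
        log_norm = m + math.log(sum(math.exp(s - m) for s in remaining))
        loss += log_norm - scores[permutation[i]]
    return loss


def ranking(scores):
    """Document indices sorted by descending score."""
    return sorted(range(len(scores)), key=lambda j: -scores[j])


def difference_score(scores_a, scores_b):
    """Assumed symmetric disagreement: each ranker's scores are evaluated
    against the permutation produced by the other ranker."""
    return (likelihood_loss(scores_a, ranking(scores_b))
            + likelihood_loss(scores_b, ranking(scores_a)))


def select_queries(unlabeled, ranker_a, ranker_b, top_k):
    """Pick the top_k unlabeled queries on which the two rankers disagree most.

    `unlabeled` maps a query id to a list of documents; each ranker is a
    hypothetical callable mapping one document to a score.
    Returns (query_id, permutation_for_a, permutation_for_b) tuples, where
    each ranker receives the permutation produced by the other ranker.
    """
    scored = []
    for qid, docs in unlabeled.items():
        sa = [ranker_a(d) for d in docs]
        sb = [ranker_b(d) for d in docs]
        scored.append((difference_score(sa, sb), qid, ranking(sb), ranking(sa)))
    scored.sort(key=lambda t: -t[0])
    return [(qid, pa, pb) for _, qid, pa, pb in scored[:top_k]]
```

In a full co-training loop, the selected queries and their pseudo-label permutations would be appended to each ranker's training set and both rankers retrained before the next iteration; that retraining step is omitted here because it depends on the specific listwise learners used.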
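Abstract (translated from Chinese): To address the shortage of labeled training data, a co-training listwise learning-to-rank algorithm is proposed that mines implicit ranking information from unlabeled data. The algorithm uses two kinds of listwise rankers to build two different ranking functions from the currently available labeled data. Each unlabeled query therefore receives two different document permutations, whose similarity is computed with the likelihood loss. Queries whose permutations have low similarity are labeled, giving both listwise rankers additional training data. Experiments on the public learning-to-rank dataset LETOR confirm that the co-training ranking algorithm is effective. The influence of the labeling ratio on the algorithm is also discussed.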
Key words (translated from Chinese): document retrieval, semi-supervised, learning to rank, likelihood loss, co-training
HE Hai-jiang, LONG Yue-jin. Semi-supervised learning listwise ranking functions for document retrieval[J]. Journal of Computer Applications, 2011, 31(11): 3108-3111.
何海江 龙跃进. 适应文档检索的半监督多样本排序学习算法[J]. 计算机应用, 2011, 31(11): 3108-3111.
URL: https://www.joca.cn/EN/10.3724/SP.J.1087.2011.03108
https://www.joca.cn/EN/Y2011/V31/I11/3108