计算机应用

• 数据挖掘 • 上一篇    下一篇

基于文档与搜索结果上下文的查询扩展方法

蒋辉 阳小华   

  1. 南华大学计算机学院
  • 收稿日期:2008-09-26 修回日期:2008-11-24 发布日期:2009-03-01 出版日期:2009-03-01

Query expansion based on context of document and search result

<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=(((Jiang Hui[Author]) AND 1[Journal]) AND year[Order])" target="_blank">Jiang Hui</a>   

  • Received:2008-09-26 Revised:2008-11-24 Online:2009-03-01 Published:2009-03-01

摘要: 在查询扩展方法中,如果通过查询结果中关键词的上下文来计算候选关键词的权重,将权重大的词作为查询扩展词,其候选关键词来源于文档中关键词的上下文,这种方法存在主题漂移的问题。为了解决这个问题,提出一种将初始查询结果过滤,只选择与源文档语境相似的搜索结果,来帮助选择查询扩展词的方法。实验结果表明该方法能获得更合适的查询扩展词。

关键词: 信息检索, 查询扩展, 上下文

Abstract: When editing a word processing document, we may search the Web by using a term in the document as an initial query and then modifying the query by adding keywords extracted from the text surrounding the search term. There are query expansion methods which use the text surrounding the search term in the initial result to weight candidate keywords in the source document to modify the query. However, this approach may lead to topic drift. To solve the problem, the initial results were filtered first and only the results containing similar contexts as that in the source document were selected to help choosing additional keywords. Experiments show that this method can get more appropriate additional keywords than other methods.

Key words: information retrieval, query expansion, context

中图分类号: