计算机应用 ›› 2005, Vol. 25 ›› Issue (02): 305-308.DOI: 10.3724/SP.J.1087.2005.0305

• 数据库与数据挖掘 • 上一篇    下一篇

搜索引擎结果的重排序方法

杨广翔1,俞宁1,谌莉2   

  1. 1.武汉大学计算机学院; 2.武汉大学教育科学学院
  • 出版日期:2005-02-01 发布日期:2005-02-01

Rerank method of rearch engine

YANG Guang-xiang1, YU Ning1, SHEN Li2   

  1. 1.College of Computer Science, Wuhan University, Wuhan Hubei 430079, China; 2.School of Education Science, Wuhan University, Wuhan Hubei 430079, China
  • Online:2005-02-01 Published:2005-02-01

摘要:

当前Web搜索引擎返回的搜索结果一般是按“超链分析”进行排序的。采用词频统计、词分布特征量等方法对Web搜索引擎的搜索结果的关键词相关度进行计算,并重新对搜索结果排序,可以使得搜索结果中有关的页面文集更加集中。从而方便了信息的使用,特别是在对于特定内容的信息搜索时。

关键词: 词频统计, 搜索引擎, 词分布, 排序

Abstract:

The result that current web search engineer returned were ranked mainly by their hyperlink analyse, not their content. To take the search results as an order collection, we used item frenqency statistic and calculated item position in every page by certain formula, by which we calculated each pages relativity and re-ranked the collection. The experiment results show that the pages which meet the users needs were concentrated ahead. In this way, The precision was enhanced. It can help user find information rapidly.

Key words: term frenqency ferquera, search engine, item position, rank

中图分类号: