Journal of Computer Applications

• Network and communications • Previous Articles     Next Articles

Research and application of vertical search engine in networked manufacturing resource〖JP〗

<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a> <a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=((([Author]) AND 1[Journal]) AND year[Order])" target="_blank"></a>   

  • Received:2006-11-27 Revised:2007-02-12 Online:2007-05-01 Published:2007-05-01

网络化制造资源垂直搜索引擎的研究与应用

程锦 张建   

  1. 贵州大学CAD/CIMS工程技术中心
  • 通讯作者: 程锦

Abstract: This paper put emphasis on the technologies of the system, including the topic crawler and the Chinese word segmentation. To improve the efficiency of the crawler, a model of page evaluation was added into the crawler module; therefore the urls in a page with a high similarity of the topic will be first crawled. Besides, an improved word matching algorithm was proposed to enhance the speed and precision of word segmentation.

Key words: network manufacturing, manufacturing resource, vertical search engine, Html Parser, Lucene

摘要: 着重研究了网络化制造资源垂直搜索系统的主题爬虫和中文分词技术。通过在主题爬虫中增加评价网页模块,优先爬行与主题相似度高的网页中的链接,提高了爬虫的工作效率。在对中文分词词典进行分层存储的基础上,通过一种改进的简洁的中文分词词典匹配算法,有效地改善了分词的速度与精度,并缩减了索引库,增强了用户的响应。

关键词: 网络化制造, 制造资源, 垂直搜索, 页面解析, 中文分词, Lucene