Journal of Computer Applications ›› 2005, Vol. 25 ›› Issue (05): 1036-1038. DOI: 10.3724/SP.J.1087.2005.1036

• Artificial Intelligence and Simulation •

Application of text clustering in automatic abstracting

GUO Qing-lin1,2, FAN Xiao-zhong2, LIU Chang-an1

  1. Department of Computer Science, North China Electric Power University, Beijing 102206, China; 2. Department of Computer Science and Engineering, Beijing Institute of Technology, Beijing 100081, China
  • Online: 2005-05-25 Published: 2005-05-01
  • Supported by:

    National Natural Science Foundation of China (60305009)

Abstract: To address the shortcomings of current automatic abstracting methods, a method based on text clustering is proposed. Introducing text clustering into automatic abstracting makes multi-document abstracting possible. TCAAS, an automatic abstracting system based on text clustering, was implemented for the plastics industry. Its precision and recall are both above 80% for single-document abstracting and above 75% for multi-document abstracting. Experiments show that the method is feasible, that it offers a useful reference for the design of automatic abstracting systems, and that it merits further study.

Key words: automatic abstracting, text clustering, multi-document
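
The abstract does not say which clustering algorithm or sentence-selection rule TCAAS uses, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes sentences pooled from one or more documents, plain word counts instead of real term weighting, single-pass (leader) clustering with a cosine-similarity threshold, and one representative sentence per cluster. The function names, the `threshold` value, and the sample sentences are all hypothetical.

```python
import math
import re
from collections import Counter


def bag_of_words(sentence):
    """Lowercased word counts; a stand-in for real term weighting."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_sentences(sentences, threshold=0.3):
    """Single-pass clustering: join the most similar cluster or start a new one."""
    clusters = []  # each cluster: {"centroid": Counter, "members": [indices]}
    for idx, sentence in enumerate(sentences):
        vec = bag_of_words(sentence)
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append({"centroid": Counter(vec), "members": [idx]})
        else:
            # Summed counts point in the same direction as the mean vector.
            best["centroid"].update(vec)
            best["members"].append(idx)
    return clusters


def summarize(sentences, threshold=0.3):
    """Pick, per cluster, the member closest to the centroid; keep input order."""
    chosen = []
    for cluster in cluster_sentences(sentences, threshold):
        representative = max(
            cluster["members"],
            key=lambda i: cosine(bag_of_words(sentences[i]), cluster["centroid"]),
        )
        chosen.append(representative)
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    # Hypothetical sentences pooled from several documents.
    pooled = [
        "Polyethylene prices rose sharply this quarter.",
        "Prices of polyethylene rose again this quarter.",
        "A new injection molding plant opened last month.",
    ]
    for line in summarize(pooled):
        print(line)
```

Pooling sentences from several documents before clustering is what lets the same routine serve as a multi-document sketch: near-duplicate sentences from different sources fall into one cluster and contribute a single sentence to the abstract.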

CLC Number: