计算机应用 ›› 2010, Vol. 30 ›› Issue (06): 1671-1672.

• 软件过程技术与中文信息处理 • 上一篇    下一篇

基于改进的VSM的词义排歧策略

赵晨光1,蔡东风2   

  1. 1. 沈阳航空工业学院电子信息工程学院
    2. 蔡东风沈阳航空工业学院 自然语言处理实验室
  • 收稿日期:2009-12-03 修回日期:2010-02-01 发布日期:2010-06-01 出版日期:2010-06-01
  • 通讯作者: 赵晨光

Word sense disambiguation based on improved vector space model

  • Received:2009-12-03 Revised:2010-02-01 Online:2010-06-01 Published:2010-06-01

摘要: 为了提高词义排歧的准确率,提出了一种基于改进的向量空间模型(VSM)的词义排歧策略,该模型在提取特征向量的基础上,考虑了语法、词形、语义等因素,计算语境相似度,并引入搭配约束,改进了算法的效果,在开放测试环境下,词义标注正确率可达到80%以上。实验结果表明,该方法对语境信息的描述更加全面,有利于进一步的语义分析。

关键词: 向量空间模型, 词义排歧, 语境相似度

Abstract: To increase the word disambiguation accuracy, a word disambiguation solution based on improved Vector Space Model (VSM) was presented. Since the algorithm takes account of grammar, morphology and semantic and calculates the context similarity requiring the character vector abstraction, the algorithm is able to achieve better results by using collocation constraint. The open test precision can reach 80%. The result shows that the method can fully describe the features of context, and is beneficial to further semantic parsing.

Key words: Vector Space Model, Word disambiguation, Context Similarity