计算机应用 ›› 2010, Vol. 30 ›› Issue (06): 1668-1670.
• 软件过程技术与中文信息处理 • 上一篇 下一篇
魏韡1,向阳2,陈千3
收稿日期:
修回日期:
发布日期:
出版日期:
通讯作者:
基金资助:
Received:
Revised:
Online:
Published:
摘要: 提出一种基于有向无环图和内在信息量的计算语义相似度的方法。首先计算出两个术语基于所在有向无环图的子图,再分别计算两个子图的交集和并集。用内在信息量方法计算出两个子图的交集和并集包含的节点的内在信息量,再计算出交集的节点内在信息量之和以及并集的节点内在信息量之和,将两者的比值作为两个术语的语义相似度。实验结果表明,该方法具有较高的准确度。
关键词: 语义相似度, 内在信息量, 有向无环图
Abstract: Measuring semantic similarities of terms is a key issue in many research fields. This paper proposed a method based on the Directed Acyclic Graphs (DAG) of terms and the intrinsic information content of terms to measure the semantic similarities of terms. It first calculated the sub-graphs of two terms based on the directed acyclic graph, and then calculated the intersection and union of the sub-graphs. The semantic similarity of two terms is the ratio of the total intrinsic information content of terms in the intersection to the total intrinsic information content of terms in the union. The experimental results show that the method has a higher degree of accuracy.
Key words: Semantic similarity, intrinsic information content, DAG
魏韡 向阳 陈千. 计算术语间语义相似度的混合方法[J]. 计算机应用, 2010, 30(06): 1668-1670.
0 / 推荐
导出引用管理器 EndNote|Ris|BibTeX
链接本文: http://www.joca.cn/CN/
http://www.joca.cn/CN/Y2010/V30/I06/1668