Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1424-1432.DOI: 10.11772/j.issn.1001-9081.2025050624

• Artificial intelligence • Previous Articles    

Word sense disambiguation method of modal verbs based on causal partial order diagram

Jilin FU1,2, Jianping YU1,2(), Tao ZHANG2,3, Weihua XU4, Enliang YAN5, Liyang WANG1,2   

  1. 1.School of Foreign Studies,Yanshan University,Qinhuangdao Hebei 066004,China
    2.Center of Computational Linguistics,Yanshan University,Qinhuangdao Hebei 066004,China
    3.School of Information Science and Engineering,Yanshan University,Qinhuangdao Hebei 066004,China
    4.College of Artificial Intelligence,Southwest University,Chongqing 400715,China
    5.School of Artificial Intelligence,South China Normal University,Guangzhou Guangdong 528225,China
  • Received:2025-06-06 Revised:2025-06-16 Accepted:2025-06-26 Online:2025-07-08 Published:2026-05-10
  • Contact: Jianping YU
  • About author:FU Jilin, born in 1974, M. S., professor. His research interests include computational linguistics, foreign linguistics, applied linguistics.
    ZHANG Tao, born in 1979, Ph. D., professor. His research interests include cognitive computing, knowledge discovery.
    XU Weihua, born in 1979, Ph. D., professor. His research interests include cognitive computing, machine learning, granular computing, data mining.
    YAN Enliang, born in 1989, Ph. D., associate research fellow. His research interests include conceptual cognitive learning, traditional Chinese medicine knowledge representation and reasoning.
    WANG Liyang, born in 1991, M. S. Her research interests include computational linguistics, word sense disambiguation and knowledge discovery.
  • Supported by:
    National Social Science Foundation of China(20BYY207)

基于因果偏序图的情态动词语义消歧方法

付继林1,2, 于建平1,2(), 张涛2,3, 徐伟华4, 闫恩亮5, 王丽洋1,2   

  1. 1.燕山大学 外国语学院,河北 秦皇岛 066004
    2.燕山大学 计算语言学研究中心,河北 秦皇岛 066004
    3.燕山大学 信息科学与工程学院,河北 秦皇岛 066004
    4.西南大学 人工智能学院,重庆 400715
    5.华南师范大学 人工智能学院,广州 528225
  • 通讯作者: 于建平
  • 作者简介:付继林(1974—),男,吉林镇赉人,教授,硕士,主要研究方向:计算语言学、外国语言学、应用语言学
    张涛(1979—),男,河北唐山人,教授,博士, CCF会员,主要研究方向:认知计算、知识发现
    徐伟华(1979—),男,山西浑源人,教授,博士生导师,博士,CCF会员,主要研究方向:认知计算、机器学习、粒计算、数据挖掘
    闫恩亮(1989—),男,河北邯郸人,副研究员,博士, CCF会员,主要研究方向:概念认知学习、中医知识表示与推理
    王丽洋(1991—),女,河北秦皇岛人,硕士,主要研究方向:计算语言学、语义消歧与知识发现。
  • 基金资助:
    国家社会科学基金资助项目(20BYY207)

Abstract:

The semantic analysis of modal verbs faces many challenges due to their inherent complexity, requiring the extraction of diverse feature sets for Word Sense Disambiguation (WSD), including semantic, syntactic, pragmatic, and genre features. These features vary in the contribution to semantic disambiguation, and some are confusing or redundant. To eliminate the influence of confusing and redundant features on WSD, a word sense disambiguation approach based on Causal Partial Order Diagram (CPOD) was proposed. Learning from the concept of intervention in causal reasoning, eliminating confusing factors through do-calculus, and combining with the idea of constructing an Attribute Partial Order Diagram (APOD), a CPOD was developed as a model for WSD of modal verbs. Results of WSD experiments on 15 English modal verbs showed that the proposed method achieved an average accuracy of 93.42%, eliminating confusing and redundant features. Furthermore, to quantify the specific contribution of each feature to WSD, each feature's contribution was calculated and ranked. It is found that semantic features contribute the most, followed by syntactic features, while genre features contribute relatively less. Specifically, in semantic features, features with low mutual information contribute much more to WSD than those with high mutual information, making them the most influential features.

Key words: Causal Partial Order Diagram (CPOD), modal verb, Word Sense Disambiguation (WSD), attribute feature, contribution

摘要:

情态动词的语义解析因内在的复杂性而面临诸多挑战,它要求在语义消歧(WSD)过程中提取多样化的特征集,包括语义特征、句法特征、语用特征以及体裁特征。这些特征对于语义消歧的贡献程度各异,还有一些特征为混淆和冗余特征。为了消除混淆和冗余特征对语义消歧的影响,提出一种基于因果偏序图(CPOD)的语义消歧方法。借鉴因果推理中的干预(intervention)概念,利用do-calculus消除混淆因素,再结合属性偏序图(APOD)的构建思路构建CPOD作为情态动词的语义消歧模型。对15个情态动词的语义消歧实验结果表明,所提方法的平均正确率达到了93.42%,消除了混淆和冗余特征。进一步地,为了量化有效特征对语义消歧的具体贡献,计算并排序了每个特征对语义消歧的贡献度,发现语义特征的贡献最显著,其次是句法特征,而体裁特征的贡献相对较小;特别地,在语义特征中,低值互信息特征对语义消歧的贡献远大于高值互信息特征,成为影响最大的特征。

关键词: 因果偏序图, 情态动词, 语义消歧, 属性特征, 贡献度

CLC Number: