《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (2): 356-364.DOI: 10.11772/j.issn.1001-9081.2021122142

• 人工智能 • 上一篇    

融合常识库和语法特征的数学应用题题意理解

刘清堂, 马鑫倩(), 周洁, 吴林静, 周鹏霄   

  1. 华中师范大学 人工智能教育学部,武汉 430079
  • 收稿日期:2021-11-24 修回日期:2022-06-28 接受日期:2022-07-13 发布日期:2022-08-03 出版日期:2023-02-10
  • 通讯作者: 马鑫倩
  • 作者简介:刘清堂(1969—),男,湖北仙桃人,教授,博士,主要研究方向:数字化学习、版权保护、知识挖掘、知识服务
    周洁(1995—),女,湖南长沙人,硕士,主要研究方向:自然语言处理、数学自动解题
    吴林静(1987—),女,湖北松滋人,副教授,博士,主要研究方向:数据挖掘、人工智能及其教育应用
    周鹏霄(1993—),女,河南南阳人,硕士,主要研究方向:自然语言处理、数学自动解题。
  • 基金资助:
    国家自然科学基金资助项目(62277021);教育部人文社科规划基金资助项目(22YJAZH067);中央高校基本科研业务费专项(CCNU22JC011)

Understanding of math word problems integrating commonsense knowledge base and grammatical features

Qingtang LIU, Xinqian MA(), Jie ZHOU, Linjing WU, Pengxiao ZHOU   

  1. Faculty of Artificial Intelligence in Education,Central China Normal University,Wuhan Hubei 430079,China
  • Received:2021-11-24 Revised:2022-06-28 Accepted:2022-07-13 Online:2022-08-03 Published:2023-02-10
  • Contact: Xinqian MA
  • About author:LIU Qingtang, born in 1969, Ph. D., professor. His research interests include digital learning, copyright protection, knowledge mining, knowledge services.
    ZHOU Jie, born in 1995, M. S. Her research interests include natural language processing, automatic mathematical problem solving.
    WU Linjing, born in 1987, Ph. D., associate professor. Her research interests include data mining, artificial intelligence and its educational applications.
    ZHOU Pengxiao, born in 1993, M. S. Her research interests include natural language processing, automatic mathematical problem solving.
  • Supported by:
    National Natural Science Foundation of China(62277021);Humanities and Social Science Project of Ministry of Education(22YJAZH067);Fundamental Research Funds for Central Universities(CCNU22JC011)

摘要:

数学问题的题意理解是实现自动解题的关键,然而现有研究对情境复杂、参数较多等特征的应用题实现题意理解的准确率较低,尚没有很好的优化解决方案。基于此,以语境复杂的古典概型应用题为突破点,提出了融合常识库和语法特征的数学应用题题意理解方法。首先,结合古典概型应用题的文本和结构特征,构建了包含7类关键解题参数的古典概型题意表征模型;然后,根据该模型将应用题题意理解任务转化为解题参数识别问题,并设计了融合多维语法特征的条件随机场(CRF)题意参数识别方法来解决这个问题。进一步地,针对隐性参数识别问题设计了常识参数补全模块,并提出了融合常识库和语法特征的数学应用题题意理解方法。以新东方在线网站和21世纪教育在线题库中的948道古典概型应用题为实验语料进行实验。实验结果表明,所提方法的各题意参数识别F1平均值达到93.56%,高于最大熵模型(MaxEnt)、双向长短期记忆网络-条件随机场(BiLSTM-CRF)和传统CRF方法;并且题意理解准确率达到66.54%,显著高于上述其他方法,验证了所提方法对古典概型应用题题意理解的有效性。

关键词: 题意理解, 自动解题, 条件随机场, 古典概型应用题, 常识库, 语法特征

Abstract:

Understanding the meaning of mathematical problems is the key for automatic problem solving. However, the accuracy of understanding word problems with complex situations and many parameters is relatively low in previous studies, and the effective optimization solutions need to be further explored and studied. On this basis, a math word problem understanding method integrating commonsense knowledge base and grammatical features was proposed for the classical probability word problems with complex context. Firstly, a classical probability word problem representation model containing seven kinds of key problem-solving parameters was constructed according to text and structure characteristics of the classical probability word problems. Then, based on this model, the task of understanding of word problems was transformed into the problem of solving parameter identification, and a Conditional Random Field (CRF) parameter identification method integrating multi-dimensional grammatical features was presented to solve it. Furthermore, aiming at the problem of implicit parameter identification, a commonsense completion module was added, and an understanding method of math word problems integrating commonsense knowledge base and grammatical features was proposed. Experimental results show that the proposed method has the average F1-score of 93.56% for problem-solving parameter identification, and the accuracy of word problem understanding reached 66.54%, which are better than those of Maximum Entropy Model (MaxEnt), Bidirectional Long Short-Term Memory-Conditional Random Field (BiLSTM-CRF) and traditional CRF methods. It proves the effectiveness of this method in understanding of classical probability word problems.

Key words: understanding of meaning of problem, automatic problem solving, Conditional Random Field (CRF), classical probability word problem, commonsense knowledge base, grammatical feature

中图分类号: