Graph to equation tree model based on expression layer-by-layer aggregation and dynamic selection

doi:10.11772/j.issn.1001-9081.2022071054

Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (8): 2390-2395.DOI: 10.11772/j.issn.1001-9081.2022071054

• Artificial intelligence • Previous Articles

Graph to equation tree model based on expression layer-by-layer aggregation and dynamic selection

Bin LIU(), Qian ZHANG, Yaqin WEI, Xueying CUI, Hongying ZHI

School of Applied Science，Taiyuan University of Science and Technology，Taiyuan Shanxi 030024，China

Received:2022-07-21 Revised:2022-11-03 Accepted:2022-11-07 Online:2023-01-15 Published:2023-08-10
Contact: Bin LIU
About author:ZHANG Qian， born in 1998， M. S. candidate. Her research interests include natural language processing.
WEI Yaqin， born in 1998， M. S. candidate. Her research interests include natural language processing.
CUI Xueying， born in 1978， Ph. D.， associate professor. Her research interests include image processing， deep learning.
ZHI Hongying， born in 1980， Ph. D.， associate professor. Her research interests include Bayesian statistics.
Supported by:
National Natural Science Foundation of China(11701406);Fundamental Research Program of Shanxi Province(202103021224274);Research Project Supported by Shanxi Scholarship Council of China(2022-163);Social and Economic Statistical Research Project in Shanxi Province(KY［2022］73);Doctoral Research Starting Fund of Taiyuan University of Science and Technology(20212019)

基于表达式的逐层聚合和动态选择的图到方程树模型

刘斌(), 张倩, 魏亚琴, 崔学英, 智红英

太原科技大学应用科学学院，太原 030024

通讯作者: 刘斌
作者简介:张倩（1998—），女，山西朔州人，硕士研究生，主要研究方向：自然语言处理
魏亚琴（1998—），女，山西晋中人，硕士研究生，主要研究方向：自然语言处理
崔学英（1978—），女，山西临汾人，副教授，博士，主要研究方向：图像处理、深度学习
智红英（1980—），女，山西太谷人，副教授，博士，主要研究方向：贝叶斯统计。
基金资助:
国家自然科学基金资助项目(11701406);山西省基础研究计划项目(202103021224274);山西省省筹资金资助回国留学人员科研项目(2022?163);山西省社会经济统计科研课题(KY［2022］73);太原科技大学博士科研启动基金资助项目(20212019)

Abstract

Abstract:

Existing tree decoder is only suitable for solving single variable problems， but has no good effect of solving multivariate problems. At the same time， most mathematical solvers select truth expression wrongly， which leads to learning deviation occurred in training. Aiming at the above problems， a Graph to Equation Tree （GET） model based on expression level-by-level aggregation and dynamic selection was proposed. Firstly， text semantics was learned through the graph encoder. Then， subexpressions were obtained by aggregating quantities and unknown variables iteratively from bottom of the equation tree layer by layer. Finally， combined with the longest prefix of output expression， truth expression was selected dynamically to minimize the deviation. Experimental results show that the precision of proposed model reaches 83.10% on Math23K dataset， which is 5.70 percentage points higher than that of Graph to Tree （Graph2Tree） model. Therefore， the proposed model can be applied to solution of complex multivariate mathematical problems， and can reduce influence of learning deviation on experimental results.

Key words: layer-by-layer aggregation, dynamic selection, Graph to Equation Tree (GET), multivariate mathematical problem

摘要：

现有树解码器仅适合求解单变量问题而求解多元问题的效果欠佳，而大多数数学求解器对真值表达式的错误选择导致训练出现学习偏差。针对上述问题，提出基于表达式的逐层聚合和动态选择的图到方程树（GET）模型。首先，通过图编码器学习文本语义；其次，从方程树的底层开始逐层迭代地聚合数量和未知变量以得到子表达式；最后，结合输出表达式的最长前缀动态地选择真值表达式以实现偏差最小化。实验结果表明，所提模型在Math23K数据集上的精度达到83.10%，相较于图到树（Graph2Tree）模型提升了5.70个百分点。可见，所提模型适用于复杂多元数学问题的求解，并能降低学习偏差对实验结果的影响。

关键词: 逐层聚合, 动态选择, 图到方程树, 多元数学问题

CLC Number:

TP391.1

Bin LIU, Qian ZHANG, Yaqin WEI, Xueying CUI, Hongying ZHI. Graph to equation tree model based on expression layer-by-layer aggregation and dynamic selection[J]. Journal of Computer Applications, 2023, 43(8): 2390-2395.

刘斌, 张倩, 魏亚琴, 崔学英, 智红英. 基于表达式的逐层聚合和动态选择的图到方程树模型[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2390-2395.

Figures/Tables 8

Fig. 1 Example of math word problem

Fig. 2 Graph to equation tree model

Fig. 3 Dependency resolution trees for different math word problems

Fig. 4 Example of equivalent expression generation

Tab. 1 Precision comparison of different models

模型	不同数据集中的精度
模型	Math23K	Alg514	HMWP
StackDecoder	66.00	28.86	27.40
GTS	75.60	52.14	41.50
Graph2Tree	77.40	56.72	43.26
GET	83.10	60.39	50.85

Fig.5 Influence of length of expression on precision

Tab. 2 Ablation experimental results of different components of GET model on Math23K dataset

模型	精度	模型	精度
GET	83.10	GET w/o多头注意力	81.86
GET w/o解析图	82.20	GET w/o等价方程树	81.43
GET w/o数量图	82.49

Fig.6 Typical examples

References 17

1	WU Q Z， ZHANG Q， WEI Z Y， et al. Math word problem solving with explicit numerical values［C］// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing. Stroudsburg， PA： ACL， 2021： 5859-5869. 10.18653/v1/2021.acl-long.455
2	WU Q Z， ZHANG Q， FU J L， et al. A knowledge-aware sequence-to-tree network for math word problem solving［C］// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Stroudsburg， PA： ACL， 2020： 7137-7146. 10.18653/v1/2020.emnlp-main.579
3	ZHANG J P， WANG L， LEE R K W， et al. Graph-to-tree learning for solving math word problems［C］// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg， PA： ACL， 2020： 3928-3937. 10.18653/v1/2020.acl-main.362
4	WANG Y， LIU X J， SHI S M. Deep neural solver for math word problems［C］// Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg， PA： ACL， 2017： 845-854. 10.18653/v1/d17-1088
5	HUANG D Q， LIU J， LIN C Y， et al. Neural math word problem solver with reinforcement learning［C］// Proceedings of the 27th International Conference on Computational Linguistics. Stroudsburg， PA： ACL， 2018： 213-223.
6	WANG L， WANG Y， CAI D， et al. Translating a math word problem to an expression tree［C］// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg， PA： ACL， 2018： 1064-1069. 10.18653/v1/d18-1132
7	XIE Z P， SUN S C. A goal-driven tree-structured neural model for math word problems［C］// Proceedings of the 28th International Joint Conference on Artificial Intelligence. California： ijcai.org， 2019： 5299-5305. 10.24963/ijcai.2019/736
8	王伟，赵尔平，崔志远，等. 基于HowNet义原和Word2vec词向量表示的多特征融合消歧方法［J］. 计算机应用， 2021， 41（8）： 2193-2198. 10.11772/j.issn.1001-9081.2020101625
	WANG W， ZHAO E P， CUI Z Y， et al. Disambiguation method of multi-feature fusion based on HowNet sememe and Word2vec word embedding representation［J］. Journal of Computer Applications， 2021， 41（8）： 2193-2198. 10.11772/j.issn.1001-9081.2020101625
9	张继杰，杨艳，刘勇. 利用初始残差和解耦操作的自适应深层图卷积［J］. 计算机应用， 2022， 42（1）： 9-15. 10.11772/j.issn.1001-9081.2021071289
	ZHANG J J， YANG Y， LIU Y. Adaptive deep graph convolution using initial residual and decoupling operations［J］. Journal of Computer Applications， 2022， 42（1）： 9-15. 10.11772/j.issn.1001-9081.2021071289
10	QIN J H， LIANG X F， HONG Y N， et al. Neural-symbolic solver for math word problems with auxiliary tasks［C］// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing （Volume 1： Long Papers）. Stroudsburg， PA： ACL， 2021：5870-5881. 10.18653/v1/2021.acl-long.456
11	LIN X， HUANG Z Y， ZHAO H K， et al. HMS： a hierarchical solver with dependency-enhanced understanding for math word problem［C］// Proceedings of the 35th AAAI Conference on Artificial Intelligence. Palo Alto， CA： AAAI Press， 2021： 4232-4240. 10.1609/aaai.v35i5.16547
12	VASWANI A， SHAZEER N， PARMAR N， et al. Attention is all you need［C］// Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook， NY： Curran Associates Inc.， 2017：6000-6010.
13	LI J R， WANG L， ZHANG J P， et al. Modeling intra-relation in math word problems with different functional multi-head attentions［C］// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg， PA： ACL， 2019： 6162-6167. 10.18653/v1/p19-1619
14	LAN Y H， WANG L， ZHANG Q Y， et al. MWPToolkit： an open-source framework for deep learning-based math word problem solvers［C］// Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto， CA： AAAI Press， 2022： 13188-13190. 10.1609/aaai.v36i11.21723
15	CHIANG T R， CHEN Y N. Semantically-aligned equation generation for solving and reasoning math word problems［C］// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies， Volume 1 （Long and Short Papers）. Stroudsburg， PA： ACL， 2019： 2656-2668. 10.18653/v1/n19-1272
16	PATEL A， BHATTAMISHRA S， GOYAL N. Are NLP models really able to solve simple math word problems［C］// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies. Stroudsburg， PA： ACL， 2021：2080-2094. 10.18653/v1/2021.naacl-main.168
17	HUANG S F， WANG J W， XU J， et al. Recall and learn： a memory-augmented solver for math word problems［C］// Findings of the Association for Computational Linguistics： EMNLP 2021. Stroudsburg， PA： ACL， 2021： 786-796. 10.18653/v1/2021.findings-emnlp.68

Graph to equation tree model based on expression layer-by-layer aggregation and dynamic selection

基于表达式的逐层聚合和动态选择的图到方程树模型

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 8

References 17

Related Articles 1

Recommended Articles

Metrics