Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (2): 406-415.DOI: 10.11772/j.issn.1001-9081.2025020174

• Artificial intelligence • Previous Articles    

Double decision mechanism-based deep symbolic regression algorithm

Zeyi GUO1, Fenglian LI1(), Lichun XU2   

  1. 1.College of Electronic Information Engineering,Taiyuan University of Technology,Taiyuan Shanxi 030600,China
    2.College of Physics and Optoelectronic Engineering,Taiyuan University of Technology,Taiyuan Shanxi 030600,China
  • Received:2025-02-25 Revised:2025-04-07 Accepted:2025-04-11 Online:2025-04-24 Published:2026-02-10
  • Contact: Fenglian LI
  • About author:GUO Zeyi, born in 1997, M. S. candidate. His research interests include symbolic regression, deep learning.
    LI Fenglian, born in 1972, Ph. D., professor. Her research interests include intelligent information processing theory and its application in industrial data analysis Email:lifenglian@tyut.edu.cn
    XU Lichun, born in 1985, Ph. D., associate professor. His research interests include computational physics, artificial intelligence.
  • Supported by:
    Special Project for Scientific and Technological Cooperation and Exchange in Shanxi Province(202304041101035)

基于双重决策机制的深度符号回归算法

郭泽一1, 李凤莲1(), 徐利春2   

  1. 1.太原理工大学 电子信息工程学院,太原 030600
    2.太原理工大学 物理与光电工程学院,太原 030600
  • 通讯作者: 李凤莲
  • 作者简介:郭泽一(1997—),男,山西临汾人,硕士研究生,主要研究方向:符号回归、深度学习
    李凤莲(1972—),女,山西运城人,教授,博士,CCF会员,主要研究方向:智能信息处理理论及其在工业数据分析中的应用 Email:lifenglian@tyut.edu.cn
    徐利春(1985—),男,四川内江人,副教授,博士,主要研究方向:计算物理、人工智能。
  • 基金资助:
    山西省科技合作交流专项(202304041101035)

Abstract:

Concerning the problem that the Deep Symbolic Regression (DSR) algorithm, which generates expression trees through Recurrent Neural Network (RNN) automatically, cannot ensure both accuracy and structural simplicity simultaneously, a Double decision mechanism-based DSR (DDSR) algorithm was proposed. Firstly, a dual scoring mechanism was employed to evaluate the accuracy and simplicity of the expression trees comprehensively on the basis of initial RNN decision. Then, reinforcement learning was used to train the expression trees, and Risk Proximal Policy Optimization (RPPO) algorithm was utilized to perform reward feedback, so as to update model parameters of the next batch. Experimental results on public datasets show that compared with DSR algorithm, DDSR algorithm achieves a maximum improvement of 0.396 and a minimum improvement of 0.001 in the coefficient related to fitness, with an average gain of 0.116. The above proves the effectiveness of DDSR algorithm.

Key words: symbolic regression, deep learning, scoring mechanism, proximal policy optimization algorithm, risk-seeking policy gradient

摘要:

深度符号回归(DSR)算法由循环神经网络(RNN)自动化生成表达式树,进而获得较高的模型性能,然而,它无法兼顾表达式树的准确性和结构的简洁性。因此,提出一种基于双重决策机制的深度符号回归(DDSR)算法。首先,在RNN初步决策的基础上,利用双评分机制综合评估表达式树的结构简洁性和准确性。其次,采用强化学习对表达式树生成进行训练,将表达式树生成视为序列决策过程,并利用风险近端策略优化(RPPO)算法进行奖励反馈以更新下一批次的模型参数。在公共数据集上的实验结果表明,相较于DSR算法,DDSR算法在拟合度相关系数上最多提高了0.396,最少提高了0.001,而整体性能提升了0.116。以上证明了DDSR算法的有效性。

关键词: 符号回归, 深度学习, 评分机制, 近端策略优化算法, 风险寻优策略梯度

CLC Number: