About the authors: ZHANG ZiHeng, born in 2002, M.S. candidate. His research interests include reinforcement learning.
QIN Jin, born in 1978, Ph.D., associate professor. His research interests include computational intelligence and reinforcement learning.
Supported by:
National Natural Science Foundation of China (62162007); Scientific and Technological Projects in Guizhou (KJZY [2025]020)