基于奖励高速路网络的多智能体强化学习中的全局信用分配算法
姚兴虎, 谭晓阳
Reward highway network based global credit assignment algorithm in multi-agent reinforcement learning
YAO Xinghu, TAN Xiaoyang
计算机应用 . 2021, (1): 1 -7 .  DOI: 10.11772/j.issn.1001-9081.2020061009