Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (8): 2626-2633. DOI: 10.11772/j.issn.1001-9081.2023081120

• Frontier and comprehensive applications •

Multi-robot path following and formation based on deep reinforcement learning

Haodong HE1, Hao FU1,2, Qiang WANG1, Shuai ZHOU1, Wei LIU1

  1. School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan Hubei 430081, China
  2. Hubei Key Laboratory of Digital Textile Equipment, Wuhan Hubei 430200, China
  • Received:2023-08-22 Revised:2023-11-16 Accepted:2023-11-24 Online:2023-12-18 Published:2024-08-10
  • Contact: Hao FU
  • About author:HE Haodong, born in 1997, M. S. candidate. His research interests include multi-robot intelligent control, reinforcement learning.
    WANG Qiang, born in 1995, M. S. candidate. His research interests include multi-robot intelligent control, artificial intelligence.
    ZHOU Shuai, born in 2000, M. S. candidate. His research interests include offline reinforcement learning, intelligent robots.
    LIU Wei, born in 1998, M. S. candidate. His research interests include multi-robot intelligent control.
  • Supported by:
    National Natural Science Foundation of China (62173262); Scientific Research Project of Education Department of Hubei Province (B2021020); Knowledge Innovation Special Project of Wuhan (2022010801020315); Hubei Key Laboratory of Digital Textile Equipment (KDTL2022002); Hubei Provincial Advantaged Characteristic Disciplines (Groups) Project of Wuhan University of Science and Technology (2023D031)


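  • About author: HE Haodong (born 1997), male, from Bazhong, Sichuan, M. S. candidate. His main research interests are multi-robot intelligent control and reinforcement learning.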


To address obstacle avoidance and trajectory smoothness in multi-robot path following and formation control in crowd environments, a multi-robot path following and formation algorithm based on deep reinforcement learning was proposed. Firstly, a pedestrian danger priority mechanism was established and combined with reinforcement learning to design a danger awareness network, enhancing the safety of the multi-robot formation. Subsequently, a virtual robot was introduced as the reference target for the robots, transforming path following into tracking control of the virtual robot and thereby improving the smoothness of the robot trajectories. Finally, the proposed algorithm was compared with existing algorithms through quantitative and qualitative analysis in simulation experiments. The experimental results show that, compared with existing point-to-point path following algorithms, the proposed algorithm achieves excellent obstacle avoidance performance in crowd environments while ensuring smooth multi-robot motion trajectories.
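The virtual-robot idea can be illustrated with a purely geometric sketch. It is not the authors' learned controller, and the abstract does not give the formulation; the sketch only shows one plausible way to turn a waypoint path into a continuously moving reference pose plus per-robot formation targets. The function names `virtual_robot_pose` and `formation_targets`, the polyline path representation, and the body-frame offsets are all assumptions made for illustration.

```python
import math

def virtual_robot_pose(waypoints, s):
    """Pose (x, y, heading) of a hypothetical virtual robot that has
    travelled arc length s along a polyline path (assumed representation)."""
    for (x0, y0), (x1, y1) in zip(waypoints, waypoints[1:]):
        seg = math.hypot(x1 - x0, y1 - y0)
        if seg == 0.0:
            continue  # skip duplicated waypoints
        if s <= seg:
            t = s / seg  # fraction of the current segment already covered
            return (x0 + t * (x1 - x0),
                    y0 + t * (y1 - y0),
                    math.atan2(y1 - y0, x1 - x0))
        s -= seg
    # Past the end of the path: stay at the last waypoint,
    # keeping the heading of the final segment.
    (x0, y0), (x1, y1) = waypoints[-2], waypoints[-1]
    return (x1, y1, math.atan2(y1 - y0, x1 - x0))

def formation_targets(pose, offsets):
    """Rotate body-frame offsets (dx forward, dy left) by the virtual
    robot's heading to get a world-frame target point for each robot."""
    x, y, th = pose
    c, s = math.cos(th), math.sin(th)
    return [(x + c * dx - s * dy, y + s * dx + c * dy) for dx, dy in offsets]

# Usage: two followers placed one unit behind the virtual robot, left and right.
path = [(0.0, 0.0), (10.0, 0.0)]
pose = virtual_robot_pose(path, 4.0)
targets = formation_targets(pose, [(-1.0, 1.0), (-1.0, -1.0)])
```

Because the reference pose advances continuously with `s` instead of jumping between discrete goal points, each robot always tracks a nearby, smoothly moving target, which is the intuition behind preferring virtual-robot tracking over point-to-point following.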

Key words: multi-robot, path-following, formation obstacle-avoiding, reinforcement learning, crowd environment



