TY - GEN
T1 - Solving a Multi-robot Search Problem with Bionic Sarsa Algorithm and Artificial Potential Field
AU - Liu, Haichao
AU - Qu, Zhenshen
AU - Zhu, Runwen
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - Safe and effective path planning of multiple combat vehicles engaged in antagonistic environments keeps a challenging problem. Based on the application background of multi-robots in cooperative reconnaissance of enemy camps in environment with traps, this paper studies the multi-agent path planning based on bionic algorithms and artificial potential field method. The proposed bionic PP-AP Sarsa Scheme is inspired by food-finding scheme of Physarum Polycephalum (PP), which can effectively solve the dimensional explosion problem of traditional multi-agent reinforcement learning methods. This paper first studies the single-agent bionic planning problem with the PP algorithm to initialize the Q table used in Sarsa-based reinforcement learning, which effectively reduces the search space and accelerates the convergence speed of the early stage of reinforcement learning. After the Q tables in the same map are obtained through the training of different single agents, the Q tables of every agents are extended to multi-agents scenario by the assistance of simplified artificial potential field, hence a composite parallel path planner named RL-APCP3 is constructed to synchronously update the actions of all of the agents, which allows us to complete the coordinated and efficient search of enemy camps by multiple agents. Compared with the Sarsa path planning algorithm of single agent, the efficiency of this scheme is improved up to 55.22%.
AB - Safe and effective path planning of multiple combat vehicles engaged in antagonistic environments keeps a challenging problem. Based on the application background of multi-robots in cooperative reconnaissance of enemy camps in environment with traps, this paper studies the multi-agent path planning based on bionic algorithms and artificial potential field method. The proposed bionic PP-AP Sarsa Scheme is inspired by food-finding scheme of Physarum Polycephalum (PP), which can effectively solve the dimensional explosion problem of traditional multi-agent reinforcement learning methods. This paper first studies the single-agent bionic planning problem with the PP algorithm to initialize the Q table used in Sarsa-based reinforcement learning, which effectively reduces the search space and accelerates the convergence speed of the early stage of reinforcement learning. After the Q tables in the same map are obtained through the training of different single agents, the Q tables of every agents are extended to multi-agents scenario by the assistance of simplified artificial potential field, hence a composite parallel path planner named RL-APCP3 is constructed to synchronously update the actions of all of the agents, which allows us to complete the coordinated and efficient search of enemy camps by multiple agents. Compared with the Sarsa path planning algorithm of single agent, the efficiency of this scheme is improved up to 55.22%.
KW - Artificial Potential Field
KW - Multi-agent Path Planning
KW - Physarum polycephalum
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/85128014869
U2 - 10.1109/CAC53003.2021.9728613
DO - 10.1109/CAC53003.2021.9728613
M3 - 会议稿件
AN - SCOPUS:85128014869
T3 - Proceeding - 2021 China Automation Congress, CAC 2021
SP - 1830
EP - 1835
BT - Proceeding - 2021 China Automation Congress, CAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 China Automation Congress, CAC 2021
Y2 - 22 October 2021 through 24 October 2021
ER -