TY - GEN
T1 - A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn
AU - Xiong, Hao
AU - Cao, Huanhui
AU - Lu, Wenjie
N1 - Publisher Copyright:
© 2022 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2022
Y1 - 2022
N2 - Pursuit-evasion games are ubiquitous in nature and in an artificial world. In nature, pursuer(s) and evader(s) are intelligent agents that can learn from experience, and dynamics (i.e., Newtonian or Lagrangian) are vital for the pursuer and the evader in some scenarios. To this end, this paper addresses the pursuit-evasion game of intelligent agents from the perspective of dynamics. A bio-inspired dynamics formulation of a pursuit-evasion game and baseline pursuit and evasion strategies are introduced at first. Then, reinforcement learning techniques are used to mimic the ability of intelligent agents to learn from experience. Based on the dynamics formulation and reinforcement learning techniques, the effects of improving both pursuit and evasion strategies based on experience in pursuit-evasion games are investigated at two levels: 1) individual runs and 2) ranges of the parameters of pursuit-evasion games. The results of the investigation are consistent with nature observations and the natural law - survival of the fittest. More importantly, with respect to the result of a pursuit-evasion game of agents with baseline strategies, this study achieves a different result. It is shown that, in a pursuit-evasion game with a dynamics formulation, an evader is not able to escape from a slightly faster pursuer with an effective learned pursuit strategy, based on agile maneuvers and an effective learned evasion strategy.
AB - Pursuit-evasion games are ubiquitous in nature and in an artificial world. In nature, pursuer(s) and evader(s) are intelligent agents that can learn from experience, and dynamics (i.e., Newtonian or Lagrangian) are vital for the pursuer and the evader in some scenarios. To this end, this paper addresses the pursuit-evasion game of intelligent agents from the perspective of dynamics. A bio-inspired dynamics formulation of a pursuit-evasion game and baseline pursuit and evasion strategies are introduced at first. Then, reinforcement learning techniques are used to mimic the ability of intelligent agents to learn from experience. Based on the dynamics formulation and reinforcement learning techniques, the effects of improving both pursuit and evasion strategies based on experience in pursuit-evasion games are investigated at two levels: 1) individual runs and 2) ranges of the parameters of pursuit-evasion games. The results of the investigation are consistent with nature observations and the natural law - survival of the fittest. More importantly, with respect to the result of a pursuit-evasion game of agents with baseline strategies, this study achieves a different result. It is shown that, in a pursuit-evasion game with a dynamics formulation, an evader is not able to escape from a slightly faster pursuer with an effective learned pursuit strategy, based on agile maneuvers and an effective learned evasion strategy.
KW - dynamics
KW - pursuit-evasion games
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/85140491260
U2 - 10.23919/CCC55666.2022.9901529
DO - 10.23919/CCC55666.2022.9901529
M3 - 会议稿件
AN - SCOPUS:85140491260
T3 - Chinese Control Conference, CCC
SP - 7082
EP - 7087
BT - Proceedings of the 41st Chinese Control Conference, CCC 2022
A2 - Li, Zhijun
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 41st Chinese Control Conference, CCC 2022
Y2 - 25 July 2022 through 27 July 2022
ER -