Skip to main navigation Skip to search Skip to main content

Pursuit-Evasion Game of Multiple Unmannd Surface Vessels in Partially Observable Environments

  • School of Information Science and Engineering, Harbin Institute of Technology Weihai
  • Ministry of Industry and Information Technology
  • School of Astronautics, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The pursuit-evasion game for unmanned surface vessels (USVs) is a topic of great interest in the field of ocean engineering. Most of the previous studies only focused on ideal scenarios with fully observable environments, which are beyond the practical applications. In this paper, we present a reinforcement learning (RL) model for the real marine environment including signal shielding regions, depots, and obstacles. Suppose in the signal shielding regions, the vessels' position information is unknown to each other except for the leader pursuer, and the new environment model is partially observable to ordinary USVs. Then, a distributed multi-agent RL algorithm is proposed and a leader-follower implicit communication mechanism to train the pursuer's strategy is introduced. Weighting the distance, number of captures, and other relevant parameters, a new reward function of the pursuit-evasion game is designed to optimize the pursuit strategy continuously. As shown in the simulation results, the ordinary pursuers can predict the movement direction and capture points of the evader with high accuracy, even when it is in the signal shielding area. This is due to the implicit communication with the leader pursuer in the partially observable environment, which can also effectively optimize the pursuit strategy in the new environment.

Original languageEnglish
Title of host publicationProceedings - 2023 China Automation Congress, CAC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages8243-8248
Number of pages6
ISBN (Electronic)9798350303759
DOIs
StatePublished - 2023
Externally publishedYes
Event2023 China Automation Congress, CAC 2023 - Chongqing, China
Duration: 17 Nov 202319 Nov 2023

Publication series

NameProceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference2023 China Automation Congress, CAC 2023
Country/TerritoryChina
CityChongqing
Period17/11/2319/11/23

Keywords

  • USVs
  • deep reinforcement learning
  • implicit communication
  • partially observable environment
  • pursuit-evasion game

Fingerprint

Dive into the research topics of 'Pursuit-Evasion Game of Multiple Unmannd Surface Vessels in Partially Observable Environments'. Together they form a unique fingerprint.

Cite this