Skip to main navigation Skip to search Skip to main content

Graph-based multi-agent reinforcement learning for collaborative search and tracking of multiple UAVs

  • Bocheng ZHAO
  • , Mingying HUO*
  • , Zheng LI
  • , Wenyu FENG
  • , Ze YU
  • , Naiming QI
  • , Shaohai WANG
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Tianjin Lingyi Intelligent Technology Co. Ltd.

Research output: Contribution to journalArticlepeer-review

Abstract

This paper investigates the challenges associated with Unmanned Aerial Vehicle (UAV) collaborative search and target tracking in dynamic and unknown environments characterized by limited field of view. The primary objective is to explore the unknown environments to locate and track targets effectively. To address this problem, we propose a novel Multi-Agent Reinforcement Learning (MARL) method based on Graph Neural Network (GNN). Firstly, a method is introduced for encoding continuous-space multi-UAV problem data into spatial graphs which establish essential relationships among agents, obstacles, and targets. Secondly, a Graph AttenTion network (GAT) model is presented, which focuses exclusively on adjacent nodes, learns attention weights adaptively and allows agents to better process information in dynamic environments. Reward functions are specifically designed to tackle exploration challenges in environments with sparse rewards. By introducing a framework that integrates centralized training and distributed execution, the advancement of models is facilitated. Simulation results show that the proposed method outperforms the existing MARL method in search rate and tracking performance with less collisions. The experiments show that the proposed method can be extended to applications with a larger number of agents, which provides a potential solution to the challenging problem of multi-UAV autonomous tracking in dynamic unknown environments.

Original languageEnglish
Article number103214
JournalChinese Journal of Aeronautics
Volume38
Issue number3
DOIs
StatePublished - Mar 2025

Keywords

  • Dynamic and unknown environment
  • Graph attention network (GAT)
  • Multi-agent reinforcement learning (MARL)
  • Tracking
  • Unmanned aerial vehicle (UAV)

Fingerprint

Dive into the research topics of 'Graph-based multi-agent reinforcement learning for collaborative search and tracking of multiple UAVs'. Together they form a unique fingerprint.

Cite this