Skip to main navigation Skip to search Skip to main content

Flocking control of UAV swarms with deep reinforcement leaming approach

  • Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The flocking control of UAV swarms has been studied extensively due to its wide applications. In this paper, the UAV flocking control problem is formulated as a Partial Observable Markov Decision Process (POMDP) where the constraints of the UAV's communication and perception ranges are considered. A deep reinforcement learning approach is proposed to solve this problem with centralized training and decentralized execution manner. The experience collected by all UAVs is used to train the shared flocking control policy, and each UAV performs actions based on the local environment information it observes. To enable the UAV swarm to maintain a flock and navigate in an environment with dense obstacles, a reward function is constructed considering with goal reaching, obstacles avoidance and flocking maintenance. Especially, the flocking maintenance reward is designed with the global information of the UAV swarm, which can only be obtained during the training phase. Simulation results demonstrate that the policy trained with the flocking maintenance reward can make the UAV swarm keep a flock when encountering obstacles and has good generalization ability with different number of UAVs.

Original languageEnglish
Title of host publicationProceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages592-599
Number of pages8
ISBN (Electronic)9781728180250
DOIs
StatePublished - 27 Nov 2020
Event3rd International Conference on Unmanned Systems, ICUS 2020 - Harbin, China
Duration: 27 Nov 202028 Nov 2020

Publication series

NameProceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

Conference

Conference3rd International Conference on Unmanned Systems, ICUS 2020
Country/TerritoryChina
CityHarbin
Period27/11/2028/11/20

Keywords

  • Deep reinforcement learning
  • Flocking control
  • Obstacles avoidance
  • UAV swarms

Fingerprint

Dive into the research topics of 'Flocking control of UAV swarms with deep reinforcement leaming approach'. Together they form a unique fingerprint.

Cite this