Skip to main navigation Skip to search Skip to main content

Flexible Formation Control Using Hausdorff Distance: A Multi-agent Reinforcement Learning Approach

  • Chaoyi Pan*
  • , Yuzi Yan*
  • , Zexu Zhang
  • , Yuan Shen*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While fixed topology formation control with a centralized controller has been studied for multi-agent systems, it remains challenging to develop robust distributed control policies that can achieve a flexible formation without a global coordinate system. In this paper, we design a fully decentralized displacement-based formation control policy for multi-agent systems, which can achieve any formation after one-time training. In particular, we use a model-free multi-agent reinforcement learning (MARL) approach to obtain such a policy in the centralized training process. The Hausdorff distance is adopted in the reward function for measuring the distance between the current and target topology. The feasibility of our method is verified by both simulation and implementation on omnidirectional vehicles.

Original languageEnglish
Title of host publication30th European Signal Processing Conference, EUSIPCO 2022 - Proceedings
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages972-976
Number of pages5
ISBN (Electronic)9789082797091
StatePublished - 2022
Externally publishedYes
Event30th European Signal Processing Conference, EUSIPCO 2022 - Belgrade, Serbia
Duration: 29 Aug 20222 Sep 2022

Publication series

NameEuropean Signal Processing Conference
Volume2022-August
ISSN (Print)2219-5491

Conference

Conference30th European Signal Processing Conference, EUSIPCO 2022
Country/TerritorySerbia
CityBelgrade
Period29/08/222/09/22

Keywords

  • Hausdorff distance
  • Multi-agent reinforcement learning (MARL)
  • formation control

Fingerprint

Dive into the research topics of 'Flexible Formation Control Using Hausdorff Distance: A Multi-agent Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this