Skip to main navigation Skip to search Skip to main content

基于强化学习的多无人机避碰计算制导方法

Translated title of the contribution: A Reinforcement Learning Based Computational Guidance Approach for UAVs Collision Avoidance
  • School of Astronautics, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Aiming at the problem of cooperative collision avoidance for a large number of fixed wing UAVs in limited airspace, a computational guidance method based on multi-agent deep reinforcement learning is proposed. Firstly, the process of collision avoidance and guidance is formulated as a sequential decision problem, which is mathematically described by Markov game theory. Then, a decision-making method of autonomous collision avoidance guidance based on multilayer neural network technology is proposed. The network is trained by the improved Actor-Critic model. Furthermore, the machine learning architecture is designed to implement the method. The relevant neural network structure and coordination mechanisms among UAVs are given. Finally, a flight simulator with variable number of entities is established, in which centralized training and distributed execution are performed. In order to verify the performance of the algorithm, several simulation experiments are carried out in the scene of high traffic density. The simulation results show that the proposed onboard computational guidance method can effectively reduce the collision probability of multiple UAVs in flight process have a good adaptability to the scene of high route density.

Translated title of the contributionA Reinforcement Learning Based Computational Guidance Approach for UAVs Collision Avoidance
Original languageChinese (Traditional)
Pages (from-to)31-40
Number of pages10
JournalNavigation, Positionng and Timing
Volume8
Issue number1
DOIs
StatePublished - Jan 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'A Reinforcement Learning Based Computational Guidance Approach for UAVs Collision Avoidance'. Together they form a unique fingerprint.

Cite this