Abstract
In order to improve the ability of spacecraft formation to evade multiple interceptors, aiming at the low success rate of traditional procedural maneuver evasion, a multi-agent cooperative autonomous decision-making algorithm, which is based on deep reinforcement learning method, is proposed. Based on the actor-critic architecture, a multi-agent reinforcement learning algorithm is designed, in which a weighted linear fitting method is proposed to solve the reliability allocation problem of the self-learning system. To solve the sparse reward problem in task scenario, a sparse reward reinforcement learning method based on inverse value method is proposed. According to the task scenario, the space multi-agent countermeasure simulation system is established, and the correctness and effectiveness of the proposed algorithm are verified.
| Translated title of the contribution | Self-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition |
|---|---|
| Original language | Chinese (Traditional) |
| Pages (from-to) | 1766-1774 |
| Number of pages | 9 |
| Journal | Xitong Fangzhen Xuebao / Journal of System Simulation |
| Volume | 33 |
| Issue number | 8 |
| DOIs | |
| State | Published - 18 Aug 2021 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'Self-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver