Skip to main navigation Skip to search Skip to main content

稀疏奖励下多航天器规避决策自学习仿真

Translated title of the contribution: Self-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition

Research output: Contribution to journalArticlepeer-review

Abstract

In order to improve the ability of spacecraft formation to evade multiple interceptors, aiming at the low success rate of traditional procedural maneuver evasion, a multi-agent cooperative autonomous decision-making algorithm, which is based on deep reinforcement learning method, is proposed. Based on the actor-critic architecture, a multi-agent reinforcement learning algorithm is designed, in which a weighted linear fitting method is proposed to solve the reliability allocation problem of the self-learning system. To solve the sparse reward problem in task scenario, a sparse reward reinforcement learning method based on inverse value method is proposed. According to the task scenario, the space multi-agent countermeasure simulation system is established, and the correctness and effectiveness of the proposed algorithm are verified.

Translated title of the contributionSelf-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition
Original languageChinese (Traditional)
Pages (from-to)1766-1774
Number of pages9
JournalXitong Fangzhen Xuebao / Journal of System Simulation
Volume33
Issue number8
DOIs
StatePublished - 18 Aug 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Self-learning-based Multiple Spacecraft Evasion Decision Making Simulation Under Sparse Reward Condition'. Together they form a unique fingerprint.

Cite this