Abstract
This article proposes a fast multispacecraft rendezvous sequence mission planning algorithm based on neural combinatorial optimization and deep reinforcement learning (DRL). The algorithm can address rendezvous requirements for various geostationary Earth orbit (GEO) spacecraft on-orbit servicing missions, such as on-orbit maintenance and on-orbit refueling. In order to meet the fast decision-making needs of multispacecraft rendezvous missions and ensure the successful completion of the missions, we take advantage of the fast calculation speed of the DRL algorithm and make some unique designs to address the challenges of missions, and finally propose the data augmented REINFORCE algorithm (DARA). The algorithm can rapidly plan the optimal rendezvous sequence for multiple GEO targets and minimize fuel consumption while completing all rendezvous missions. It is implemented based on the REINFORCE algorithm using a multihead attention neural network. In order to solve the problem that DRL training is difficult due to the time-varying characteristics of the space targets’ positions, a data augmentation method is designed to strengthen the learning weights of different parameters. In order to solve the problem of trivial objective function gradient and slow iteration caused by the dense distribution of orbital planes, a loss function is designed to improve the earning efficiency of the neural network. Through multiple comparative experiments with existing algorithms, the calculation speed of DARA significantly exceeds that of commonly used heuristic mission planning algorithms. The results accuracy is better than that of various meta-heuristic algorithms and typical reinforcement learning algorithm, which proves the effectiveness and superiority of the proposed algorithm.
| Original language | English |
|---|---|
| Pages (from-to) | 16251-16266 |
| Number of pages | 16 |
| Journal | IEEE Transactions on Aerospace and Electronic Systems |
| Volume | 61 |
| Issue number | 6 |
| DOIs | |
| State | Published - 2025 |
Keywords
- Attention network
- deep reinforcement learning (DRL)
- geostationary Earth orbit (GEO) satellite
- mission planning
- on-orbit servicing
Fingerprint
Dive into the research topics of 'GEO Spacecraft Rendezvous Sequence Mission Planning Based on Data Augmented REINFORCE Algorithm'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver