TY - GEN
T1 - DRL-based Optimal Scheduling for On-orbit Service with the Encoder-decoder network
AU - Zhang, Jierui
AU - Si, Chaoming
AU - Ma, Changbo
AU - Chen, Ting
AU - Xia, Hongwei
AU - Ma, Guangcheng
N1 - Publisher Copyright:
© 2024 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2024
Y1 - 2024
N2 - In this paper, we present a deep reinforcement learning (DRL) based strategy for optimizing the scheduling of satellite on-orbit services. The orbital maneuvers necessitate the servicing satellite to consecutively rendezvous with multiple targets to execute its on-orbit missions. The principal aim of our optimization approach is to ascertain the most advantageous sequence for servicing satellites, thereby minimizing the overall cost, contingent upon the expenditure of propulsion maneuvers. To surmount this formidable challenge, we introduce an attention-based encoder-decoder neural network and train its parameters utilizing the REINFORCE algorithm with a greedy rollout baseline. Ultimately, experimental results across diverse scenarios validate the efficacy and supremacy of our proposed algorithm. The chief contribution of this work lies in its conceptualization of the satellite on-orbit service scheduling optimization quandary as an extended traveling salesman problem, culminating in the introduction of an innovative DRL-based methodology.
AB - In this paper, we present a deep reinforcement learning (DRL) based strategy for optimizing the scheduling of satellite on-orbit services. The orbital maneuvers necessitate the servicing satellite to consecutively rendezvous with multiple targets to execute its on-orbit missions. The principal aim of our optimization approach is to ascertain the most advantageous sequence for servicing satellites, thereby minimizing the overall cost, contingent upon the expenditure of propulsion maneuvers. To surmount this formidable challenge, we introduce an attention-based encoder-decoder neural network and train its parameters utilizing the REINFORCE algorithm with a greedy rollout baseline. Ultimately, experimental results across diverse scenarios validate the efficacy and supremacy of our proposed algorithm. The chief contribution of this work lies in its conceptualization of the satellite on-orbit service scheduling optimization quandary as an extended traveling salesman problem, culminating in the introduction of an innovative DRL-based methodology.
KW - Deep reinforcement learning
KW - Encoder-decoder neural network
KW - Optimal scheduling
KW - Satellite on-orbit service
UR - https://www.scopus.com/pages/publications/85205490779
U2 - 10.23919/CCC63176.2024.10662023
DO - 10.23919/CCC63176.2024.10662023
M3 - 会议稿件
AN - SCOPUS:85205490779
T3 - Chinese Control Conference, CCC
SP - 8786
EP - 8791
BT - Proceedings of the 43rd Chinese Control Conference, CCC 2024
A2 - Na, Jing
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 43rd Chinese Control Conference, CCC 2024
Y2 - 28 July 2024 through 31 July 2024
ER -