Spacecraft Resources Dynamic Scheduling Strategy Based on Reinforcement Learning

Chao Zhang,Lei Wang,Cong Zhang,Sihang Zhang,Yuan Huang
DOI: https://doi.org/10.1109/ICUS55513.2022.9986637
2022-10-28
Abstract:With the development of intelligent control of spacecraft, spacecraft needs to have the ability of dynamic scheduling of on-board resources. In this paper, reinforcement learning is applied to the dynamic scheduling of spacecraft finite resources. The space situation information under a specific mission is used as the input, and the dynamical spacecraft resource allocation is output. In order to further improve the effectiveness of the algorithm, the classical Deep Q-learning Network (DQN) algorithm is modified. Aiming at the convergence issues of the classical DQN algorithm, the varying epsilon mechanism and normalization are introduced. In view of the limitation of spacecraft resources, the Action-Mask mechanism is introduced. Simulation experiments show that compared with the random scheduling strategy and the scheduling strategy based on expert knowledge, the modified DQN algorithm proposed in this paper ensures the spacecraft to complete the mission more smoothly and saves the velocity increment of the spacecraft.
What problem does this paper attempt to address?