Reinforcement Learning-Based Multi-Impulse Rendezvous Approach for Satellite Constellation Reconfiguration

Longwei Xu,Gang Zhang,Shi Qiu,Xibin Cao
DOI: https://doi.org/10.1016/j.actaastro.2024.08.029
IF: 2.954
2024-01-01
Acta Astronautica
Abstract:This paper presents a satellite constellation orbital reconfiguration method based on the intelligent multi-impulse rendezvous considering the impulse-magnitude limit. This problem is solved by two steps: the single-target rendezvous trajectory optimization and the multi-target rendezvous mission planning. For single target, the multi-impulse rendezvous problem is transformed into a Markov decision process and solved by a reinforcement learning algorithm. The rewards of the critic networks are used as estimation of fuel consumption, while the actions of the actor networks are used as initial values for further optimization. For multiple targets, an efficient Monte-Carlo tree search method is used to realize the global optimal solution of the satellite constellation reconfiguration problem. Finally, the numerical examples show that the proposed method can rapidly optimize the multi-impulse constellation reconfiguration problem.
What problem does this paper attempt to address?