OPTIMIZATION ALGORITHM FOR INTERPLANETARY TRANSFER TRAJECTORIES OF SOLAR SAILCRAFT BASED ON DEEP REINFORCEMENT LEARNING

Chengyang Zhou, Lin Cheng, Qingzhen Zhang, Ke Fang
2018-01-01
Abstract: This paper studies the optimization of transfer trajectories for solar sailcraft using deep reinforcement learning (DRL). First, the orbital dynamic model of an ideal solar sailcraft is established, and the control angles at discrete time points are used as design variables, which avoids having to guess initial values for costate variables that have no physical significance. In this way, the optimization problem is transformed into a nonlinear programming (NLP) problem. On this basis, a Markov model of the solar sailcraft is established, and the reward function is designed according to the objective function and constraints. Combining deep learning with stochastic policy gradient reinforcement learning, the A3C (Asynchronous Advantage Actor-Critic) algorithm is implemented in Python with TensorFlow, and the transfer trajectory of a solar sailcraft from geo-synchronous orbit to Mars synchronous orbit is simulated to verify the effectiveness of the algorithm. The simulation results show that the algorithm converges quickly, and that the controller trained for 30000 episodes meets the terminal constraints and has good real-time performance. The results verify the effectiveness of deep reinforcement learning in solving the optimization problem for interplanetary transfer trajectories of solar sailcraft.
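The formulation described in the abstract (ideal-sail dynamics, a control angle held constant over each discrete time step, and a reward built from the objective and terminal constraints) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a planar heliocentric model in polar coordinates with canonical units, a single cone angle `alpha` as the action, and a hypothetical lightness number `BETA`. The reward weights and the function names `sail_derivatives`, `rk4_step`, `terminal_error`, and `step` are illustrative choices, and the A3C training loop itself is omitted.

```python
import math

MU = 1.0     # solar gravitational parameter, canonical units (assumption)
BETA = 0.17  # sail lightness number (hypothetical value, not from the paper)


def sail_derivatives(state, alpha):
    """Planar ideal-sail dynamics in polar coordinates.

    state = (r, theta, u, v): radius, polar angle, radial and transverse speed.
    alpha: cone angle between the sail normal and the Sun line, in radians.
    """
    r, theta, u, v = state
    a_c = BETA * MU / r**2  # characteristic sail acceleration at radius r
    return (
        u,
        v / r,
        v**2 / r - MU / r**2 + a_c * math.cos(alpha) ** 3,
        -u * v / r + a_c * math.cos(alpha) ** 2 * math.sin(alpha),
    )


def rk4_step(state, alpha, dt):
    """Advance the state by dt with alpha held constant (classical RK4)."""
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = sail_derivatives(state, alpha)
    k2 = sail_derivatives(shifted(state, k1, dt / 2), alpha)
    k3 = sail_derivatives(shifted(state, k2, dt / 2), alpha)
    k4 = sail_derivatives(shifted(state, k3, dt), alpha)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def terminal_error(state, r_target):
    """Distance from a circular target orbit of radius r_target."""
    r, _, u, v = state
    return abs(r - r_target) + abs(u) + abs(v - math.sqrt(MU / r_target))


def step(state, alpha, dt, r_target, is_last):
    """One environment transition: next state and scalar reward.

    The reward is an illustrative design: a time penalty at every step and a
    weighted terminal-constraint penalty at the final step.
    """
    next_state = rk4_step(state, alpha, dt)
    reward = -dt
    if is_last:
        reward -= 100.0 * terminal_error(next_state, r_target)
    return next_state, reward
```

A policy network would map the state tuple to `alpha` at each step. Calling `step` repeatedly from an initial circular state such as `(1.0, 0.0, 0.0, 1.0)` produces one episode whose summed reward the agent tries to maximize.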