Learning to Construct a Solution for the Agile Satellite Scheduling Problem With Time-Dependent Transition Times

Yuning Chen,Lining Xing,Ke Tang,Y. Chen,Yonghao Du,Ming Chen
DOI: https://doi.org/10.1109/TSMC.2024.3411640
2024-10-01
IEEE Transactions on Systems, Man, and Cybernetics: Systems
Abstract:The agile earth observation satellite scheduling problem (AEOSSP) with time-dependent transition times is a complex combinational optimization problem that has emerged from the development of large-scale satellite management techniques. To address this problem, we propose a deep reinforcement learning-based construction model (DRL-CM) that consists of five parts: 1) a Markov decision process (MDP); 2) a feature engineering; 3) a constructive heuristic neural network (CHNN); 4) an RL training method; and 5) an evaluation system. Specifically, the CHNN comprises six modules containing three special components that we propose: a dynamic encoder, a dynamic global layer, and a two-stage attention layer. First, we build the MDP of the AEOSSP and the feature engineering with effective features required for decision-making. Second, we design the CHNN to function as the MDP policy and train it with an RL model. Finally, we propose a comprehensive evaluation system for the validation of our model. The experimental results indicate that the proposed DRL-CM outperforms the state-of-the-art algorithm in terms of both optimization speed and quality. In addition, the feature engineering and network architecture built in our model are verified to be effective in comprehensive experiments.
Computer Science,Engineering
What problem does this paper attempt to address?