DRL-Based Dynamic Destroy Approaches for Agile-Satellite Mission Planning

Wei Huang, Zongwang Li, Xiaohe He, Junyan Xiang, Xu Du, Xuwen Liang
DOI: https://doi.org/10.3390/rs15184503
IF: 5
2023-09-14
Remote Sensing
Abstract: Agile-satellite mission planning is a crucial issue in the construction of satellite constellations. The large scale of remote sensing missions and the high complexity of constraints in agile-satellite mission planning pose challenges in the search for an optimal solution. To tackle the issue, a dynamic destroy deep-reinforcement learning (D3RL) model is designed to facilitate subsequent optimization operations via adaptive destruction to the existing solutions. Specifically, we first perform a clustering and embedding operation to reconstruct tasks into a clustering graph, thereby improving data utilization. Secondly, the D3RL model is established based on graph attention networks (GATs) to enhance the search efficiency for optimal solutions. Moreover, we present two applications of the D3RL model for intensive scenes: the deep-reinforcement learning (DRL) method and the D3RL-based large-neighborhood search method (DRL-LNS). Experimental simulation results illustrate that the D3RL-based approaches outperform the competition in terms of solutions' quality and computational efficiency, particularly in more challenging large-scale scenarios. DRL-LNS outperforms ALNS with an average scheduling rate improvement of approximately 11% in Area instances. In contrast, the DRL approach performs better in World scenarios, with an average scheduling rate that is around 8% higher than that of ALNS.
environmental sciences; imaging science & photographic technology; remote sensing; geosciences, multidisciplinary
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper addresses key issues in agile-satellite mission planning, particularly in large-scale scenarios. It identifies two main challenges:

1. **Large-scale remote sensing missions**: As the number and complexity of remote sensing tasks grow, existing mission planning systems struggle to find optimal solutions within limited time.
2. **High-complexity constraints**: Traditional Earth observation satellites (EOS) face constraints such as limited attitude maneuverability and visible time windows, which make mission planning very complex.

To solve these problems, the authors design a dynamic destroy deep reinforcement learning (D3RL) model, which assists subsequent optimization operations by adaptively destroying parts of existing solutions. The approach has three parts:

1. **Clustering and embedding**: Tasks are clustered and embedded, reconstructing them into a clustering graph to improve data utilization.
2. **D3RL model**: The model is built on graph attention networks (GATs) to make the search for optimal solutions more efficient.
3. **Two application methods**:
   - the deep reinforcement learning (DRL) method
   - the D3RL-based large-neighborhood search method (DRL-LNS)

Experimental results show that the D3RL-based approaches outperform competing methods in solution quality and computational efficiency, especially in challenging large-scale scenarios. In the Area instances, DRL-LNS improves the average scheduling rate by approximately 11% over ALNS; in the World scenarios, the DRL method achieves an average scheduling rate about 8% higher than that of ALNS.

### Main Contributions

1. Designs a D3RL model for large-scale agile-satellite mission planning and details the state, action, and reward functions of the Markov decision process (MDP).
2. Establishes a GAT actor network based on target clustering information.
3. Proposes two practical application methods: the DRL method and the DRL-LNS method.
4. Shows experimentally that the proposed algorithms converge faster and achieve higher task scheduling rates in large-scale scenarios.
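
The destroy-and-repair loop behind a large-neighborhood search such as DRL-LNS can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the single-satellite toy problem, the `Task` fields, the greedy `repair`, the acceptance rule, and the definition of `scheduling_rate` as scheduled tasks divided by total tasks are all hypothetical. In particular, `destroy` here removes tasks uniformly at random; according to the abstract, the paper's D3RL model is what supplies the adaptive destruction at that step.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    tid: int            # hypothetical task identifier
    win_start: float    # start of the visible time window
    win_end: float      # end of the visible time window
    duration: float     # required observation time


def earliest_start(task, busy):
    """Earliest feasible start inside the task's window, or None if it does not fit."""
    t = task.win_start
    for s, e in sorted(busy):
        if t + task.duration <= s:
            break               # the task fits in the gap before this busy interval
        t = max(t, e)           # otherwise skip past the busy interval
    return t if t + task.duration <= task.win_end else None


def repair(tasks, schedule, rng):
    """Greedy repair: try to insert every unscheduled task, in random order."""
    by_id = {t.tid: t for t in tasks}
    pending = [t for t in tasks if t.tid not in schedule]
    rng.shuffle(pending)
    for task in pending:
        busy = [(s, s + by_id[tid].duration) for tid, s in schedule.items()]
        start = earliest_start(task, busy)
        if start is not None:
            schedule[task.tid] = start
    return schedule


def destroy(schedule, k, rng):
    """Random destroy (assumption): drop up to k scheduled tasks.
    A learned policy would choose which tasks to remove instead."""
    removed = set(rng.sample(sorted(schedule), min(k, len(schedule))))
    return {tid: s for tid, s in schedule.items() if tid not in removed}


def scheduling_rate(tasks, schedule):
    """Fraction of tasks that received a start time (assumed definition)."""
    return len(schedule) / len(tasks)


def lns(tasks, iterations=200, k=3, seed=0):
    """Destroy-and-repair loop that keeps a candidate if it schedules at least as many tasks."""
    rng = random.Random(seed)
    best = repair(tasks, {}, rng)
    for _ in range(iterations):
        candidate = repair(tasks, destroy(best, k, rng), rng)
        if len(candidate) >= len(best):
            best = candidate
    return best


if __name__ == "__main__":
    gen = random.Random(1)
    demo = [Task(i, ws, ws + 6.0, 2.0)
            for i, ws in enumerate(gen.uniform(0, 20) for _ in range(30))]
    print(scheduling_rate(demo, lns(demo)))
```

Accepting candidates that tie with the incumbent lets the search drift across equally good schedules instead of stalling; a production LNS would typically use a more careful acceptance criterion and a problem-specific repair heuristic.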