Abstract:Agile-satellite mission planning is a crucial issue in the construction of satellite constellations. The large scale of remote sensing missions and the high complexity of constraints in agile-satellite mission planning pose challenges in the search for an optimal solution. To tackle the issue, a dynamic destroy deep-reinforcement learning (D3RL) model is designed to facilitate subsequent optimization operations via adaptive destruction to the existing solutions. Specifically, we first perform a clustering and embedding operation to reconstruct tasks into a clustering graph, thereby improving data utilization. Secondly, the D3RL model is established based on graph attention networks (GATs) to enhance the search efficiency for optimal solutions. Moreover, we present two applications of the D3RL model for intensive scenes: the deep-reinforcement learning (DRL) method and the D3RL-based large-neighborhood search method (DRL-LNS). Experimental simulation results illustrate that the D3RL-based approaches outperform the competition in terms of solutions' quality and computational efficiency, particularly in more challenging large-scale scenarios. DRL-LNS outperforms ALNS with an average scheduling rate improvement of approximately 11% in Area instances. In contrast, the DRL approach performs better in World scenarios, with an average scheduling rate that is around 8% higher than that of ALNS.

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper aims to address key issues in agile satellite mission planning, particularly in large-scale scenarios. Specifically, agile satellite mission planning faces the following challenges: 1. **Large-scale Remote Sensing Missions**: With the increase in the number of remote sensing missions and the complexity of tasks, existing mission planning systems struggle to find optimal solutions within limited time. 2. **High Complexity Constraints**: Traditional Earth Observation Satellites (EOS) face various constraints such as limited attitude maneuverability and visible time windows, making mission planning very complex. To solve these problems, the authors designed a Deep Reinforcement Learning based on Dynamic Destruction (D3RL) model. This model assists subsequent optimization operations by adaptively disrupting existing solutions. The specific steps are as follows: 1. **Clustering and Embedding Operations**: First, tasks are clustered and embedded, reconstructing the tasks into a clustered graph to improve data utilization. 2. **D3RL Model Establishment**: Based on Graph Attention Networks (GATs), the D3RL model is established to enhance the efficiency of searching for optimal solutions. 3. **Proposed Application Methods**: Two application methods are proposed: - Deep Reinforcement Learning (DRL) method - Large Neighborhood Search method based on D3RL (DRL-LNS) Experimental results show that the D3RL method outperforms other competing methods in terms of solution quality and computational efficiency, especially in challenging large-scale scenarios. Specifically, in the Area instance, the DRL-LNS method improved the average scheduling rate by approximately 11%, while in the World scenario, the DRL method improved the average scheduling rate by about 8%. ### Main Contributions 1. Designed a D3RL model for large-scale agile satellite mission planning and detailed the state, action, and reward functions in the Markov Decision Process (MDP). 2. Established a GAT actor network based on target clustering information. 3. Proposed two practical application methods: the DRL method and the DRL-LNS method. 4. Experimental results demonstrate that the proposed algorithm converges faster and achieves higher task scheduling rates in large-scale scenarios.

DRL-Based Dynamic Destroy Approaches for Agile-Satellite Mission Planning

Deep Reinforcement Learning-Based Autonomous Mission Planning Method for High and Low Orbit Multiple Agile Earth Observing Satellites

Deep Reinforcement Learning-Based Periodic Earth Observation Scheduling for Agile Satellite Constellation.

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

A Fast Approach to Satellite Range Rescheduling Using Deep Reinforcement Learning

Mission Planning for Distributed Multiple Agile Earth Observing Satellites by Attention-Based Deep Reinforcement Learning Method

Learning to Construct a Solution for the Agile Satellite Scheduling Problem With Time-Dependent Transition Times

Event-Triggered Deep Reinforcement Learning for Dynamic Task Scheduling in Multi-Satellite Resource Allocation

AI-Driven Risk-Aware Scheduling for Active Debris Removal Missions

Deep Reinforcement Learning with Local Attention for Single Agile Optical Satellite Scheduling Problem

DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV

Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications

Active Debris Removal Mission Planning Method Based on Machine Learning

Adaptive satellite attitude control for varying masses using deep reinforcement learning

Dynamic Channel Allocation for Satellite Internet of Things via Deep Reinforcement Learning

Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

Deep Reinforcement Learning for UAV Intelligent Mission Planning

Autonomous spacecraft collision avoidance with a variable number of space debris based on safe reinforcement learning

A DRL-Based Satellite Service Allocation Method in LEO Satellite Networks

Deep Reinforcement Learning for Intelligent Dual-UAV Reconnaissance Mission Planning