Abstract:In this work, we investigate the problem of multisatellite resource allocation for expected long-term performance optimization with a dynamic task network model, where communication tasks generated by task satellites are expected to be transmitted by resource satellites in the application layer, and the set of tasks changes with satellite orbital motions. The features of the tasks include priority, execution duration, visible time, etc. Since the feature information has a high dimension and changes with time, the scheduling problem is formulated as a dynamic combinatorial optimization problem and a receding-horizon task scheduling algorithm based on the event-triggered deep reinforcement learning is proposed. A residual-fully connected network is designed to extract the features of the complex task network model, and a deep double Q-learning iteration with the experience replay memory mechanism is employed to change the allocation strategy by evaluated rewards adaptively. An event-triggered strategy is then proposed to handle urgent tasks online. Numerical simulations show the performance improvement of the proposed algorithm. For the scenario of 50 task satellites and ten resource satellites, the proposed algorithm achieves 4.1, 5.9, and 11.4 higher reward scores than the static deep reinforcement learning algorithm, the data-driven parallel scheduling algorithm, and the improved genetic algorithm, respectively. The computation time of the proposed algorithm is only 34.7 and 21.3 of that of the latter two algorithms, and is similar to that of the static deep reinforcement learning algorithm.

Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites

Autonomous Target Revisiting Planning for LEO Observing Constellations Based on Improved Contract Network Protocol

Deep Reinforcement Learning-Based Autonomous Mission Planning Method for High and Low Orbit Multiple Agile Earth Observing Satellites

Deep Reinforcement Learning-Based Periodic Earth Observation Scheduling for Agile Satellite Constellation.

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

Satellite Attitude Tracking Control of Moving Targets Combining Deep Reinforcement Learning and Predefined-time Stability Considering Energy Optimization

Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications

Event-Triggered Deep Reinforcement Learning for Dynamic Task Scheduling in Multi-Satellite Resource Allocation

Distributed Satellite Mission Planning via Learning in Games

Motion planning techniques for self-configuration of homogeneous pivoting cube modular satellites

An Algorithm of Reinforcement Learning for Maneuvering Parameter Self-Tuning Applying in Satellite Cluster

Self-reconfiguration Strategies for Space-distributed Spacecraft

Two-Phase Neural Combinatorial Optimization with Reinforcement Learning for Agile Satellite Scheduling

Mission planning for Earth observation satellite with competitive learning strategy

Mission Planning for Distributed Multiple Agile Earth Observing Satellites by Attention-Based Deep Reinforcement Learning Method

Adaptive differential game for modular reconfigurable satellites based on neural network observer

DRL-Based Dynamic Destroy Approaches for Agile-Satellite Mission Planning

Autonomous Task Planning Method for Multi-Satellite System Based on a Hybrid Genetic Algorithm

A Fast Approach to Satellite Range Rescheduling Using Deep Reinforcement Learning

Deep Reinforcement Learning with Local Attention for Single Agile Optical Satellite Scheduling Problem

A General Technique To Combine Off-Policy Reinforcement Learning Algorithms With Satellite Attitude Control