Abstract:The cooperative positioning problem of hypersonic vehicles regarding LEO constellations is the focus of this research study on space-based early warning systems. A hypersonic vehicle is highly maneuverable, and its trajectory is uncertain. New challenges are posed for the cooperative positioning capability of the constellation. In recent years, breakthroughs in artificial intelligence technology have provided new avenues for collaborative multi-satellite intelligent autonomous decision-making technology. This paper addresses the problem of multi-satellite cooperative geometric positioning for hypersonic glide vehicles (HGVs) by the LEO-constellation-tracking system. To exploit the inherent advantages of hierarchical reinforcement learning in intelligent decision making while satisfying the constraints of cooperative observations, an autonomous intelligent decision-making algorithm for satellites that incorporates a hierarchical proximal policy optimization with random hill climbing (MAPPO-RHC) is designed. On the one hand, hierarchical decision making is used to reduce the solution space; on the other hand, it is used to maximize the global reward and to uniformly distribute satellite resources. The single-satellite local search method improves the capability of the decision-making algorithm to search the solution space based on the decision-making results of the hierarchical proximal policy-optimization algorithm, combining both random hill climbing and heuristic methods. Finally, the MAPPO-RHC algorithm's coverage and positioning accuracy performance is simulated and analyzed in two different scenarios and compared with four intelligent satellite decision-making algorithms that have been studied in recent years. From the simulation results, the decision-making results of the MAPPO-RHC algorithm can obtain more balanced resource allocations and higher geometric positioning accuracy. Thus, it is concluded that the MAPPO-RHC algorithm provides a feasible solution for the real-time decision-making problem of the LEO constellation early warning system.

Reinforcement Learning-Based Multi-Impulse Rendezvous Approach for Satellite Constellation Reconfiguration

Autonomous Target Revisiting Planning for LEO Observing Constellations Based on Improved Contract Network Protocol

Deep Reinforcement Learning-Based Autonomous Mission Planning Method for High and Low Orbit Multiple Agile Earth Observing Satellites

Satellite Attitude Tracking Control of Moving Targets Combining Deep Reinforcement Learning and Predefined-time Stability Considering Energy Optimization

Deep Reinforcement Learning-Based Periodic Earth Observation Scheduling for Agile Satellite Constellation.

Model Predictive Control-based Mission Planning Method for Moving Target Tracking by Multiple Observing Satellites

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications

Multi-Satellite Reconfiguration of Formation Around Libration Point

Reinforcement learning-based satellite formation attitude control under multi-constraint

Simultaneous approach with partial error control on non-collocation points based satellite formation reconfiguration

A Fast Approach to Satellite Range Rescheduling Using Deep Reinforcement Learning

Observation Method for Autonomous Maneuver of Spacecraft under Emergency Conditions

Optimal Multi-impulse Close-range Rendezvous

An LEO Constellation Early Warning System Decision-Making Method Based on Hierarchical Reinforcement Learning

An Algorithm of Reinforcement Learning for Maneuvering Parameter Self-Tuning Applying in Satellite Cluster

Reconfiguration of a satellite constellation in circular formation orbit with decentralized model predictive control

Minimum Cost Perturbed Multi-impulsive Maneuver Methodology to Accomplish an Optimal Deployment Scheduling for a Satellite Constellation

Deterministic Multistage Constellation Reconfiguration Using Integer Programming and Sequential Decision-Making Methods

A General Technique To Combine Off-Policy Reinforcement Learning Algorithms With Satellite Attitude Control

Two-Phase Neural Combinatorial Optimization with Reinforcement Learning for Agile Satellite Scheduling