Abstract:Amidst the escalating need for stable power supplies and high-quality communication services in remote regions globally, due to challenges associated with deploying a conventional power communication infrastructure and its susceptibility to natural disasters, LEO satellite networks present a promising solution for broad geographical coverage and the provision of stable and high-speed communication services in remote regions. Given the necessity for frequent handovers to maintain service continuity, due to the high mobility of LEO satellites, a primary technical challenge confronting LEO satellite networks lies in efficiently managing the handover process between satellites, to guarantee the continuity and quality of communication services, particularly for power services. Thus, there is a critical need to explore satellite handover optimization algorithms. This paper presents a handover optimization scheme that integrates deep reinforcement learning (DRL) and graph neural networks (GNN) to dynamically optimize the satellite handover process and adapt to the time-varying satellite network environment. DRL models can effectively detect changes in the topology of satellite handover graphs across different time periods by leveraging the powerful representational capabilities of GNNs to make optimal handover decisions. Simulation experiments confirm that the handover strategy based on the fusion of message-passing neural network and deep Q-network algorithm (MPNN-DQN) outperforms traditional handover mechanisms and DRL-based strategies in reducing handover frequency, lowering communication latency, and achieving network load balancing. Integrating DRL and GNN into the satellite handover mechanism enhances the communication continuity and reliability of power systems in remote areas, while also offering a new direction for the design and optimization of future power system communication networks. This research contributes to the advancement of sophisticated satellite communication architectures that facilitate high-speed and reliable internet access in remote regions worldwide.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: in the Low - Earth - Orbit (LEO) satellite network, especially in the power grid scenario, how to optimize the satellite handover strategy to ensure the continuity of communication and Quality of Service (QoS). Due to the high - speed movement characteristics of LEO satellites, users need to frequently switch between different satellites to maintain a stable communication connection. The challenges in this process include: 1. **High - frequency handover**: The rapid movement of LEO satellites causes user terminals to frequently switch to new satellites, which may increase communication latency and reduce connection stability. 2. **Transmission delay**: The transmission delay between satellites and ground stations directly affects the real - time and accuracy of data transmission, especially in smart grid applications where timely data collection and analysis are crucial. 3. **Data rate**: To meet the power system's requirements for real - time monitoring and control, it is necessary to ensure a sufficient data transmission rate. To solve these problems, the paper proposes a satellite handover strategy based on Graph Reinforcement Learning (GRL). By combining Deep Reinforcement Learning (DRL) and Graph Neural Networks (GNN), this strategy can dynamically optimize the satellite handover process and adapt to the constantly changing satellite network environment. Specifically, this method can effectively detect changes in the satellite handover graph topology during different time periods, thereby making optimal handover decisions. ### Formula summary - **Service time**: \[ T_i(t)=t_e^i - t \] where \(t_e^i\) is the end - of - service time of satellite \(i\) for the user, and \(t\) is the current time. - **Transmission delay**: \[ PD = \frac{d}{c_{\text{light}}} \] where \(d\) is the propagation distance between the satellite and the ground station, and the calculation formula is: \[ d=\sqrt{h^2+(x - o_x)^2+(y - o_y)^2} \] \((o_x, o_y)\) is the position coordinate directly below the satellite, \((x, y)\) is the coordinate position of the user node, and \(h\) is the vertical height of the satellite above the ground. - **Data rate**: According to Shannon's capacity theorem: \[ R = B\log_2\left(1+\frac{P_r}{P_N}\right) \] where \(B\) is the channel bandwidth, \(P_r\) is the received power, and the calculation formula is: \[ P_r = P_t - L_p+G_r+G_t \] \(P_t\) is the transmit power, \(G_t\) and \(G_r\) are the antenna gains at the transmitting end and the receiving end respectively, \(L_p\) is the total signal transmission loss, and the calculation formula is: \[ L_p = L_F+L_a+L_o \] The free - space propagation loss \(L_F\) is calculated by the formula: \[ L_F = 20\log_{10}\left(\frac{4\pi df}{\lambda}\right) \] where \(c\) is the speed of light, \(f\) is the carrier frequency (GHz), \(L_a\) is the signal loss caused by the atmosphere and rainfall, etc., and \(L_o\) is other losses and fading. Through these formulas, the paper proposes a multi - objective optimization problem that comprehensively considers service time, transmission delay, and data rate, aiming to maximize user service quality.

A Graph Reinforcement Learning-Based Handover Strategy for Low Earth Orbit Satellites under Power Grid Scenarios

GRouting: Dynamic Routing for LEO Satellite Networks with Graph-based Deep Reinforcement Learning

Load-Aware Satellite Handover Strategy Based on Multi-Agent Reinforcement Learning

Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization

Graph Neural Network and Reinforcement Learning Based Routing for Mega LEO Satellite Constellations

An Intelligent Routing Algorithm for LEO Satellites Based on Deep Reinforcement Learning

Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles

Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites

Deep Reinforcement Learning for Congestion Control and Routing Optimization in LEO Satellite Networks

Seamless Handover in LEO Based Non-Terrestrial Networks: Service Continuity and Optimization

Multi objective constellation optimization and dynamic link utilization for sustainable information delivery using PD-NOMA deep reinforcement learning

Handover Strategies for Emerging LEO, MEO, and HEO Satellite Networks

Accelerating Handover in Mobile Satellite Network

A Graph-Based Customizable Handover Framework for LEO Satellite Networks

Reinforcement learning based dynamic distributed routing scheme for mega LEO satellite networks

QoS-Centric Handover for Civil Aviation Aircraft Access in Ultra-Dense LEO Satellite Networks

Enabling High-Throughput Routing for LEO Satellite Broadband Networks: A Flow-Centric Deep Reinforcement Learning Approach

Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

QoE-Driven Intelligent Handover for User-Centric Mobile Satellite Networks

Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach