A Graph Reinforcement Learning-Based Handover Strategy for Low Earth Orbit Satellites under Power Grid Scenarios

Haizhi Yu,Weidong Gao,Kaisa Zhang
DOI: https://doi.org/10.3390/aerospace11070511
IF: 2.66
2024-06-25
Aerospace
Abstract:Amidst the escalating need for stable power supplies and high-quality communication services in remote regions globally, due to challenges associated with deploying a conventional power communication infrastructure and its susceptibility to natural disasters, LEO satellite networks present a promising solution for broad geographical coverage and the provision of stable and high-speed communication services in remote regions. Given the necessity for frequent handovers to maintain service continuity, due to the high mobility of LEO satellites, a primary technical challenge confronting LEO satellite networks lies in efficiently managing the handover process between satellites, to guarantee the continuity and quality of communication services, particularly for power services. Thus, there is a critical need to explore satellite handover optimization algorithms. This paper presents a handover optimization scheme that integrates deep reinforcement learning (DRL) and graph neural networks (GNN) to dynamically optimize the satellite handover process and adapt to the time-varying satellite network environment. DRL models can effectively detect changes in the topology of satellite handover graphs across different time periods by leveraging the powerful representational capabilities of GNNs to make optimal handover decisions. Simulation experiments confirm that the handover strategy based on the fusion of message-passing neural network and deep Q-network algorithm (MPNN-DQN) outperforms traditional handover mechanisms and DRL-based strategies in reducing handover frequency, lowering communication latency, and achieving network load balancing. Integrating DRL and GNN into the satellite handover mechanism enhances the communication continuity and reliability of power systems in remote areas, while also offering a new direction for the design and optimization of future power system communication networks. This research contributes to the advancement of sophisticated satellite communication architectures that facilitate high-speed and reliable internet access in remote regions worldwide.
engineering, aerospace
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in the Low - Earth - Orbit (LEO) satellite network, especially in the power grid scenario, how to optimize the satellite handover strategy to ensure the continuity of communication and Quality of Service (QoS). Due to the high - speed movement characteristics of LEO satellites, users need to frequently switch between different satellites to maintain a stable communication connection. The challenges in this process include: 1. **High - frequency handover**: The rapid movement of LEO satellites causes user terminals to frequently switch to new satellites, which may increase communication latency and reduce connection stability. 2. **Transmission delay**: The transmission delay between satellites and ground stations directly affects the real - time and accuracy of data transmission, especially in smart grid applications where timely data collection and analysis are crucial. 3. **Data rate**: To meet the power system's requirements for real - time monitoring and control, it is necessary to ensure a sufficient data transmission rate. To solve these problems, the paper proposes a satellite handover strategy based on Graph Reinforcement Learning (GRL). By combining Deep Reinforcement Learning (DRL) and Graph Neural Networks (GNN), this strategy can dynamically optimize the satellite handover process and adapt to the constantly changing satellite network environment. Specifically, this method can effectively detect changes in the satellite handover graph topology during different time periods, thereby making optimal handover decisions. ### Formula summary - **Service time**: \[ T_i(t)=t_e^i - t \] where \(t_e^i\) is the end - of - service time of satellite \(i\) for the user, and \(t\) is the current time. - **Transmission delay**: \[ PD = \frac{d}{c_{\text{light}}} \] where \(d\) is the propagation distance between the satellite and the ground station, and the calculation formula is: \[ d=\sqrt{h^2+(x - o_x)^2+(y - o_y)^2} \] \((o_x, o_y)\) is the position coordinate directly below the satellite, \((x, y)\) is the coordinate position of the user node, and \(h\) is the vertical height of the satellite above the ground. - **Data rate**: According to Shannon's capacity theorem: \[ R = B\log_2\left(1+\frac{P_r}{P_N}\right) \] where \(B\) is the channel bandwidth, \(P_r\) is the received power, and the calculation formula is: \[ P_r = P_t - L_p+G_r+G_t \] \(P_t\) is the transmit power, \(G_t\) and \(G_r\) are the antenna gains at the transmitting end and the receiving end respectively, \(L_p\) is the total signal transmission loss, and the calculation formula is: \[ L_p = L_F+L_a+L_o \] The free - space propagation loss \(L_F\) is calculated by the formula: \[ L_F = 20\log_{10}\left(\frac{4\pi df}{\lambda}\right) \] where \(c\) is the speed of light, \(f\) is the carrier frequency (GHz), \(L_a\) is the signal loss caused by the atmosphere and rainfall, etc., and \(L_o\) is other losses and fading. Through these formulas, the paper proposes a multi - objective optimization problem that comprehensively considers service time, transmission delay, and data rate, aiming to maximize user service quality.