Deep Reinforcement Learning-Based Beam Hopping Algorithm in Multibeam Satellite Systems.

Xin Hu,Shuaijun Liu,Yipeng Wang,Lexi Xu,Yuchen Zhang,Cheng Wang,Weidong Wang
DOI: https://doi.org/10.1049/iet-com.2018.5774
IF: 1.345
2019-01-01
IET Communications
Abstract:Beam hopping (BH) is the key technology to improve the system throughput and decrease the transmission delay in multibeam satellite systems. The objective of this study is to find a policy to maximise the expected long-term resource utilisation. The BH illumination plan (BHIP) optimisation problem aimed at minimising the transmission delay is formulated and modelled as a partially observable Markov decision process. To tackle the issue of unknown dynamics and prohibitive computation, an artificial intelligence method named deep reinforcement learning (DRL) is first proposed to solve the BHIP problem in multibeam satellite systems. The proposed DRL-BHIP algorithm considers a series of realistic conditions, including the traffic demands in spatial distribution and temporal variation, ModCod constraints, antenna radiation pattern and inter-beam interference. The state reformulation concept is adopted to characterise the traffic spatial and temporal features. Simulation results show that the proposed DRL-BHIP algorithm can decrease the transmission delay and improve the system throughput compared with existing algorithms.
What problem does this paper attempt to address?