Traffic Optimization in Satellites Communications: A Multi-agent Reinforcement Learning Approach
Zeyu Qin,Haipeng Yao,Tianle Mai
DOI: https://doi.org/10.1109/iwcmc48107.2020.9148523
2020-06-01
Abstract:Past few years have witnessed the compelling applications of the satellite communications and networking in our daily life. Due to the extremely high moving speeds and limited networking resources of LEO satellites, how to optimize inter-satellite traffic has received amount of attention from both academia and industry. In this paper, we proposed a hybrid satellites network traffic control paradigm. In our architecture, the centralized platform collect the global state and the joint action from each agent during the training phase to ease the training, and during execution, the each agent can return the action to the local state through the trained policy. Besides, we adopt a multiagent actor-critic algorithms named Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments(MADDPG) to our architecture. In addition, some simulation results are presented to evaluate the correctness of our architecture and algorithm.
What problem does this paper attempt to address?