DeepNap: Data-Driven Base Station Sleeping Operations Through Deep Reinforcement Learning

Jingchu Liu, Bhaskar Krishnamachari, Sheng Zhou, Zhisheng Niu
DOI: https://doi.org/10.1109/jiot.2018.2846694
IF: 10.6
2018-01-01
IEEE Internet of Things Journal
Abstract: Base station (BS) sleeping is an effective way to reduce the energy consumption of mobile networks. Previous efforts to design sleeping control algorithms mainly rely on stochastic traffic models and analytical derivation. However, the tractability of such models often conflicts with the complexity of real-world traffic, making the resulting algorithms difficult to apply in practice. In this paper, we propose a data-driven algorithm for dynamic sleeping control called DeepNap. This algorithm uses a deep Q-network (DQN) to learn effective sleeping policies from high-dimensional raw observations or unquantized system state vectors. We propose to enhance the original DQN algorithm with action-wise experience replay and adaptive reward scaling to deal with the challenges posed by nonstationary traffic. We also provide a model-assisted variant of DeepNap through the Dyna framework for inferring and simulating system dynamics. Periodical traffic modeling makes it possible to capture the nonstationarity in real-world traffic, and its integration with DQN allows for feature learning and generalization from model outputs. Experiments show that both the end-to-end and the model-assisted versions of DeepNap outperform a table-based $Q$-learning algorithm, and that the nonstationarity enhancements improve the stability of vanilla DQN.
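
The abstract names action-wise experience replay but does not spell out its mechanics. The sketch below shows one plausible reading, offered as an assumption rather than the authors' implementation: transitions are stored in a separate bounded buffer per action, and each minibatch draws evenly across actions so that a rarely taken action (for example, "sleep" under heavy traffic) is not crowded out of training. The class name `ActionWiseReplay`, its parameters, and the reward values are all hypothetical.

```python
import random
from collections import deque


class ActionWiseReplay:
    """Hypothetical sketch: one bounded FIFO buffer per action."""

    def __init__(self, n_actions, capacity_per_action, seed=None):
        # deque(maxlen=...) silently evicts the oldest transition when full
        self.buffers = [deque(maxlen=capacity_per_action) for _ in range(n_actions)]
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state):
        # Route each transition to the buffer of the action that produced it
        self.buffers[action].append((state, action, reward, next_state))

    def sample(self, batch_size):
        # Split the batch evenly over the actions that have any data
        non_empty = [buf for buf in self.buffers if buf]
        if not non_empty:
            return []
        per_action = max(1, batch_size // len(non_empty))
        batch = []
        for buf in non_empty:
            # Sample with replacement so sparse buffers still fill their share
            batch.extend(self.rng.choices(list(buf), k=per_action))
        return batch


# Assumed skew: action 0 ("stay awake") is observed far more often than action 1 ("sleep")
replay = ActionWiseReplay(n_actions=2, capacity_per_action=100, seed=0)
for t in range(95):
    replay.add(t, 0, -1.0, t + 1)
for t in range(5):
    replay.add(t, 1, -0.2, t + 1)

batch = replay.sample(32)
counts = [sum(1 for tr in batch if tr[1] == a) for a in (0, 1)]
print(counts)  # [16, 16]
```

With a single shared buffer, uniform sampling would return roughly 95% "stay awake" transitions here; the per-action split keeps both actions equally represented in each update, at the cost of reusing the few "sleep" samples more often.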