A Reinforcement Learning Approach for D2D-Assisted Cache-Enabled HetNets

Jie Tang, Hengbin Tang, Nan Zhao, Kanapathippillai Cumanan, Shunqing Zhang, Yongjin Zhou
DOI: https://doi.org/10.1109/GLOBECOM38437.2019.9014027
2019-01-01
Abstract: Mobile edge caching (MEC) and device-to-device (D2D) communications are two potential technologies for resolving traffic overload in heterogeneous networks (HetNets). Prior works usually investigate them separately, with MEC for traffic offloading and D2D for information transmission. In this paper, a joint framework consisting of MEC and cache-enabled D2D communications is proposed to minimize the energy cost of systematic traffic transmission, where file popularity and user preference are the critical criteria for small base stations (SBSs) and users, respectively. Under this framework, we propose a novel caching strategy in which a Markov decision process (MDP) is applied to model the requesting behaviors of users. A new scheme based on reinforcement learning (RL) is proposed to reveal the popularity of files as well as users' preferences. In particular, the Q-learning (QL) algorithm and the deep Q-network (DQN) algorithm are applied to the users and the SBS, respectively. To save the energy cost of systematic traffic transmission, users acquire part of their traffic through D2D communications based on the cached contents. The proposed RL algorithm enables users' devices and the SBS to prefetch the optimal files while learning, thereby reducing the energy cost significantly. Simulation results demonstrate the superior energy-saving performance of the proposed scheme.
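To make the user-side idea concrete, the following is a minimal sketch of tabular Q-learning for a prefetching decision. It is not the authors' algorithm: the abstract gives no state, action, or reward definitions, so everything here is an assumption for illustration. The state is taken to be the last requested file, the action is the single file to prefetch, the reward is 1 on a cache hit (a stand-in for saved transmission energy), and the request process is a hypothetical 3-file Markov chain named TRANSITION.

```python
import random


def train_q_cache(transition, steps=20000, alpha=0.1, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning for a one-slot prefetch cache (illustrative only).

    transition[s][s2] is the assumed probability that file s2 is requested
    right after file s. All hyperparameters are hypothetical defaults.
    """
    rng = random.Random(seed)
    n = len(transition)
    q = [[0.0] * n for _ in range(n)]  # q[state][action]
    state = 0
    for _ in range(steps):
        # Epsilon-greedy choice of which file to prefetch.
        if rng.random() < epsilon:
            action = rng.randrange(n)
        else:
            action = max(range(n), key=lambda a: q[state][a])
        # The user's next request follows the assumed Markov chain.
        next_state = rng.choices(range(n), weights=transition[state])[0]
        # Reward 1 on a cache hit, 0 on a miss (proxy for energy saved).
        reward = 1.0 if action == next_state else 0.0
        # Standard Q-learning update.
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


# Hypothetical request pattern: file 0 is usually followed by 1, 1 by 2, 2 by 0.
TRANSITION = [
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
    [0.8, 0.1, 0.1],
]

q_table = train_q_cache(TRANSITION)
# Greedy prefetch policy: for each last-requested file, the file to cache next.
policy = [max(range(len(TRANSITION)), key=lambda a: q_table[s][a])
          for s in range(len(TRANSITION))]
print(policy)
```

Under these assumptions the learned policy prefetches the most likely next request for each state. The SBS side described in the abstract would replace the table with a neural network (DQN) over a larger state space, which this sketch does not attempt.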