Deep Reinforcement Learning (DRL)-Based Device-to-Device (D2D) Caching With Blockchain and Mobile Edge Computing

Ran Zhang,F. Richard Yu,Jiang Liu,Tao Huang,Yunjie Liu
DOI: https://doi.org/10.1109/TWC.2020.3003454
IF: 10.4
2020-01-01
IEEE Transactions on Wireless Communications
Abstract:Device-to-Device (D2D) caching assists Mobile Edge Computing (MEC) based caching in offloading inter-domain traffic by sharing cached items with nearby users, while its performance relies heavily on caching nodes' sharing willingness. In this paper, a Blockchain-based Cache and Delivery Market (CDM) is proposed as an incentive mechanism for the distributed caching system. Under given incentive mechanisms, both D2D and MEC caching nodes' willingness is guaranteed by satisfying their expected reward for cache sharing. Besides, for the distributed CDM, content delivery related transactions are executed by smart contracts. To achieve consensus on transactions and prevent frauds, a consensus protocol among the smart contract execution nodes (SCENE) is necessary. To minimize the latency of reaching consensus while guaranteeing its confidence level, we propose partial Practical Byzantine Fault Tolerance (pPBFT) protocol. Further, the model of cache sharing and transaction execution consensus is proposed, and we further formulate caching placement and SCENE selection as Markov Decision Process problems. Due to the complexity and dynamics of the problems, a deep reinforcement learning approach is adopted to solve the problem. The simulation results show that the proposed schemes outperform conventional solutions in terms of traffic offloading, content retrieval latency, and consensus latency.
What problem does this paper attempt to address?