Dynamic Pricing for Smart Mobile Edge Computing: A Reinforcement Learning Approach

Shiyu Chen,Lingxiang Li,Zhi Chen,Shaoqian Li
DOI: https://doi.org/10.1109/lwc.2020.3039863
IF: 6.3
2021-04-01
IEEE Wireless Communications Letters
Abstract:This letter studies the revenue maximization problem for the mobile edge computing (MEC) system, where an access point (AP) is equipped with an MEC server, providing job offloading service for multiple resource-hungry users and charging users a service fee for it. Usually, the information about users' personal demand is unknown and users' job arrival rate is time-varying, which make pricing highly challenging. As such, we develop a policy gradient (PG)-based reinforcement learning (RL) algorithm. In specific, a deep neural network (DNN) is adopted as the policy network to design price policy, and a baseline neural network (BNN) is used to reduce the inherent high variance of the gradient obtained using PG. The proposed PG-based algorithm enables continuous pricing, thus constituting an advancement over the conventional Q-learning algorithm that has provided only discrete action space. Simulation results show that our proposed method converges to the optimal revenue performance, while the Q-learning algorithm suffers 44% revenue loss.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?