Deep Reinforcement Learning-Based Pricing Strategy in Double-Auction Market for Edge Computing Resource Allocation

Song Jiang, Weidong Li, Qian Su, Xuejie Zhang
DOI: https://doi.org/10.1109/CNCIT56797.2022.00017
2022-01-01
Abstract: Due to the rapid increase in Internet of Things (IoT) terminals, the continuous double auction (CDA) has been applied to allocate edge server resources reasonably. Existing research focuses on the matching mechanism of the auction and on the convergence of the game when all agents adopt the same strategy, while neglecting the question of which pricing strategy is advantageous for agents when multiple strategies are applied simultaneously. In this paper, we establish a CDA market in which IoT terminals act as buyers, edge servers act as sellers, and cloud servers perform market clearing. All agents in the market seek to maximize their own revenue by competing against the others. Zero intelligence (ZI), experience-weighted attraction (EWA), deep Q-learning (DQN), and deep deterministic policy gradient (DDPG) are deployed for both buyers and sellers to generate bids and asks. Because real-time global information is inaccessible, historical trading information of the market is used as a substitute. The cases of unpredictable requests and fixed requests are discussed separately. Experimental results demonstrate that the game converges to equilibrium in both environments and that the deep reinforcement learning-based strategies outperform the classical algorithms. In particular, the DDPG-based strategy obtains a superior response faster than the DQN-based strategy.
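To make the market setting concrete, the following is a minimal sketch of a continuous double auction with zero-intelligence traders. It is not the authors' implementation: the abstract does not specify the matching rule, so the midpoint trade price, the budget-constrained ZI variant, the single-unit demand per agent, and all names (`Order`, `zi_bid`, `zi_ask`, `match`, `run_cda`, `price_cap`) are assumptions made for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Order:
    agent_id: int
    price: float


def zi_bid(valuation: float, rng: random.Random) -> float:
    """Constrained ZI buyer (assumed variant): random bid never above its valuation."""
    return rng.uniform(0.0, valuation)


def zi_ask(cost: float, price_cap: float, rng: random.Random) -> float:
    """Constrained ZI seller (assumed variant): random ask never below its cost.

    Assumes price_cap >= cost.
    """
    return rng.uniform(cost, price_cap)


def match(best_bid: Optional[Order], best_ask: Optional[Order]) -> Optional[float]:
    """Clear a trade when the best bid meets the best ask.

    The midpoint price is an assumption; the abstract does not state the pricing rule.
    """
    if best_bid is None or best_ask is None:
        return None
    if best_bid.price >= best_ask.price:
        return (best_bid.price + best_ask.price) / 2
    return None


def run_cda(valuations: List[float], costs: List[float], price_cap: float,
            rounds: int, seed: int = 0) -> List[Tuple[int, int, float]]:
    """Run a toy CDA: buyers stand in for IoT terminals, sellers for edge servers.

    Each agent trades at most one unit. Returns (buyer_id, seller_id, price) tuples.
    """
    rng = random.Random(seed)
    buyers = dict(enumerate(valuations))   # active buyers: id -> valuation
    sellers = dict(enumerate(costs))       # active sellers: id -> cost
    best_bid: Optional[Order] = None
    best_ask: Optional[Order] = None
    trades: List[Tuple[int, int, float]] = []

    for _ in range(rounds):
        if not buyers or not sellers:
            break
        # One randomly chosen agent submits a quote per step.
        if rng.random() < 0.5:
            i = rng.choice(list(buyers))
            order = Order(i, zi_bid(buyers[i], rng))
            if best_bid is None or order.price > best_bid.price:
                best_bid = order           # only an improving bid replaces the book
        else:
            j = rng.choice(list(sellers))
            order = Order(j, zi_ask(sellers[j], price_cap, rng))
            if best_ask is None or order.price < best_ask.price:
                best_ask = order           # only an improving ask replaces the book
        price = match(best_bid, best_ask)
        if price is not None:
            trades.append((best_bid.agent_id, best_ask.agent_id, price))
            del buyers[best_bid.agent_id]  # matched agents leave the market
            del sellers[best_ask.agent_id]
            best_bid = best_ask = None     # reset the book after clearing
    return trades
```

In this sketch, a learning-based strategy such as DQN or DDPG would replace `zi_bid` and `zi_ask` with a policy that maps historical trading information to a quote; the clearing logic in `match` would stay unchanged.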