Abstract:With the development of the IoT (Internet of Things), sensors networks can bring a large amount of valuable data. In addition to be utilized in the local IoT applications, the data can also be traded in the connected edge servers. As an efficient resource allocation mechanism, the double auction has been widely used in the stock and futures markets and can be also applied in the data resource allocation in sensor networks. Currently, there usually exist multiple edge servers running double auctions competing with each other to attract data users (buyers) and producers (sellers). Therefore, the double auction market run on each edge server needs efficient mechanism to improve the allocation efficiency. Specifically, the pricing strategy of the double auction plays an important role on affecting traders’ profit, and thus, will affect the traders’ market choices and bidding strategies, which in turn affect the competition result of double auction markets. In addition, the traders’ trading strategies will also affect the market’s pricing strategy. Therefore, we need to analyze the double auction markets’ pricing strategy and traders’ trading strategies. Specifically, we use a deep reinforcement learning algorithm combined with mean field theory to solve this problem with a huge state and action space. For trading strategies, we use the Independent Parametrized Deep Q-Network (I-PDQN) algorithm combined with mean field theory to compute the Nash equilibrium strategies. We then compare it with the fictitious play (FP) algorithm. The experimental results show that the computation speed of I-PDQN algorithm is significantly faster than that of FP algorithm. For pricing strategies, the double auction markets will dynamically adjust the pricing strategy according to traders’ trading strategies. This is a sequential decision-making process involving multiple agents. Therefore, we model it as a Markov game. We adopt Multiagent Deep Deterministic Policy Gradient (MADDPG) algorithm to analyze the Nash equilibrium pricing strategies. The experimental results show that the MADDPG algorithm solves the problem faster than the FP algorithm.

Deep Reinforcement Learning-Based Pricing Strategy in Double-Auction Market for Edge Computing Resource Allocation

Trading and Pricing Sensor Data in Competing Edge Servers with Double Auction Markets

Deep Reinforcement Learning for Strategic Bidding in Electricity Markets

Reinforcement-Learning- and Belief-Learning-Based Double Auction Mechanism for Edge Computing Resource Allocation

Dynamic Pricing Scheme for Edge Computing Services: A Two-layer Reinforcement Learning Approach

Continuous Double Auction Mechanism and Bidding Strategies in Cloud Computing Markets

Deep Reinforcement Learning-Based Trading Strategy for Load Aggregators on Price-Responsive Demand

Continuous Double Auction For Cloud Market: Pricing And Bidding Analysis

Deep Reinforcement Learning Based Resource Allocation Strategy in Cloud-Edge Computing System

Optimal Auction For Edge Computing Resource Management in Mobile Blockchain Networks: A Deep Learning Approach

Dynamic Pricing for Smart Mobile Edge Computing: A Reinforcement Learning Approach

A Deep Reinforcement Learning Bidding Algorithm on Electricity Market

Multi-user Resource Control with Deep Reinforcement Learning in IoT Edge Computing

Collaborative Edge Computing and Caching with Deep Reinforcement Learning Decision Agents

Distributed Dynamic Pricing Strategy Based on Deep Reinforcement Learning Approach in a Presale Mechanism

Deep reinforcement learning-based optimal bidding strategy for real-time multi-participant electricity market with short-term load

Deep Reinforcement Learning-Based Resource Allocation for Content Distribution in IoT-Edge-Cloud Computing Environments

DPS: Dynamic Pricing and Scheduling for Distributed Machine Learning Jobs in Edge-Cloud Networks

Multi-unit Double Auctions: Equilibrium Analysis and Bidding Strategy using DDPG in Smart-grids

A Knowledge-based Continuous Double Auction Model for Cloud Market

Computing Resource Allocation Strategy Based on Cloud-Edge Cluster Collaboration in Internet of Vehicles