Abstract:Periodic double auctions (PDA) have applications in many areas such as in e-commerce, intra-day equity markets, and day-ahead energy markets in smart-grids. While the trades accomplished using PDAs are worth trillions of dollars, finding a reliable bidding strategy in such auctions is still a challenge as it requires the consideration of future auctions. A participating buyer in a PDA has to design its bidding strategy by planning for current and future auctions. Many equilibrium-based bidding strategies proposed are complex to use in real-time. In the current exposition, we propose a scale-based bidding strategy for buyers participating in PDA. We first present an equilibrium analysis for single-buyer single-seller multi-unit single-shot k-Double auctions. Specifically, we analyze the situation when a seller and a buyer trade two identical units of quantity in a double auction where both the buyer and the seller deploy a simple, scale-based bidding strategy. The equilibrium analysis becomes intractable as the number of participants increases. To be useful in more complex settings such as wholesale markets in smart-grids, we model equilibrium bidding strategy as a learning problem. We develop a deep deterministic policy gradient (DDPG) based learning strategy, DDPGBBS, for a participating agent in PDAs to suggest an action at any auction instance. DDPGBBS, which empirically follows the obtained theoretical equilibrium, is easily extendable when the number of buyers/sellers increases. We take Power Trading Agent Competition's (PowerTAC) wholesale market PDA as a testbed to evaluate our novel bidding strategy. We benchmark our DDPG based strategy against several baselines and state-of-the-art bidding strategies of the PowerTAC wholesale market PDA and demonstrate the efficacy of DDPGBBS against several benchmarked strategies.

Scaling POMDPs For Selecting Sellers in E-markets-Extended Version

Online Optimization for Real-Time Peer-to-Peer Electricity Market Mechanisms

Scalable Planning and Learning for Multiagent POMDPs: Extended Version

Monte-Carlo Search for an Equilibrium in Dec-POMDPs

Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods

Sequential Selection of Correlated Ads by POMDPs

Multi-agent Assortment Optimization in Sequential Matching Markets

A Framework for Sequential Planning in Multi-Agent Settings

Factored Online Planning in Many-Agent POMDPs

MESOB: Balancing Equilibria & Social Optimality

Harder Tasks Need More Experts: Dynamic Routing in MoE Models

Multi-unit Double Auctions: Equilibrium Analysis and Bidding Strategy using DDPG in Smart-grids

An Efficient Knowledge Transfer Solution to A Novel Smdp Formalization of A Broker'S Decision Problem

Ask and You Shall be Served: Representing and Solving Multi-agent Optimization Problems with Service Requesters and Providers

A Multi-Armed Bandit Approach for Online Expert Selection in Markov Decision Processes

Approximate Dec-POMDP Solving Using Multi-Agent A*

Modelling bidding behavior of power suppliers in electricity markets using multi-agent systems, part 1: Model and algorithm

Hindsight is Only 50/50: Unsuitability of MDP based Approximate POMDP Solvers for Multi-resolution Information Gathering

A Mixture-of-Experts Prediction Framework for Evolutionary Dynamic Multiobjective Optimization

SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents

Use of Markov Chains to Design an Agent Bidding Strategy for Continuous Double Auctions