Abstract:Multi-building multi-energy virtual power plants (MB-ME-VPPs) show great promise for the aggregation and coordination of distributed flexible resources across multiple integrated energy buildings to participate in electricity markets. However, a significant challenge arises when managing the energy of MB-ME-VPPs, especially since buildings can dynamically join or depart during the aggregation phase. Traditional model- based optimization methods face difficulties in obtaining accurate mathematical models of individual buildings, and may also raise privacy concerns. In contrast, model-free multi-agent reinforcement learning (MARL) methods offer a promising alternative by allowing agents to learn their control policies through interactions with their environments. Nevertheless, conventional MARL methods are normally applied in static multi- agent environments, where the number and identity of agents remain fixed and predetermined. Consequently, these conventional MARL methods lack the ability to adapt to the dynamic behaviors of agents joining or leaving the environment. To this end, this paper proposes a novel approach named MAT-Adapt, embedding the multi-agent transformer with a parallel adapter module, to address the dynamic participation issue in MB-ME-VPP energy management. Firstly, it formulates the coordination of building agents as a sequential modeling process and leverages the representational capabilities of the attention mechanism from the multi- agent transformer technique. Secondly, it introduces a parallel adapter module called AdaptMLP to enhance adaptability during the dynamic participation phase, efficiently reducing the need for extensive fine-tuning of model parameters. Simulations on the IEEE 33-bus distributional electricity market with 3 to 9 multi- energy buildings show the superior performance of our proposed MAT-Adapt method in facilitating efficient coordination of dynamically participating buildings within the context of the MB-ME-VPP. In comparison to the conventional MADDPG and MAPPO methods training from scratch, the proposed MAT-Adapt method demonstrates its superior adaptability, achieving 0.75-0.91 normalized rewards in new state conditions within 5% of training episodes, while MADDPG and MAPPO can only reach 0.11-0.43 within the same timeframe. Furthermore, the proposed MAT-Adapt method exhibits its strong generalization performance by evaluating the dynamic participation of various building types and regions.

Multi-agent Battery Storage Management using MPC-based Reinforcement Learning

Secure Energy Management of Multi-Energy Microgrid: A Physical-Informed Safe Reinforcement Learning Approach

Adaptive Multi-Agent Reinforcement Learning for Flexible Resource Management in a Virtual Power Plant with Dynamic Participating Multi-Energy Buildings

Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning

Should we use model-free or model-based control? A case study of battery management systems

Stochastic model predictive control for energy management of power-split plug-in hybrid electric vehicles based on reinforcement learning

Adaptively Constrained Stochastic Model Predictive Control for the Optimal Dispatch of Microgrid

Stochastic Model Predictive Control Based Scheduling Optimization of Multi-Energy System Considering Hybrid CHPs and EVs

Learning Model Predictive Control Parameters via Bayesian Optimization for Battery Fast Charging

Learning-Based Model Predictive Control of DC-DC Buck Converters in DC Microgrids: A Multi-Agent Deep Reinforcement Learning Approach

Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch.

Safe Learning-Based Optimization of Model Predictive Control: Application to Battery Fast-Charging

Multi-Agent Deep Reinforcement Learning for Voltage Control with Coordinated Active and Reactive Power Optimization

Multi-Stage Real-Time Operation of a Multi-Energy Microgrid With Electrical and Thermal Energy Storage Assets: A Data-Driven MPC-ADP Approach

Multi-agent deep reinforcement learning-based optimal energy management for grid-connected multiple energy carrier microgrids

Multi-agent hierarchical reinforcement learning for energy management

Physics-Shielded Multi-Agent Deep Reinforcement Learning for Safe Active Voltage Control with Photovoltaic/Battery Energy Storage Systems

MPC-driven optimal scheduling of grid-connected microgrid: Cost and degradation minimization with PEVs integration

Large-scale deep reinforcement learning method for energy management of power supply units considering regulation mileage payment

Machine Learning-Based Online MPC for Large-Scale Charging Infrastructure Management

Multi-Agent Reinforcement Learning for Power System Operation and Control