Abstract:In order to achieve the target of carbon peak and carbon neutrality, electric vehicles (EVs) have increasingly received a prominent interest to electrify the transportation sector due to their advantages of mobility and flexibility on handling complicated transportation and power networks. However, it is still challenging to realize the significant potential of EVs towards an emerging low-carbon transition. Previous works have focused on vehicle-to-grid (V2G) technology that allows for an increased utilization of EVs to make arbitrage by the temporal differentials of electricity prices. Nevertheless, the economic potential of EVs flexibility may not be fully exploited lacking an appropriate business model. This paper addresses this challenge by developing a coupled power-transportation network for cooperative EVs to optimize the provision of multiple inter-dependent services, including charging service, demand management service, carbon intensity service, and balancing service. In order to unlock this value, the EVs operation problem has already been tackled using model-based optimization approaches, which may raise privacy issues since the requirement for global information and also can be time consuming due to the high variability of transportation and power networks. In this paper, we propose a model-free hierarchical and hybrid multi-agent reinforcement learning method to learn the routing and scheduling decisions of EVs in a coupled power-transportation network with the objective of optimizing multi-service provisions. To this end, EVs do not reply on any knowledge of the simulated environment and are capable of handling system uncertainties via the learning process. Extensive case studies based on a 15-bus radial power distribution network and a 9-node 12-edge transportation network are developed to show that the proposed method outperforms the conventional learning algorithms in terms of policy quality and convergence speed. Finally, the generalizability and scalability are also investigated for different environment circumstances and EV numbers.

Multi-Agent Cooperation Based Reduced-Dimension Q(λ) Learning for Optimal Carbon-Energy Combined-Flow

Balancing Operation Cost and Carbon Emissions in Multi-Energy Microgrid Scheduling Using Joint Operator Evolutionary Algorithm

Multi-service Provision for Electric Vehicles in Power-Transportation Networks Towards a Low-Carbon Transition: A Hierarchical and Hybrid Multi-Agent Reinforcement Learning Approach

Multi-Agent Q-Value Mixing Network with Covariance Matrix Adaptation Strategy for the Voltage Regulation Problem

Optimal Scheduling of Integrated Energy Systems with Multiple CCHPs for High Efficiency and Low Emissions.

Deep reinforcement learning based power system optimal carbon emission flow

Boosting Communication Efficiency in Federated Learning for Multiagent-Based Multimicrogrid Energy Management

QoS Optimization for Mobile Ad Hoc Cloud: A Multi-Agent Independent Learning Approach

Carbon-Aware Optimal Power Flow

Multi-dimensional energy management based on an optimal power flow model using an improved quasi-reflection jellyfish optimization algorithm

Carbon emission flow oriented multitasking multi‐objective optimization of electricity‐hydrogen integrated energy system

Multi-agent energy management optimization for integrated energy systems under the energy and carbon co-trading market

CL-ADMM: A Cooperative-Learning-Based Optimization Framework for Resource Management in MEC

Multi-Objective Optimal Power Flow Calculation Considering Carbon Emission Intensity

A Multi-Agent Deep Constrained Q-Learning Method for Smart Building Energy Management Under Uncertainties

Cooperative operation of multiple low-carbon microgrids: An optimization study addressing gaming fraud and multiple uncertainties

Large-scale deep reinforcement learning method for energy management of power supply units considering regulation mileage payment

Multi-Objective Reinforcement Learning based Multi-Microgrid System Optimisation Problem

Joint optimization of multi-dimensional resource allocation and task offloading for QoE enhancement in Cloud-Edge-End collaboration

Multi-objective Cooperative QEA for Low-Carbon Time Dependent Vehicle Routing Problem with Simultaneous Delivery and Pickup

A New Multi-Objective Unit Commitment Model Solved by Decomposition-Coordination