Abstract:Most of the existing traffic signal control strategies are hard to satisfy the real-time requirements of traffic big data analysis, knowledge reasoning and decision making for sophisticated traffic dynamics and heterogeneous intersection structures in the context of Internet of Vehicles (IoV). In this paper, we attempt to propose a cooperative multi-agent actor–critic (CMAC) deep reinforcement learning (DRL) approach with value decomposition based on edge computing architecture. The intuition behind CMAC is to decompose the global actor–critic learning tasks into several local actor–critic sub-problems with respect to each intersection. Each agent searches the local optimal decision by actor–critic network that takes the discrete state encoding about several consecutive frames of image-like traffic states as the inputs of the network. Among them, the green ratio output strategy considering multiple constraints is formulated in the output layer of the actor network, so that the continuous control of traffic signals using multi-agent DRL (MADRL) can be realized. Furthermore, a cooperative mechanism that considers contribution weight distributions of local agents to the global traffic pattern is proposed to coordinate multiple local agents to evolve toward global optimization. Especially, some parallel training tasks of CMAC with a large number of computing loads are deployed on the cloud side in the edge computing architecture to accelerate learning and reconstructing knowledge. The well-trained multi-agent model is downloaded from the cloud side into the edge side for real-time decision making of traffic network flow adaptive control. Simulation results with regard to a realistic traffic network demonstrate that the proposed CMAC approach under edge computing architecture outperforms the value-decomposition based multi-agent actor–critic (VMAC), independent multi-agent actor–critic (IMAC), and the fixed timing control (FTC) in terms of alleviating traffic congestion.

Cooperative traffic signal control through a counterfactual multi-agent deep actor critic approach

Adaptive signal control for urban traffic network gridlock

A Deep Reinforcement Learning Approach to Traffic Signal Control With Temporal Traffic Pattern Mining

Distributed Signal Control of Arterial Corridors Using Multi-Agent Deep Reinforcement Learning

A multi‐agent deep reinforcement learning approach for traffic signal coordination

Cooperative Traffic Signal Control Using a Distributed Agent-Based Deep Reinforcement Learning With Incentive Communication

Cooperative Multi-Agent Actor–critic Control of Traffic Network Flow Based on Edge Computing

Cooperative Reinforcement Learning on Traffic Signal Control

Network Clustering-Based Multi-Agent Reinforcement Learning for Large-Scale Traffic Signal Control

Two-layer Coordinated Reinforcement Learning for Traffic Signal Control in Traffic Network

Traffic Signal Timing via Parallel Reinforcement Learning

A multi-agent reinforcement learning based approach for intelligent traffic signal control

Urban Traffic Light Control Via Active Multi-Agent Communication and Supply-Demand Modeling

Joint Optimization of Traffic Signal Control and Vehicle Routing in Signalized Road Networks using Multi-Agent Deep Reinforcement Learning

Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network

A Novel Multi-Agent Deep RL Approach for Traffic Signal Control

Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control

A Multi-Agent Coordinate Model for Urban Traffic Signal Control

Multi-agent Deep Reinforcement Learning collaborative Traffic Signal Control method considering intersection heterogeneity

A Cooperative Multiagent System for Traffic Signal Control Using Game Theory and Reinforcement Learning