Abstract:Traffic signal control (TSC) plays a crucial role in enhancing traffic capacity. In recent years, researchers have demonstrated improved performance by utilizing deep reinforcement learning (DRL) for optimizing TSC. However, existing DRL frameworks predominantly rely on manually crafted states, actions, and reward designs, which limit direct information exchange between the DRL agent and the environment. To overcome this challenge, we propose a novel design method that maintains consistency among states, actions, and rewards, named uniformity state-action-reward (USAR) method for TSC. The USAR method relies on: 1) Updating the action selection for the next time step using a formula based on the state perceived by the agent at the current time step, thereby encouraging rapid convergence to the optimal strategy from state perception to action; and 2) integrating the state representation with the reward function design, allowing for precise assessment of the efficacy of past action strategies based on the received feedback rewards. The consistency-preserving design method jointly optimizes the TSC strategy through the updates and feedback among the Markov elements. Furthermore, the method proposed in this paper employs a residual block into the DRL model. It introduces an additional pathway between the input and output layers to transfer feature information, thus promoting the flow of information across different network layers. To assess the effectiveness of our approach, we conducted a series of simulation experiments using the simulation of urban mobility. The USAR method, incorporating a residual block, outperformed other methods and exhibited the best performance in several evaluation metrics.

An Attention Reinforcement Learning–Based Strategy for Large-Scale Adaptive Traffic Signal Control System

Uniformity of Markov Elements in Deep Reinforcement Learning for Traffic Signal Control

A Deep Reinforcement Learning Approach to Traffic Signal Control With Temporal Traffic Pattern Mining

Network Clustering-Based Multi-Agent Reinforcement Learning for Large-Scale Traffic Signal Control

Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control

A multi‐agent deep reinforcement learning approach for traffic signal coordination

A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization

A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control

Learning in practice: reinforcement learning-based traffic signal control augmented with actuated control

A Reinforcement Learning Approach for Intelligent Traffic Signal Control at Urban Intersections

Adaptive urban traffic signal control based on enhanced deep reinforcement learning

Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network

Multi-objective deep reinforcement learning approach for adaptive traffic signal control system with concurrent optimization of safety, efficiency, and decarbonization at intersections

A multi-agent reinforcement learning based approach for intelligent traffic signal control

Cooperative traffic signal control through a counterfactual multi-agent deep actor critic approach

A Novel Multi-Agent Deep RL Approach for Traffic Signal Control

Distributed agent-based deep reinforcement learning for large scale traffic signal control

Batch-Augmented Multi-Agent Reinforcement Learning for Efficient Traffic Signal Optimization

An Efficient Deep Reinforcement Learning Model for Urban Traffic Control

Deep Reinforcement Learning for Adaptive Traffic Signal Control