Distributed Optimization of Regional Traffic Signals via Deep Reinforcement Learning

Tongchao Cui, Xudong Liu, Liguo Zhang
DOI: https://doi.org/10.23919/ccc52363.2021.9550100
2021-01-01
Abstract: As car ownership continues to rise, improving the traffic efficiency of road networks has become a pressing research topic. This study proposes a signal control scheme for regional intersections based on the multi-agent proximal policy optimization (MA-PPO) algorithm. The scheme combines distributed control with centralized optimization: it dynamically adjusts the output phase and its duration according to the traffic flow at each intersection in the region, with the goal of reducing the total waiting time of vehicles in the region. First, a neural network extracts intersection state information, which reduces the dimensionality and redundancy of the data. Second, reinforcement learning is used to improve the decision-making performance of the control system. Finally, the scheme is verified experimentally on the traffic simulation platform SUMO. The results show that, compared with traditional fixed-time control and a DQN-based control method, the scheme is effective and stable under different traffic flow patterns.
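To make the optimization target concrete, the following is a minimal sketch of the standard PPO clipped surrogate objective, together with a shared regional reward defined as the negative total waiting time. The abstract does not give the paper's actual loss, reward, or hyperparameters, so the function names, the reward definition, and the clip value `clip_eps=0.2` are all illustrative assumptions.

```python
import math


def ppo_clipped_objective(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch of samples.

    new_logps / old_logps: log-probabilities of the taken actions under the
    current and the data-collecting policy. clip_eps is an assumed value.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


def regional_reward(waiting_times):
    """Hypothetical shared reward: negative waiting time summed over all
    intersections in the region, so that less waiting means more reward."""
    return -sum(waiting_times)
```

In a multi-agent setting of the kind the abstract describes, each intersection agent could evaluate this objective on its own samples while all agents share the regional reward. That is one plausible reading of "distributed control and centralized optimization", not a confirmed detail of the paper.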