Deep reinforcement learning based cooperative control of traffic signal for multi‐intersection network in intelligent transportation system using edge computing

Ananya Paul,Sulata Mitra
DOI: https://doi.org/10.1002/ett.4588
IF: 3.6
2022-07-20
Transactions on Emerging Telecommunications Technologies
Abstract:The primary purpose is to manage multiple traffic signals in a road network in order to mitigate traffic congestion. To accomplish this, two deep reinforcement learning‐based approaches, advantage actor‐critic and proximal policy optimization, are incorporated with edge computing and intelligent transportation system. Moreover, the proposed agent explores various deep neural networks, including the state‐of‐the‐art transformer network. In the current era, the coordination of traffic flow is hindered by the discrepancy between road infrastructure and the number of vehicles which leads to traffic congestion. One of the widely used strategies to mitigate traffic congestion is to control traffic signals with the help of deep reinforcement learning (DRL) in edge computing based intelligent transportation system. This article provides a comprehensive analysis of the most recent DRL algorithms, advantage actor‐critic and proximal policy optimization in multiple deep neural networks (DNNs), including a state‐of‐the‐art transformer model for effective traffic signal management. Here, a single DRL agent is used, which obtains the spatio‐temporal information of the traffic to identify traffic patterns from complex intersection environments. The agent uses this information as the input to the DNNs and then applies the algorithms to retrieve the essential parameters of DNN to seek an optimal action selection policy to mitigate congestion. Different real‐time maps and small city networks are explored here to determine which DNN is best suited for traffic congestion management. The simulation study reveals that both the algorithms significantly outperform the baseline. The transformer model gives the best result when compared to other DNNs. The transformer model decreases average waiting time by 96.16%, implying that it has a higher capability of dealing with congested environments.
telecommunications
What problem does this paper attempt to address?