Abstract:Due to accelerating urbanization, the importance of solving the signal control problem increases. This paper analyzes various existing methods and suggests options for increasing the number of agents to reduce the average travel time. Experiments were carried out with 2 datasets. The results show that in some cases, the implementation of multiple agents can improve existing methods. For a fine-tuned large language model approach there is small enhancement on all metrics.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that traffic signal control (TSC) is becoming increasingly important in the context of the accelerating urbanization process. Specifically, the author analyzed various existing methods and proposed a method to increase the number of agents to reduce the average travel time. The paper experimentally verified that in some cases, the implementation of a multi - agent system can improve the existing traffic signal control methods.
### Background and Objectives of the Paper
With the rapid growth of the urban population, traffic congestion problems are becoming increasingly serious, leading to huge economic losses and environmental problems. For example, in 2014, the United States lost more than $160 billion in productivity due to traffic congestion and wasted more than 3.1 billion gallons of fuel. The cost of traffic congestion in the European Union is equivalent to 1% of the entire GDP. Therefore, efficient traffic signal control (TSC) can not only bring significant economic and environmental benefits but also improve social well - being.
### Existing Methods
Currently, there are mainly three types of methods to solve the traffic signal control problem:
1. **Traditional traffic methods**: Dynamically adjust traffic signal configurations based on real - time lane - level traffic conditions, but require a large amount of manual design and effort.
2. **Reinforcement learning (RL) methods**: Use deep neural networks (DNNs) for training. Although they perform well in various traffic scenarios, they face challenges when extended to larger - scale road networks or rare high - traffic situations.
3. **Transformer - based methods**: Use transformer models to capture spatio - temporal correlations and guide the collaboration of traffic strategies.
### Contributions of the Paper
The main contribution of this paper is to create a multi - agent system in which agents can adopt different methods, including those based on reinforcement learning (RL) and large - language models (LLMs). Specifically, the author assumes that combining RL or LLM methods can improve traffic signal control by taking advantage of the two methods. Through the majority - voting mechanism, the author aims to aggregate the decisions of different agents to achieve a more robust and effective traffic signal control system.
### Experimental Setup
The experiments were carried out on two datasets: the Jinan dataset (12 intersections) and the Hangzhou dataset (16 intersections). The experiments evaluated the following methods:
- Fixed - Time Algorithm
- Max - Pressure Algorithm
- MPLight Algorithm (based on RL)
- LLMLight Algorithm (based on LLM)
### Experimental Results
The experimental results show that for the Jinan dataset, the multi - agent system performs well in the MPLight algorithm, but has a poorer performance in the Hangzhou dataset. This is mainly because the number of intersections in the Hangzhou dataset is larger, which increases the complexity of the learning task. For the LLMLight algorithm, the method based on Llama 13b does not show significant improvement, while the method based on LightGPT shows a stable performance improvement when the number of agents is increased.
### Conclusions
In general, the multi - agent system performs well when dealing with small - scale intersection datasets, but for large - scale datasets, the performance may decline due to increased complexity. For areas with sufficient computing resources, using a multi - agent system and a fine - tuned LightGPT model can achieve the best results. If computing resources are limited, the Max - Pressure Algorithm is also a good choice.
### Future Work
The author plans to provide more experiments on different RL and LLM algorithms in future work and try to use different types of agents for comparison. In addition, it will also explore how to adjust the algorithms according to the specific needs of each city, improve data quality, and utilize advanced distributed learning methods.