Neural-Attention-Based Deep Learning Architectures for Modeling Traffic Dynamics on Lane Graphs

Matthew A. Wright,Simon F. G. Ehlers,Roberto Horowitz
DOI: https://doi.org/10.1109/ITSC.2019.8917174
2019-07-15
Abstract:Deep neural networks can be powerful tools, but require careful application-specific design to ensure that the most informative relationships in the data are learnable. In this paper, we apply deep neural networks to the nonlinear spatiotemporal physics problem of vehicle traffic dynamics. We consider problems of estimating macroscopic quantities (e.g., the queue at an intersection) at a lane level. First-principles modeling at the lane scale has been a challenge due to complexities in modeling social behaviors like lane changes, and those behaviors' resultant macro-scale effects. Following domain knowledge that upstream/downstream lanes and neighboring lanes affect each others' traffic flows in distinct ways, we apply a form of neural attention that allows the neural network layers to aggregate information from different lanes in different manners. Using a microscopic traffic simulator as a testbed, we obtain results showing that an attentional neural network model can use information from nearby lanes to improve predictions, and, that explicitly encoding the lane-to-lane relationship types significantly improves performance. We also demonstrate the transfer of our learned neural network to a more complex road network, discuss how its performance degradation may be attributable to new traffic behaviors induced by increased topological complexity, and motivate learning dynamics models from many road network topologies.
Machine Learning,Systems and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to use deep neural networks (DNN) to model the nonlinear spatio - temporal dynamics of traffic flow on lane graphs. Specifically, the researchers focus on how to estimate macroscopic traffic volumes (such as the queue length at intersections) at the lane level. Traditional macroscopic traffic models based on links have difficulty dealing with complex driving behaviors (such as lane - changing), which can have a significant impact on macroscopic traffic flow. Therefore, this paper proposes a deep - learning architecture based on the neural attention mechanism to better capture the relationships between different lanes and improve prediction accuracy. ### Main Problems and Challenges 1. **Complex Social Behaviors**: Traditional lane - scale modeling is challenged because it is difficult to accurately simulate social behaviors such as lane - changing and their impact on macroscopic traffic flow. 2. **Relationships between Lanes**: The traffic flows of upstream, downstream, and adjacent lanes can influence each other in different ways, and a method that can distinguish these relationships is required. 3. **Data - Driven Models**: Due to the difficulty of modeling from first principles, researchers turn to data - driven methods to learn traffic dynamics models. ### Solutions To address the above challenges, the researchers propose the following methods: - **Neural Attention Mechanism**: By introducing multi - edge - type attentional neural network layers, the neural network can aggregate information according to different types of lane relationships (such as upstream, downstream, and adjacent lanes). - **Microscopic Traffic Simulator**: Use SUMO (Simulation of Urban MObility) as a test platform to verify the effectiveness of the proposed neural network model. - **Transfer Learning**: Demonstrate that the learned neural network model can be transferred between road networks with different levels of complexity, providing the possibility for cross - regional intelligent transportation systems. ### Experimental Results The experimental results show that the neural network based on attention modeling can significantly outperform traditional PDE models in lane - level traffic prediction. In particular, when considering the information of adjacent lanes, the performance of the model is significantly improved. In addition, the study also explores the generalization ability of the model in different road network topologies. ### Summary The main contributions of this paper are: - Proposing a deep - learning framework that can effectively handle the complex relationships between lanes. - Proving through experiments the superior performance of this framework in traffic flow prediction. - Exploring the transfer ability of the model in different road networks, providing directions for further research. Hope this summary can help you understand the core problems and solutions of this paper. If you have more questions or need further explanations, please feel free to let me know!