Transfer Learning with Graph Neural Networks for Short-Term Highway Traffic Forecasting

Tanwi Mallick,Prasanna Balaprakash,Eric Rask,Jane Macfarlane
DOI: https://doi.org/10.48550/arXiv.2004.08038
2020-04-20
Abstract:Highway traffic modeling and forecasting approaches are critical for intelligent transportation systems. Recently, deep-learning-based traffic forecasting methods have emerged as state of the art for a wide range of traffic forecasting tasks. However, these methods require a large amount of training data, which needs to be collected over a significant period of time. This can present a number of challenges for the development and deployment of data-driven learning methods for highway networks that suffer from lack of historical data. A promising approach to address this issue is transfer learning, where a model trained on one part of the highway network can be adapted for a different part of the highway network. We focus on diffusion convolutional recurrent neural network (DCRNN), a state-of-the-art graph neural network for highway network forecasting. It models the complex spatial and temporal dynamics of the highway network using a graph-based diffusion convolution operation within a recurrent neural network. DCRNN cannot perform transfer learning, however, because it learns location-specific traffic patterns, which cannot be used for unseen regions of the network. To that end, we develop a new transfer learning approach for DCRNN, where a single model trained on data-rich regions of the highway network can be used to forecast traffic on unseen regions of the highway network. We evaluate the ability of our approach to forecast the traffic on the entire California highway network with one year of time series data. We show that TL-DCRNN can learn from several regions of the California highway network and forecast the traffic on the unseen regions of the network with high accuracy. Moreover, we demonstrate that TL-DCRNN can learn from San Francisco region traffic data and can forecast traffic on the Los Angeles region and vice versa.
Machine Learning
What problem does this paper attempt to address?
This paper attempts to solve the problem of data scarcity encountered in highway traffic flow prediction, especially in new or data - insufficient areas. Specifically, deep - learning methods perform well in traffic prediction tasks, but they require a large amount of historical data for training, and such data are often difficult to obtain or accumulate. To overcome this challenge, the author proposes a transfer - learning method based on Graph Neural Networks (GNNs) to achieve cross - regional traffic flow prediction. ### Main problems 1. **Data scarcity problem**: Many regions lack sufficient historical traffic data, which makes it difficult for traditional deep - learning models to be directly applied. 2. **Limitations of existing models**: The existing Diffusion Convolutional Recurrent Neural Network (DCRNN) model, although performing well in traffic prediction, learns spatio - temporal patterns at specific locations and cannot be directly applied to unseen regions. ### Solutions To solve the above problems, the author has developed a new transfer - learning method - TL - DCRNN (Transfer Learning DCRNN), with the following main features: 1. **Graph partitioning and sub - graph training**: Divide the entire highway network into multiple sub - graphs and train a single encoder - decoder model on these sub - graphs. In this way, the model can learn more general traffic dynamic patterns rather than patterns specific to a certain location. 2. **Transfer - learning ability**: Train the model in data - rich regions and then apply it to new data - scarce regions, thereby improving prediction accuracy. 3. **Experimental verification**: Extensive experiments were carried out on the real - world data set of the California highway network to verify the effectiveness of TL - DCRNN. The results show that TL - DCRNN can perform effective transfer - learning between different regions and significantly improve the prediction accuracy in unseen regions. ### Mathematical formula representation In TL - DCRNN, the diffusion convolution operation is defined as: \[ W \star_G X = \sum_{d = 0}^{K - 1} \left( W_O (D_O^{-1} A)^d+W_I (D_I^{-1} A)^d \right) X \] where: - \( K \) is the maximum number of diffusion steps, - \( D_O^{-1} \) and \( D_I^{-1} \) are the inverse matrices of the diagonal matrices of out - degree and in - degree respectively, - \( A \) is the weighted adjacency matrix, - \( W_O \) and \( W_I \) are the learning filters for the bidirectional diffusion process. By changing the graph structure \( A \) in the training process and its corresponding time - series data, the diffusion convolution filter becomes more general, thereby improving the generalization ability of the model. ### Summary The main contribution of this paper is to propose a transfer - learning method based on graph neural networks, which solves the problem of data scarcity in traffic prediction and enables the model to perform accurate short - term traffic prediction in new data - insufficient regions.