Abstract:Traffic prediction problem is one of the essential tasks of intelligent transportation system (ITS), alleviating traffic congestion effectively and promoting the intelligent development of urban traffic. To accommodate long-range dependencies, Transformer-based methods have been used in traffic prediction tasks due to the parallelizable processing of sequences and explanation of attention matrices compared with recurrent neural units (RNNs). However, the Transformer-based model has two limitations, on the one hand, it ignores the local correlation in the traffic state in its parallel processing of the sequence, on the other hand, the absolute positional embedding is adopted to represent the positional relationship of time nodes is destroyed when it comes to calculate attention score. To address two embarrassing shortcomings, a novel framework called RPConvformer is proposed, where the improved parts are 1D causal convolutional sequence embedding and relative position encoding. In sequence embedding, we develop a sequence embedding layer composed of convolutional units, which consist of origin 1D convolutional and 1D causal convolutional. The size of the receptive field of the convolution can focus on the local region correlation of the sequence. In relative position encoding, we introduce a bias vector to automatically learn the relative position information of time nodes when linearly mapping the feature tensor. We respect the encoding and decoding framework of the Transformer, the encoder is responsible for extracting historical traffic state information, and the decoder autoregressively predicts the future traffic state. The multi-head attention mechanism is adopted by both encoder and decoder aims to focus on rich temporal feature patterns. Moreover, key mask technique is used after computing attention matrix to mask the traffic state at missing moments improving the resilience of the model. Extensive experiments on two real-world traffic flow datasets. The results show that RPConvformer achieves the best performance compared to state-of-the-art time series models. Ablation experiments show that considering the local correlation of time series has a higher gain on prediction performance. Random mask experiments show that the model is robust when the historical data is less than 10% missing. In addition, multi-head attention matrix provides further explanation for the dependence between time nodes. RPConvformer as an improved Transformer-based model can provide new ideas for molding temporal dimension in traffic prediction tasks. Our code has been open-sourced at (https://github.com/YanJieWen/RPConvformer).

A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks

Short-Term Speed Forecasting of Large-Scale Urban Road Network Based on Transformer

Learning dynamic and hierarchical traffic spatiotemporal features with Transformer

Spatial–Temporal Transformer Networks for Traffic Flow Forecasting Using a Pre-Trained Language Model

Adaptive Graph Spatial-Temporal Transformer Network for Traffic Flow Forecasting

Traffic Transformer: Transformer-based framework for temporal traffic accident prediction

A spatial‐temporal graph gated transformer for traffic forecasting

A Multi-Layer Model Based on Transformer and Deep Learning for Traffic Flow Prediction

A Multi-Channel Spatial-Temporal Transformer Model for Traffic Flow Forecasting

Spatio-Temporal Parallel Transformer based model for Traffic Prediction

Traffic Flow Forecasting Based on Transformer with Diffusion Graph Attention Network

Attention-based spatial-temporal graph transformer for traffic flow forecasting

EGFormer: An Enhanced Transformer Model with Efficient Attention Mechanism for Traffic Flow Forecasting

Traffic transformer: Capturing the continuity and periodicity of time series for traffic forecasting

Traffic flow prediction based on transformer

Transformer network with decoupled spatial–temporal embedding for traffic flow forecasting

RPConvformer: A novel Transformer-based deep neural networks for traffic flow prediction

STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting

CCDSReFormer: Traffic Flow Prediction with a Criss-Crossed Dual-Stream Enhanced Rectified Transformer Model

Short-term passenger flow prediction for multi-traffic modes: A residual network and Transformer based multi-task learning method

ADCT-Net: Adaptive traffic forecasting neural network via dual-graphic cross-fused transformer