Abstract:In urban computing, precise and swift forecasting of multivariate time series data from traffic networks is crucial. This data incorporates additional spatial contexts such as sensor placements and road network layouts, and exhibits complex temporal patterns that amplify challenges for predictive learning in traffic management, smart mobility demand, and urban planning. Consequently, there is an increasing need to forecast traffic flow across broader geographic regions and for higher temporal coverage. However, current research encounters limitations because of the inherent inefficiency of model and their unsuitability for large-scale traffic network applications due to model complexity. This paper proposes a novel framework, named PreMixer, designed to bridge this gap. It features a predictive model and a pre-training mechanism, both based on the principles of Multi-Layer Perceptrons (MLP). The PreMixer comprehensively consider temporal dependencies of traffic patterns in different time windows and processes the spatial dynamics as well. Additionally, we integrate spatio-temporal positional encoding to manage spatiotemporal heterogeneity without relying on predefined graphs. Furthermore, our innovative pre-training model uses a simple patch-wise MLP to conduct masked time series modeling, learning from long-term historical data segmented into patches to generate enriched contextual representations. This approach enhances the downstream forecasting model without incurring significant time consumption or computational resource demands owing to improved learning efficiency and data handling flexibility. Our framework achieves comparable state-of-the-art performance while maintaining high computational efficiency, as verified by extensive experiments on large-scale traffic datasets.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: in a large - scale transportation network, how to achieve accurate and efficient multi - variable time - series data prediction. Specifically, the paper focuses on the traffic flow prediction problem in urban computing, especially how to handle complex time patterns and spatial dynamics under a wider geographical area and higher time coverage while ensuring the efficiency and scalability of the model. ### Problem Background 1. **Complex Time Patterns and Spatial Dynamics** - Traffic data contains not only complex patterns in the time dimension (such as periodicity and trends) but also dynamic changes in the spatial dimension (such as sensor locations and road network layouts). These factors make the traffic prediction task more challenging. - The non - stationarity and complexity in large - scale transportation systems lead to complex long - term patterns in spatio - temporal data, such as periodicity and trends. 2. **Limitations of Existing Models** - Existing traffic prediction models, especially those based on graph neural networks (STGNNs) and Transformer methods, encounter problems of efficiency and scalability when dealing with large - scale transportation networks. These models usually require a large amount of computing resources, and as the number of nodes and the time coverage increase, the model complexity also rises sharply. - Directly inputting long - term spatio - temporal data into these models will result in overly long training and inference times, and optimizing the model also becomes more difficult. 3. **Lack of Effective Utilization of Long - Term Features** - Although some pre - trained models can enhance the performance of downstream tasks, they usually rely on complex architectures (such as Transformer), which increases the demand for time and computing resources, especially when deployed on large - scale transportation networks. - Existing methods often overlook features within a long - time span, which limits the model's ability to learn long - term patterns, thus affecting the prediction performance. ### Solution To solve the above problems, the paper proposes a new framework named PreMixer. The main features of this framework include: 1. **MLP - Based Prediction Model** - PreMixer uses MLP - Mixer as the basic architecture and captures the time and spatial information of the input data through interleaved MLP layers. This architecture is simple and efficient and can handle large - scale traffic data. 2. **Spatio - Temporal Position Encoding (STPE)** - Spatio - temporal position encoding is introduced to encode time and spatial position information simultaneously without relying on a predefined graph structure. This helps the model obtain additional context information while significantly reducing the computational complexity. 3. **MLP - Based Pre - trained Model (PIEncoder)** - A simple MLP - based pre - trained model PIEncoder is designed to learn useful representations from long - term historical data. This model improves the learning efficiency and reduces the demand for computing resources by splitting the time - series data into multiple segments and independently embedding each segment. - PIEncoder adopts a masked auto - encoding strategy and generates context - rich representations by reconstructing the masked segments, enhancing the ability of the downstream prediction model. 4. **Contrastive Learning** - Complementary Contrastive Learning (CL) is utilized to further enhance the segment - level time - series representations. By generating positive sample pairs of different views, CL can effectively capture time - dependencies and dynamic changes, improving the generalization and discrimination ability of the model. ### Summary By introducing the PreMixer framework, the paper aims to solve the efficiency and scalability problems in large - scale traffic prediction, while fully utilizing long - term spatio - temporal features to improve the prediction accuracy. The experimental results show that PreMixer achieves performance comparable to or even better than existing advanced methods on multiple large - scale traffic datasets while maintaining high computational efficiency.

PreMixer: MLP-Based Pre-training Enhanced MLP-Mixers for Large-scale Traffic Forecasting

Spatial-Temporal Graph Multi-Gate Mixture-of-Expert Model for Traffic Prediction

MLPST: MLP is All You Need for Spatio-Temporal Prediction

PreSTNet: Pre-trained Spatio-Temporal Network for traffic forecasting

Spatio-temporal Hierarchical MLP Network for Traffic Forecasting

Fusion Matrix Prompt Enhanced Self-Attention Spatial-Temporal Interactive Traffic Forecasting Framework

Traffic Flow and Speed Forecasting Through a Bayesian Deep Multi-Linear Relationship Network.

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

ModWaveMLP: MLP-Based Mode Decomposition and Wavelet Denoising Model to Defeat Complex Structures in Traffic Forecasting

TSMixer: An all-MLP Architecture for Time Series Forecasting

ODMixer: Fine-grained Spatial-temporal MLP for Metro Origin-Destination Prediction

TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting

ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow Forecasting

An Effective Dynamic Spatio-temporal Framework with Multi-Source Information for Traffic Prediction

RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data

Multi-scale feature enhanced spatio-temporal learning for traffic flow forecasting

A Hybrid Deep Learning Model for Short-Term Traffic Flow Pre-Diction Considering Spatiotemporal Features

A Bidirectional Context-Aware and Multi-Scale Fusion Hybrid Network for Short-Term Traffic Flow Prediction

A Multifeature Fusion Short-Term Traffic Flow Prediction Model Based on Deep Learnings

MmgFra: A multiscale multigraph learning framework for traffic prediction in smart cities

Phased Deep Spatio-temporal Learning for Highway Traffic Volume Prediction