Traffic Flow Prediction Based on Hybrid Deep Learning Models Considering Missing Data and Multiple Factors

Wenbao Zeng,Ketong Wang,Jianghua Zhou,Rongjun Cheng
DOI: https://doi.org/10.3390/su151411092
IF: 3.9
2023-07-17
Sustainability
Abstract:In the case of missing data, traffic forecasting becomes challenging. Many existing studies on traffic flow forecasting with missing data often overlook the relationship between data imputation and external factors. To address this gap, this study proposes two hybrid models that incorporate multiple factors for predicting traffic flow in scenarios involving data loss. Temperature, rainfall intensity and whether it is a weekday will be introduced as multiple factors for data imputation and forecasting. Predictive mean matching (PMM) and K-nearest neighbor (KNN) can find the data that are most similar to the missing values as the interpolation value. In the forecasting module, bidirectional long short-term memory (BiLSTM) network can extract bidirectional time series features, which can improve forecasting accuracy. Therefore, PMM and KNN were combined with BiLSTM as P-BiLSTM and K-BiLSTM to forecast traffic flow, respectively. Experiments were conducted using a traffic flow dataset from the expressway S6 in Poland, considering various missing scenarios and missing rates. The experimental results showed that the proposed models outperform other traditional models in terms of prediction accuracy. Furthermore, the consideration of whether it is a working day further improves the predictive performance of the models.
environmental sciences,environmental studies,green & sustainable science & technology
What problem does this paper attempt to address?