PD-LL-Transformer: An Hourly PM2.5 Forecasting Method over the Yangtze River Delta Urban Agglomeration, China

Rongkun Zou,Heyun Huang,Xiaoman Lu,Fanmei Zeng,Chu Ren,Weiqing Wang,Liguo Zhou,Xiaoyan Dai
DOI: https://doi.org/10.3390/rs16111915
IF: 5
2024-05-27
Remote Sensing
Abstract:As the urgency of PM2.5 prediction becomes increasingly ingrained in public awareness, deep-learning methods have been widely used in forecasting concentration trends of PM2.5 and other atmospheric pollutants. Traditional time-series forecasting models, like long short-term memory (LSTM) and temporal convolutional network (TCN), were found to be efficient in atmospheric pollutant estimation, but either the model accuracy was not high enough or the models encountered certain challenges due to their own structure or some specific application scenarios. This study proposed a high-accuracy, hourly PM2.5 forecasting model, poly-dimensional local-LSTM Transformer, namely PD-LL-Transformer, by deep-learning methods, based on air pollutant data and meteorological data, and aerosol optical depth (AOD) data retrieved from the Himawari-8 satellite. This research was based on the Yangtze River Delta Urban Agglomeration (YRDUA), China for 2020–2022. The PD-LL-Transformer had three parts: a poly-dimensional embedding layer, which integrated the advantages of allocating and embedding multi-variate features in a more refined manner and combined the superiority of different temporal processing methods; a local-LSTM block, which combined the advantages of LSTM and TCN; and a Transformer encoder block. Over the test set (the whole year of 2022), the model's R2 was 0.8929, mean absolute error (MAE) was 4.4523 μg/m3, and root mean squared error (RMSE) was 7.2683 μg/m3, showing great accuracy for PM2.5 prediction. The model surpassed other existing models upon the same tasks and similar datasets, with the help of which a PM2.5 forecasting tool with better performance and applicability could be established.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper proposes a high-accuracy hourly fine particulate matter (PM 2.5) prediction method called PD-LL-Transformer, mainly targeting the Yangtze River Delta urban agglomeration in China. The research background is the potential hazards of PM 2.5 to public health, such as causing respiratory problems and affecting atmospheric chemical reactions, hence the need for accurate prediction of its concentration. Traditional time series forecasting models such as LSTM and TCN have limited effectiveness in certain cases. In the paper, the researchers developed the PD-LL-Transformer model utilizing data on gaseous pollutants, meteorology, and aerosol optical depth (AOD) from the Himawari-8 satellite. The model consists of three parts: multidimensional embedding layer, local LSTM blocks, and Transformer encoding blocks, aimed at integrating the advantages of different time processing methods. On the test set in the Yangtze River Delta region from 2020 to 2022, the model achieved an R² of 0.8929, MAE of 4.4523 µg/m³, and RMSE of 7.2683 µg/m³, demonstrating high prediction accuracy and outperforming similar models. The research indicates that while existing models such as RNN and Transformer have limitations in time series forecasting, PD-LL-Transformer improves prediction accuracy and applicability by integrating multiple features and optimizing the structure. Additionally, the paper discusses details of data collection, processing, and model training, including preprocessing of air quality data, satellite AOD data, and meteorological data. Finally, the paper emphasizes the necessity of establishing more efficient and practical PM 2.5 prediction tools.