PD-LL-Transformer: An Hourly PM2.5 Forecasting Method over the Yangtze River Delta Urban Agglomeration, China

Rongkun Zou,Heyun Huang,Xiaoman Lu,Fanmei Zeng,Chu Ren,Weiqing Wang,Liguo Zhou,Xiaoyan Dai

DOI: https://doi.org/10.3390/rs16111915

IF: 5

2024-05-27

Remote Sensing

Abstract:As the urgency of PM2.5 prediction becomes increasingly ingrained in public awareness, deep-learning methods have been widely used in forecasting concentration trends of PM2.5 and other atmospheric pollutants. Traditional time-series forecasting models, like long short-term memory (LSTM) and temporal convolutional network (TCN), were found to be efficient in atmospheric pollutant estimation, but either the model accuracy was not high enough or the models encountered certain challenges due to their own structure or some specific application scenarios. This study proposed a high-accuracy, hourly PM2.5 forecasting model, poly-dimensional local-LSTM Transformer, namely PD-LL-Transformer, by deep-learning methods, based on air pollutant data and meteorological data, and aerosol optical depth (AOD) data retrieved from the Himawari-8 satellite. This research was based on the Yangtze River Delta Urban Agglomeration (YRDUA), China for 2020–2022. The PD-LL-Transformer had three parts: a poly-dimensional embedding layer, which integrated the advantages of allocating and embedding multi-variate features in a more refined manner and combined the superiority of different temporal processing methods; a local-LSTM block, which combined the advantages of LSTM and TCN; and a Transformer encoder block. Over the test set (the whole year of 2022), the model's R2 was 0.8929, mean absolute error (MAE) was 4.4523 μg/m3, and root mean squared error (RMSE) was 7.2683 μg/m3, showing great accuracy for PM2.5 prediction. The model surpassed other existing models upon the same tasks and similar datasets, with the help of which a PM2.5 forecasting tool with better performance and applicability could be established.

environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary

What problem does this paper attempt to address?

The paper proposes a high-accuracy hourly fine particulate matter (PM 2.5) prediction method called PD-LL-Transformer, mainly targeting the Yangtze River Delta urban agglomeration in China. The research background is the potential hazards of PM 2.5 to public health, such as causing respiratory problems and affecting atmospheric chemical reactions, hence the need for accurate prediction of its concentration. Traditional time series forecasting models such as LSTM and TCN have limited effectiveness in certain cases. In the paper, the researchers developed the PD-LL-Transformer model utilizing data on gaseous pollutants, meteorology, and aerosol optical depth (AOD) from the Himawari-8 satellite. The model consists of three parts: multidimensional embedding layer, local LSTM blocks, and Transformer encoding blocks, aimed at integrating the advantages of different time processing methods. On the test set in the Yangtze River Delta region from 2020 to 2022, the model achieved an R² of 0.8929, MAE of 4.4523 µg/m³, and RMSE of 7.2683 µg/m³, demonstrating high prediction accuracy and outperforming similar models. The research indicates that while existing models such as RNN and Transformer have limitations in time series forecasting, PD-LL-Transformer improves prediction accuracy and applicability by integrating multiple features and optimizing the structure. Additionally, the paper discusses details of data collection, processing, and model training, including preprocessing of air quality data, satellite AOD data, and meteorological data. Finally, the paper emphasizes the necessity of establishing more efficient and practical PM 2.5 prediction tools.

PD-LL-Transformer: An Hourly PM2.5 Forecasting Method over the Yangtze River Delta Urban Agglomeration, China

Ambient PM2.5 Estimates and Variations During COVID-19 Pandemic in the Yangtze River Delta Using Machine Learning and Big Data

A hybrid deep learning technology for PM2.5 air quality forecasting

Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai

PM2.5 Concentration Forecasting over the Central Area of the Yangtze River Delta Based on Deep Learning Considering the Spatial Diffusion Process

A Novel Short-Term PM2.5 Forecasting Approach Using Secondary Decomposition and a Hybrid Deep Learning Model

Estimating Hourly PM2.5 Concentrations Using Himawari-8 AOD and a DBSCAN-modified Deep Learning Model over the YRDUA, China

Probing the capacity of a spatiotemporal deep learning model for short-term PM2.5 forecasts in a coastal urban area

A Novel Hybrid Machine Learning Method (OR-ELM-AR) Used in Forecast of PM2.5 Concentrations and Its Forecast Performance Evaluation

Forecasting PM2.5 Using Hybrid Graph Convolution-Based Model Considering Dynamic Wind-Field to Offer the Benefit of Spatial Interpretability.

Regional aerosol forecasts based on deep learning and numerical weather prediction

Improved prediction of hourly PM2.5 concentrations with a long short-term memory and spatio-temporal causal convolutional network deep learning model

ResInformer: Residual Transformer-Based Artificial Time-Series Forecasting Model for PM2.5 Concentration in Three Major Chinese Cities

Improving PM2.5 and PM10 Predictions in China from WRF_Chem Through a Deep Learning Method: Multiscale Depth-Separable UNet

Estimation of daily ground-level PM2.5 concentrations over the Pearl River Delta using 1 km resolution MODIS AOD based on multi-feature BiLSTM

A Novel Hybrid Framework for Hourly PM2.5 Concentration Forecasting Using CEEMDAN and Deep Temporal Convolutional Neural Network

A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities

Encoder-Decoder Model for Forecast of PM2.5 Concentration Per Hour

Evaluation of Different Machine Learning Approaches in Forecasting PM2.5 Mass Concentrations

Prediction of PM2.5 Concentration Based on Deep Learning, Multi-Objective Optimization, and Ensemble Forecast

Deep Learning-Based PM2.5 Long Time-Series Prediction by Fusing Multisource Data—A Case Study of Beijing