PM2.5 concentration prediction based on EEMD-ALSTM

Zuhan Liu,Dong Ji,Lili Wang
DOI: https://doi.org/10.1038/s41598-024-63620-9
IF: 4.6
2024-06-03
Scientific Reports
Abstract:The concentration prediction of PM 2.5 plays a vital role in controlling the air and improving the environment. This paper proposes a prediction model (namely EEMD-ALSTM) based on Ensemble Empirical Mode Decomposition (EEMD), Attention Mechanism and Long Short-Term Memory network (LSTM). Through the combination of decomposition and LSTM, attention mechanism is introduced to realize the prediction of PM 2.5 concentration. The advantage of EEMD-ALSTM model is that it decomposes and combines the original data using the method of ensemble empirical mode decomposition, reduces the high nonlinearity of the original data, and Specially reintroduction the attention mechanism, which enhances the extraction and retention of data features by the model. Through experimental comparison, it was found that the EEMD-ALSTM model reduced its MAE and RMSE by about 15% while maintaining the same R 2 correlation coefficient, and the stability of the model in the prediction process was also improved significantly.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper proposes a solution to the problem of predicting particulate matter (PM2.5) concentration. With the rapid development of China's economy and the acceleration of urbanization, air pollution, especially haze pollution caused by PM2.5, has become increasingly serious and poses a threat to the environment and public health. In order to effectively control and predict PM2.5 concentration, the paper introduces a prediction model called EEMD-ALSTM, which combines Ensemble Empirical Mode Decomposition (EEMD), Attention Mechanism, and Long Short-Term Memory network (LSTM). The EEMD-ALSTM model decomposes the original data through EEMD to reduce its nonlinearity, and then uses attention mechanism to enhance the extraction and preservation of data features. Experimental comparisons show that the EEMD-ALSTM model reduces the Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) by about 15% while maintaining the same coefficient of determination (R²), and improves the stability of the model during the prediction process. Currently, the methods for predicting PM2.5 concentration mainly fall into two categories: physics-driven and data-driven. However, these methods either rely on multiple influencing factors and require a large amount of equipment to collect data, or fail to deeply understand the data when processing a large amount of information. Therefore, the proposed EEMD-ALSTM model aims to achieve long-term prediction through a single data source, reduce the nonlinearity of the data, and improve prediction accuracy. The combination of EEMD decomposition and attention mechanism has been proven to enhance the accuracy of the prediction results.