Time series prediction of the chemical components of PM2.5 based on a deep learning model
Kai Liu, Yuanhang Zhang, Huan He, Hui Xiao, Siyuan Wang, Yuteng Zhang, Huiming Li, Xin Qian
DOI: https://doi.org/10.1016/j.chemosphere.2023.140153
IF: 8.8
2023-09-15
Chemosphere
Abstract: Compared with traditional detection methods based on chemical reactions, modeling-based prediction methods enable rapid, reagent-free air pollution detection from inexpensive multi-source data, so that the air pollution situation can be assessed quickly. In this study, a convolutional neural network (CNN) and a long short-term memory (LSTM) neural network are integrated into a CNN-LSTM time series prediction model that predicts the concentrations of PM2.5 and its chemical components (i.e., heavy metals, carbon components, and water-soluble ions) from meteorological data and air pollutant data (PM2.5, SO₂, NO₂, CO, and O₃). In the integrated CNN-LSTM model, the CNN uses convolutional and pooling layers to extract features from the data, whereas the powerful nonlinear mapping and learning capabilities of the LSTM enable the time series prediction of air pollution. The experimental results showed that the CNN-LSTM exhibited good generalization ability in the prediction of As, Cd, Cr, Cu, Ni, and Zn, with a mean R² above 0.9. The mean R² for PM2.5, Pb, Ti, EC, OC, SO₄²⁻, and NO₃⁻ ranged from 0.85 to 0.9. Shapley value analysis showed that PM2.5, NO₂, SO₂, and CO had the greatest influence on the heavy metal predictions of the model. For water-soluble ions, the predictions were influenced predominantly by PM2.5, CO, and humidity. The prediction of the carbon components was affected mainly by the PM2.5 concentration. Additionally, several input variables for various components were eliminated without affecting the prediction accuracy of the model, with R² between 0.70 and 0.84, thereby maximizing modeling efficiency and lowering operational costs. The predictions of the fully trained model showed that most predicted components of PM2.5 were lower from January to March 2020 than in the same period in 2018 and 2019.
This study provides insight into improving the accuracy of modeling-based detection methods and promotes the development of integrated air pollution monitoring toward a more sustainable direction.
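The CNN-LSTM pipeline described in the abstract (convolution and pooling for feature extraction, followed by an LSTM for the temporal mapping) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the window length (24 steps), the six input features, the kernel size, the number of channels, the hidden size, and all function and parameter names are assumptions chosen for illustration, and the weights are random and untrained, so the output has no physical meaning.

```python
import math
import random


def conv1d(seq, kernels, bias):
    """Valid 1D convolution along time with ReLU. seq: T x F, kernels: C x K x F."""
    k = len(kernels[0])
    out = []
    for t in range(len(seq) - k + 1):
        row = []
        for c, kern in enumerate(kernels):
            s = bias[c]
            for i in range(k):
                s += sum(w * x for w, x in zip(kern[i], seq[t + i]))
            row.append(max(0.0, s))  # ReLU activation
        out.append(row)
    return out


def max_pool(seq, size):
    """Non-overlapping max pooling along the time axis."""
    out = []
    for start in range(0, len(seq) - size + 1, size):
        window = seq[start:start + size]
        out.append([max(col) for col in zip(*window)])
    return out


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_last_hidden(seq, W, U, b):
    """Single LSTM layer; returns the final hidden state. Gate order: i, f, g, o."""
    n = len(U[0])  # hidden size
    h, c = [0.0] * n, [0.0] * n
    for x in seq:
        z = [b[j]
             + sum(w * v for w, v in zip(W[j], x))
             + sum(u * v for u, v in zip(U[j], h))
             for j in range(4 * n)]
        i = [sigmoid(v) for v in z[:n]]               # input gate
        f = [sigmoid(v) for v in z[n:2 * n]]          # forget gate
        g = [math.tanh(v) for v in z[2 * n:3 * n]]    # candidate cell state
        o = [sigmoid(v) for v in z[3 * n:]]           # output gate
        c = [f[j] * c[j] + i[j] * g[j] for j in range(n)]
        h = [o[j] * math.tanh(c[j]) for j in range(n)]
    return h


def init_params(n_features, n_channels=4, kernel=3, hidden=8, seed=0):
    """Random, untrained weights (hypothetical sizes, for illustration only)."""
    rng = random.Random(seed)

    def u():
        return rng.uniform(-0.5, 0.5)

    return {
        "kernels": [[[u() for _ in range(n_features)] for _ in range(kernel)]
                    for _ in range(n_channels)],
        "conv_bias": [0.0] * n_channels,
        "W": [[u() for _ in range(n_channels)] for _ in range(4 * hidden)],
        "U": [[u() for _ in range(hidden)] for _ in range(4 * hidden)],
        "b": [0.0] * (4 * hidden),
        "w_out": [u() for _ in range(hidden)],
        "b_out": 0.0,
    }


def predict(window, p, pool=2):
    """CNN feature extraction, then LSTM, then a linear output for one component."""
    feats = max_pool(conv1d(window, p["kernels"], p["conv_bias"]), pool)
    h = lstm_last_hidden(feats, p["W"], p["U"], p["b"])
    return p["b_out"] + sum(w * v for w, v in zip(p["w_out"], h))


# Synthetic 24-step window of 6 scaled inputs (e.g. PM2.5, SO2, NO2, CO, O3, humidity).
rng = random.Random(1)
window = [[rng.random() for _ in range(6)] for _ in range(24)]
params = init_params(n_features=6)
print(predict(window, params))
```

In practice such a model would be built and trained with a deep learning framework, with one output (or one model) per chemical component; the sketch only shows how the convolution and pooling stage shortens and re-encodes the input window before the LSTM consumes it.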
environmental sciences