Machine learning combined with the PMF model reveal the synergistic effects of sources and meteorological factors on PM2.5 pollution

Zhongcheng Zhang, Bo Xu, Weiman Xu, Feng Wang, Jie Gao, Yue Li, Mei Li, Yinchang Feng, Guoliang Shi
DOI: https://doi.org/10.1016/j.envres.2022.113322
IF: 8.3
2022-09-01
Environmental Research
Abstract: PM<sub>2.5</sub> pollution is a complex process mainly affected by emission sources and meteorological conditions. However, it is hard to accurately assess the effects of emission sources and meteorological conditions on the variation of PM<sub>2.5</sub> concentrations in the complex atmospheric environment. In this study, the Random Forest model with Shapley Additive exPlanations (RF-SHAP) and Partial Dependence Plot (RF-PDP) was combined with Positive Matrix Factorization (PMF) to evaluate the impacts of various factors on PM<sub>2.5</sub> pollution. The results show that anthropogenic emissions and meteorological conditions contributed about 67% (40.5 μg/m<sup>3</sup>) and 33% (19.7 μg/m<sup>3</sup>) to variation in PM<sub>2.5</sub> concentrations, respectively. Specifically, secondary nitrate (SN) had the greatest impact among all sources (about 45%). Hence, we further explore the impacts of the primary sources and meteorological conditions on SN formation. Coal combustion and vehicle emissions significantly contribute to the formation of SN by providing a large number of precursor NO<sub>X</sub>. Additionally, the RF-PDP method was further employed to estimate the synergistic effects of primary sources and meteorological conditions on SN formation. The results help reveal strategies to simultaneously reduce SN by controlling primary emissions under suitable meteorological conditions. This work also suggests that the machine learning model can utilize online datasets well and provide a reliable approach for analyzing the causes of PM<sub>2.5</sub> pollution.
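The attribution idea behind the SHAP step in the abstract can be illustrated with a minimal sketch. It computes exact Shapley values for a single sample of a small hand-written response function, then groups the absolute contributions into "sources" and "meteorology" shares. Everything here is an assumption made for illustration: the function `toy_pm25`, the feature names, the coefficients, and the input values are hypothetical; the toy function stands in for the paper's Random Forest; and grouping by summed absolute Shapley values is one plausible way to form shares, not necessarily the authors' procedure.

```python
from itertools import combinations
from math import factorial


def toy_pm25(p):
    """Hypothetical response surface (NOT the paper's Random Forest).

    Includes one interaction term so that the attribution is not trivial.
    """
    return (10.0 + 0.8 * p["coal"] + 0.6 * p["vehicle"]
            + 0.01 * p["vehicle"] * p["humidity"]
            - 1.5 * p["wind_speed"])


def shapley_values(predict, x, baseline):
    """Exact Shapley values for sample x relative to a baseline sample.

    Features outside a coalition are set to their baseline values.
    Cost grows as 2^n, so this is only practical for a handful of features.
    """
    names = list(x)
    n = len(names)

    def value(coalition):
        point = {k: (x[k] if k in coalition else baseline[k]) for k in names}
        return predict(point)

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            # Shapley weight |S|! (n - |S| - 1)! / n! for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                members = set(subset)
                total += weight * (value(members | {name}) - value(members))
        phi[name] = total
    return phi


# Hypothetical polluted sample and a hypothetical "typical" baseline.
x = {"coal": 30.0, "vehicle": 25.0, "humidity": 80.0, "wind_speed": 1.0}
baseline = {"coal": 15.0, "vehicle": 12.0, "humidity": 50.0, "wind_speed": 3.0}

phi = shapley_values(toy_pm25, x, baseline)

# Assumed grouping: share of each group = its summed |Shapley value| / total.
groups = {"sources": ["coal", "vehicle"],
          "meteorology": ["humidity", "wind_speed"]}
magnitude = {g: sum(abs(phi[k]) for k in ks) for g, ks in groups.items()}
total_magnitude = sum(magnitude.values())
share = {g: m / total_magnitude for g, m in magnitude.items()}

for name, contribution in phi.items():
    print(f"{name:>10}: {contribution:+.2f}")
print({g: round(s, 3) for g, s in share.items()})
```

By construction the Shapley values sum to the difference between the prediction for `x` and the prediction for `baseline`, which is the additive property that makes a split such as "about 67% emissions, about 33% meteorology" well defined. For a real Random Forest with many features, exact enumeration is infeasible, and a tree-specific approximation would normally be used instead.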
environmental sciences; public, environmental & occupational health