Estimating PM2.5 concentrations via random forest method using satellite, auxiliary, and ground-level station dataset at multiple temporal scales across China in 2017

Bin Guo, Dingming Zhang, Lin Pei, Yi Su, Xiaoxia Wang, Yi Bian, Donghai Zhang, Wanqiang Yao, Zixiang Zhou, Liyu Guo
DOI: https://doi.org/10.1016/j.scitotenv.2021.146288
2021-07-01
Abstract:<p>Fine particulate matter with aerodynamic diameters less than 2.5 μm (PM<sub>2.5</sub>) adversely affects public health and the environment. Estimating high-resolution PM<sub>2.5</sub> concentrations at moderate scales remains a great challenge. The current study calibrated PM<sub>2.5</sub> concentrations at a 1 km resolution using ground-level monitoring data, Aerosol Optical Depth (AOD), meteorological data, and auxiliary data with a Random Forest (RF) model across China in 2017. Three ten-fold cross-validation (CV) methods (sample-based, time-based, and spatial-based), combined with the coefficient of determination (R<sup>2</sup>), Root-Mean-Square Error (RMSE), and Mean Predictive Error (MPE), were used for validation at daily, monthly, heating-season, and non-heating-season temporal scales. Finally, the spatial distribution of PM<sub>2.5</sub> concentrations was mapped based on the RF model. The RF model performed well, with a relatively high sample-based cross-validation R<sup>2</sup> of 0.74, a low RMSE of 16.29 μg m<sup>−3</sup>, and a small MPE of −0.282 μg m<sup>−3</sup>. The RF model also inferred PM<sub>2.5</sub> concentrations well at urban scales, except for Chengyu (CY). North China, the CY urban agglomeration, and the northwest of China exhibited relatively high PM<sub>2.5</sub> pollution, especially in the heating season. The RF model in the present study was more robust than most statistical regression models for calibrating PM<sub>2.5</sub> concentrations. The outcomes can supply an up-to-date scientific dataset for epidemiological and air-pollutant exposure risk studies across China.</p>
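The three validation schemes named in the abstract differ only in how records are assigned to folds. The following is a minimal sketch of that idea, not the authors' implementation: it assumes scikit-learn, uses synthetic data in place of the AOD, meteorological, and auxiliary predictors, and assumes that time-based and spatial-based CV hold out whole days and whole stations respectively. The helper `cross_validate`, the variable names, the forest size, and the sign convention for MPE (predicted minus observed) are all illustrative assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GroupKFold, KFold


def cross_validate(X, y, splitter, groups=None, seed=0):
    """Return (R2, RMSE, MPE) computed from out-of-fold predictions."""
    pred = np.empty_like(y, dtype=float)
    for train_idx, test_idx in splitter.split(X, y, groups):
        model = RandomForestRegressor(n_estimators=50, random_state=seed)
        model.fit(X[train_idx], y[train_idx])
        pred[test_idx] = model.predict(X[test_idx])
    err = pred - y
    # Assumed definitions: R2 as 1 - SSE/SST, MPE as the signed mean error.
    r2 = 1.0 - np.sum(err ** 2) / np.sum((y - y.mean()) ** 2)
    rmse = np.sqrt(np.mean(err ** 2))
    mpe = np.mean(err)
    return float(r2), float(rmse), float(mpe)


# Synthetic stand-in data: 30 hypothetical stations observed on 20 days.
rng = np.random.default_rng(0)
n_stations, n_days = 30, 20
station = np.repeat(np.arange(n_stations), n_days)
day = np.tile(np.arange(n_days), n_stations)
X = rng.normal(size=(station.size, 4))  # placeholder predictors
y = 35 + 10 * X[:, 0] - 5 * X[:, 1] + rng.normal(scale=3, size=station.size)

results = {
    # Sample-based: individual records are shuffled into ten folds.
    "sample-based": cross_validate(
        X, y, KFold(n_splits=10, shuffle=True, random_state=0)
    ),
    # Time-based: all records from a given day fall in the same fold.
    "time-based": cross_validate(X, y, GroupKFold(n_splits=10), groups=day),
    # Spatial-based: all records from a given station fall in the same fold.
    "spatial-based": cross_validate(X, y, GroupKFold(n_splits=10), groups=station),
}

for name, (r2, rmse, mpe) in results.items():
    print(f"{name}: R2={r2:.2f} RMSE={rmse:.2f} MPE={mpe:.3f}")
```

Grouping by station or by day keeps the held-out fold free of records from locations or dates seen in training, which is why spatial-based and time-based scores are usually the stricter tests of how a model generalizes.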
environmental sciences
What problem does this paper attempt to address?