Estimating high-resolution PM 2.5 concentration in the Sichuan Basin using a random forest model with data-driven spatial autocorrelation terms

Yi Zhang,Siwei Zhai,Jingfei Huang,Xuelin Li,Wei Wang,Tao Zhang,Fei Yin,Yue Ma
DOI: https://doi.org/10.1016/j.jclepro.2022.134890
IF: 11.1
2022-11-10
Journal of Cleaner Production
Abstract:The Sichuan Basin (SCB) is severely polluted by fine particulate matter (PM 2.5 ). Accurate PM 2.5 concentration is important for pollution control and epidemiological studies. Evidence indicates that the distribution of PM 2.5 is spatially clustered. Additionally, the high local variation in PM 2.5 in densely populated areas indicates the necessity of high-resolution PM 2.5 estimation. However, spatial clustering and local variation are not considered in current studies in the SCB, which may limit the prediction accuracy of PM 2.5 estimation. In this study, we estimated the PM 2.5 concentration at 0.01° (approximately 1 km) resolution using a random forest model with data-driven spatial autocorrelation terms (DDW-RF) considering both the first-law-of-geography-based similarity and spatial clustering of PM 2.5 . The repeated 10-fold cross-validations revealed that compared to the traditional RF model, the optimal model had an 18.31% decrease in the root mean square error (RMSE) and a 4.68% increase in the coefficient of determination (R 2 ). The distribution of PM 2.5 revealed another heavily polluted area in the northeastern SCB, including Nanchong and Dazhou besides the two commonly known heavily PM 2.5 polluted areas in the western and southern SCB. Then, we built a downscaled model in the megacity Chengdu, which estimated PM 2.5 at 0.001° resolution with a 0.156 μg/m 3 (1.72%) decrease in RMSE compared to those of 0.01° estimations. The accurate and high-resolution PM 2.5 estimates generated by DDW-RF and downscaled models in this study could be beneficial for accurate health effect estimation not only in the whole SCB but also in the city areas with high variable concentrations.
environmental sciences,green & sustainable science & technology,engineering, environmental
What problem does this paper attempt to address?