A Spatial-Temporal Interpretable Deep Learning Model for improving interpretability and predictive accuracy of satellite-based PM2.5

Xing Yan,Zhou Zang,Yize Jiang,Wenzhong Shi,Yushan Guo,Dan Li,Chuanfeng Zhao,Letu Husi
DOI: https://doi.org/10.1016/j.envpol.2021.116459
IF: 8.9
2021-03-01
Environmental Pollution
Abstract:<p>Being able to monitor PM<sub>2.5</sub> across a range of scales is incredibly important for our ability to understand and counteract air pollution. Remote monitoring PM<sub>2.5</sub> using satellite-based data would be incredibly advantageous to this effort, but current machine learning methods lack necessary interpretability and predictive accuracy. This study details the development of a new Spatial-Temporal Interpretable Deep Learning Model (SIDLM) to improve the interpretability and predictive accuracy of satellite-based PM<sub>2.5</sub> measurements. In contrast to traditional deep learning models, the SIDLM is both "wide" and "deep." We comprehensively evaluated the proposed model in China using different input data (top-of-atmosphere (TOA) measurements-based and aerosol optical depth (AOD)-based, with or without meteorological data) and different spatial resolutions (10 km, 3 km, and 250 m). TOA-based SIDLM PM<sub>2.5</sub> achieved the best predictive accuracy in China, with root-mean-square errors (RMSE) of 15.30 and 15.96 μg/m<sup>3</sup>, and R<sup>2</sup> values of 0.70 and 0.66 for PM<sub>2.5</sub> predictions at 10 km and 3 km spatial resolutions, respectively. Additionally, we tested the SIDLM in PM<sub>2.5</sub> retrievals at a 250 m spatial resolution over Beijing, China (RMSE=16.01 μg/m<sup>3</sup>, R<sup>2</sup>=0.62). Furthermore, SIDLM demonstrated higher accuracy than five machine learning inversion methods, and also outperformed them regarding feature extraction and the interpretability of its inversion results. In particular, modeling results indicated the strong influence of the Tongzhou district on the principle PM<sub>2.5</sub> in the Beijing urban area. SIDLM-extracted temporal characteristics revealed that summer months (June-August) might have contributed less to PM<sub>2.5</sub> concentrations, indicating the limited accumulation of PM<sub>2.5</sub> in these months. Our study shows that SIDLM could become an important tool for other earth observation data in deep learning-based predictions and spatiotemporal analysis.</p>
environmental sciences
What problem does this paper attempt to address?