A machine learning-based ensemble model for estimating diurnal variations of nitrogen oxide concentrations in Taiwan

Aji Kusumaning Asri,Hsiao-Yun Lee,Yu-Ling Chen,Pei-Yi Wong,Chin-Yu Hsu,Pau-Chung Chen,Shih-Chun Candice Lung,Yu-Cheng Chen,Chih-Da Wu
DOI: https://doi.org/10.1016/j.scitotenv.2024.170209
IF: 9.8
2024-01-29
The Science of The Total Environment
Abstract:Air pollution is inextricable from human activity patterns. This is especially true for nitrogen oxide (NO x ), a pollutant that exists naturally and also as a result of anthropogenic factors. Assessing exposure by considering diurnal variation is a challenge that has not been widely studied. Incorporating 27 years of data, we attempted to estimate diurnal variations in NO x across Taiwan. We developed a machine learning-based ensemble model that integrated hybrid kriging-LUR, machine-learning, and an ensemble learning approach. Hybrid kriging-LUR was performed to select the most influential predictors, and machine-learning algorithms were applied to improve model performance. The three best machine-learning algorithms were suited and reassessed to develop ensemble learning that was designed to improve model performance. Our ensemble model resulted in estimates of daytime, nighttime, and daily NO x with high explanatory powers (Adj-R 2 ) of 0.93, 0.98, and 0.94, respectively. These explanatory powers increased from the initial model that used only hybrid kriging-LUR. Additionally, the results depicted the temporal variation of NO x , with concentrations higher during the daytime than the nighttime. Regarding spatial variation, the highest NO x concentrations were identified in northern and western Taiwan. Model evaluations confirmed the reliability of the models. This study could serve as a reference for regional planning supporting emission control for environmental and human health.
environmental sciences
What problem does this paper attempt to address?