Abstract:Abstract. As air pollution is regarded as the single largest environmental health risk in Europe it is important that communication to the public is up to date and accurate and provides means to avoid exposure to high air pollution levels. Long- and short-term exposure to outdoor air pollution is associated with increased risks of mortality and morbidity. Up-to-date information on present and coming days' air quality helps people avoid exposure during episodes with high levels of air pollution. Air quality forecasts can be based on deterministic dispersion modelling, but to be accurate this requires detailed information on future emissions, meteorological conditions and process-oriented dispersion modelling. In this paper, we apply different machine learning (ML) algorithms – random forest (RF), extreme gradient boosting (XGB), and long short-term memory (LSTM) – to improve 1, 2, and 3 d deterministic forecasts of PM10, NOx, and O3 at different sites in Greater Stockholm, Sweden. It is shown that the deterministic forecasts can be significantly improved using the ML models but that the degree of improvement of the deterministic forecasts depends more on pollutant and site than on what ML algorithm is applied. Also, four feature importance methods, namely the mean decrease in impurity (MDI) method, permutation method, gradient-based method, and Shapley additive explanations (SHAP) method, are utilized to identify significant features that are common and robust across all models and methods for a pollutant. Deterministic forecasts of PM10 are improved by the ML models through the input of lagged measurements and Julian day partly reflecting seasonal variations not properly parameterized in the deterministic forecasts. A systematic discrepancy by the deterministic forecasts in the diurnal cycle of NOx is removed by the ML models considering lagged measurements and calendar data like hour and weekday, reflecting the influence of local traffic emissions. For O3 at the urban background site, the local photochemistry is not properly accounted for by the relatively coarse Copernicus Atmosphere Monitoring Service ensemble model (CAMS) used here for forecasting O3 but is compensated for using the ML models by taking lagged measurements into account. Through multiple repetitions of the training process, the resulting ML models achieved improvements for all sites and pollutants. For NOx at street canyon sites, mean squared error (MSE) decreased by up to 60 %, and seven metrics, such as R2 and mean absolute percentage error (MAPE), exhibited consistent results. The prediction of PM10 is improved significantly at the urban background site, whereas the ML models at street sites have difficulty capturing more information. The prediction accuracy of O3 also modestly increased, with differences between metrics. Further work is needed to reduce deviations between model results and measurements for short periods with relatively high concentrations (peaks) at the street canyon sites. Such peaks can be due to a combination of non-typical emissions and unfavourable meteorological conditions, which are rather difficult to forecast. Furthermore, we show that general models trained using data from selected street sites can improve the deterministic forecasts of NOx at the station not involved in model training. For PM10 this was only possible using more complex LSTM models. An important aspect to consider when choosing ML algorithms is the computational requirements for training the models in the deployment of the system. Tree-based models (RF and XGB) require fewer computational resources and yield comparable performance in comparison to LSTM. Therefore, tree-based models are now implemented operationally in the forecasts of air pollution and health risks in Stockholm. Nevertheless, there is big potential to develop generic models using advanced ML to take into account not only local temporal variation but also spatial variation at different stations.

Temporal Heterogeneity in the Performance of Machine Learning Models for PM2.5 Concentration Estimation

Estimating PM2.5 from Multisource Data: A Comparison of Different Machine Learning Models in the Pearl River Delta of China

Short-Term PM2.5 Concentration Changes Prediction: A Comparison of Meteorological and Historical Data

A Machine Learning Method to Estimate PM2.5 Concentrations Across China with Remote Sensing, Meteorological and Land Use Information.

Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai

Predicting PM2.5 levels and exceedance days using machine learning methods

Spatiotemporal Continuous Estimates of PM2.5 Concentrations in China, 2000-2016: A Machine Learning Method with Inputs from Satellites, Chemical Transport Model, and Ground Observations.

Machine learning and deep learning modeling and simulation for predicting PM2.5 concentrations

Prediction of PM2.5 Concentration Using Spatiotemporal Data with Machine Learning Models

High-resolution Estimation of PM2.5 Concentrations Across China Using Multiple Machine Learning Approaches and Model Fusion

Spatiotemporal dynamics and exposure analysis of daily PM2.5 using a remote sensing-based machine learning model and multi-time meteorological parameters

Seasonal Prediction of Daily PM2.5 Concentrations with Interpretable Machine Learning: a Case Study of Beijing, China

Time series-based PM2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models

Optimizing Modeling Windows to Better Capture the Long-Term Variation of PM2.5 Concentrations in China During 2005-2019.

Predicting Personal Exposure to PM2.5 Using Different Determinants and Machine Learning Algorithms in Two Megacities, China

An Intercomparison of Weather Normalization of PM2.5 Concentration Using Traditional Statistical Methods, Machine Learning, and Chemistry Transport Models

Enhanced PM2.5 estimation across China: An AOD-independent two-stage approach incorporating improved spatiotemporal heterogeneity representations

High-Resolution Spatiotemporal Modeling for Ambient PM2.5 Exposure Assessment in China from 2013 to 2019.

Extracting Regional and Temporal Features to Improve Machine Learning for Hourly Air Pollutants in Urban India

Estimating particulate matter concentrations and meteorological contributions in China during 2000–2020

Improving 3-day deterministic air pollution forecasts using machine learning algorithms