Abstract: Climate change and air pollution are topics of growing concern because of their potentially enormous implications for human health and society. In recent years, tropospheric ozone has been recognized as an important greenhouse gas and a pollutant that is detrimental to human health, agriculture, and natural ecosystems, and it has attracted increasing research interest. Machine-learning-based approaches have been widely applied to the estimation of tropospheric ozone concentrations, but few studies have included tropospheric ozone profiles. This study aimed to predict the Northern Hemisphere distribution of Lower-Stratosphere-to-Troposphere (LST) ozone, from the 100 hPa pressure level down to the near surface, by employing a deep learning Long Short-Term Memory (LSTM) model. To train the predictive models, we used the historical record of all observed parameters between 2014 and 2018: meteorological data from the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5), satellite data, and ozone profiles from the World Ozone and Ultraviolet Data Center (WOUDC). Model–measurement comparisons at the WOUDC monitoring sites for the period 2019–2020 show that the mean coefficients of determination (R²) for the Northern Hemisphere at high latitude (NH), middle latitude (NM), and low latitude (NL) are 0.928, 0.885, and 0.590, respectively, indicating reasonable performance for the LSTM forecasting model. To improve the performance of the model, we applied the LSTM migration models to the Civil Aircraft for the Regular Investigation of the Atmosphere Based on an Instrument Container (CARIBIC) flights in the Northern Hemisphere from 2018 to 2019 and to three urban agglomerations (the Sichuan Basin (SCB), North China Plain (NCP), and Yangtze River Delta region (YRD)) between 2018 and 2019. The results show that our models performed well on the CARIBIC data set, with a high R² of 0.754.
The daily and monthly surface ozone concentrations for 2018–2019 in the three urban agglomerations were estimated from meteorological and ancillary variables. Our results suggest that the LSTM models can accurately estimate the monthly surface ozone concentrations in the three clusters, with relatively high coefficients (R²) of 0.815–0.889, root mean square errors (RMSEs) of 7.769–8.729 ppb, and mean absolute errors (MAEs) of 6.111–6.930 ppb. Performance at the daily scale was lower than at the monthly scale, with R² = 0.636–0.737, RMSE = 14.543–16.916 ppb, and MAE = 11.130–12.687 ppb. In general, the trained LSTM-based model is robust and can capture the variation of the atmospheric ozone distribution. Moreover, it contributes to our understanding of the mechanisms of air pollution, especially in polluted areas.
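For readers unfamiliar with the three scores quoted above, the following is a minimal sketch of how R², RMSE, and MAE can be computed from paired observed and predicted ozone values. It is not the authors' evaluation code. The function name `regression_metrics` and the sample values are hypothetical, and the abstract does not say which definition of R² was used; the sketch assumes the common form 1 − SS_res/SS_tot.

```python
import math


def regression_metrics(observed, predicted):
    """Return (r2, rmse, mae) for paired observed/predicted values.

    Assumption: R² is computed as 1 - SS_res / SS_tot. The source does not
    state whether this or the squared Pearson correlation was used.
    """
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]

    # Mean absolute error and root mean square error, in the input units (e.g. ppb).
    mae = sum(abs(e) for e in errors) / n
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)

    # Total sum of squares around the observed mean.
    mean_obs = sum(observed) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return r2, rmse, mae


# Hypothetical ozone values in ppb, for illustration only.
obs = [40.0, 50.0, 60.0]
pred = [42.0, 49.0, 57.0]
r2, rmse, mae = regression_metrics(obs, pred)
print(f"R2={r2:.3f}  RMSE={rmse:.3f} ppb  MAE={mae:.3f} ppb")
```

RMSE and MAE carry the units of the input (ppb here), while R² is dimensionless. That is why the abstract reports the first two in ppb and the third as a plain number.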

Estimation of Lower-Stratosphere-to-Troposphere Ozone Profile Using Long Short-Term Memory (LSTM)
