Abstract:Abstract. As the deep learning algorithm has become a popular data analytic technique, atmospheric scientists should have a balanced perception of its strengths and limitations so that they can provide a powerful analysis of complex data with well-established procedures. Despite the enormous success of the algorithm in numerous applications, certain issues related to its applications in air quality forecasting (AQF) require further analysis and discussion. This study addresses significant limitations of an advanced deep learning algorithm, the convolutional neural network (CNN), in two common applications: (i) a real-time AQF model, and (ii) a post-processing tool in a dynamical AQF model, the Community Multi-scale Air Quality Model (CMAQ). In both cases, the CNN model shows promising accuracy for ozone prediction 24 hours in advance in both the United States and South Korea (with an overall index of agreement exceeding 0.8). For the first case, we use the wavelet transform to determine the reasons behind the poor performance of CNN during the nighttime, cold months, and high ozone episodes. We find that when fine wavelet modes (hourly and daily) are relatively weak or when coarse wavelet modes (weekly) are strong, the CNN model produces less accurate forecasts. For the second case, we use the dynamic time warping (DTW) distance analysis to compare post-processed results with their CMAQ counterparts (as a base model). For CMAQ results that show a consistent DTW distance from the observation, the post-processing approach properly addresses the modeling bias with predicted IOAs exceeding 0.85. When the DTW distance of CMAQ-vs-observation is irregular, the post-processing approach is unlikely to perform satisfactorily. Awareness of the limitations in CNN models will enable scientists to develop more accurate regional or local air quality forecasting systems by identifying the affecting factors in high concentration episodes.

Using wavelet transform and dynamic time warping to identify the limitations of the CNN model as an air quality forecasting system

A Novel CMAQ-CNN Hybrid Model to Forecast Hourly Surface-Ozone Concentrations Fourteen Days in Advance

A real-time hourly ozone prediction system using deep convolutional neural network

Enhancing real-time PM 2.5 forecasts: A hybrid approach of WRF-CMAQ model and CNN algorithm

Deep-AIR: A Hybrid CNN-LSTM Framework forFine-Grained Air Pollution Forecast

A Deep Convolutional Neural Network Model for improving WRF Forecasts

An attention-based CNN model integrating observational and simulation data for high-resolution spatial estimation of urban air quality

Deep-AIR: A Hybrid CNN-LSTM Framework for Air Quality Modeling in Metropolitan Cities

Air quality forecasting using convolutional neural networks

Predicting high-resolution air quality using machine learning: Integration of large eddy simulation and urban morphology data

Implementing heuristic-based multiscale depth-wise separable adaptive temporal convolutional network for ambient air quality prediction using real time data

Real-time early warning and the prediction of air pollutants for sustainable development in smart cities

Deep-AIR: A Hybrid CNN-LSTM Framework for Fine-Grained Air Pollution Estimation and Forecast in Metropolitan Cities

A hybrid CNN-Transformer model for ozone concentration prediction

Prediction of atmospheric pollutants in urban environment based on coupled deep learning model and sensitivity analysis

Multi-hour and multi-site air quality index forecasting in Beijing using CNN, LSTM, CNN-LSTM, and spatiotemporal clustering

Leveraging Machine Learning for Fault-Tolerant Air Pollutants Monitoring for a Smart City Design

Real time image-based air quality forecasts using a 3D-CNN approach with an attention mechanism

Prediction of Air Pollutant Concentration Based on One-Dimensional Multi-Scale CNN-LSTM Considering Spatial-Temporal Characteristics: A Case Study of Xi’an, China

Forecasting air pollutant concentration using a novel spatiotemporal deep learning model based on clustering, feature selection and empirical wavelet transform

A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities