A Novel Recursive Model Based on a Convolutional Long Short-Term Memory Neural Network for Air Pollution Prediction

Weilin Wang,Wenjing Mao,Xueli Tong,Gang Xu
DOI: https://doi.org/10.3390/rs13071284
IF: 5
2021-03-27
Remote Sensing
Abstract:Deep learning provides a promising approach for air pollution prediction. The existing deep learning-based predicted models generally consider either the temporal correlations of air quality monitoring stations or the nonlinear relationship between the PM2.5 (particulate matter with an aerodynamic diameter of less than 2.5 μm) concentrations and explanatory variables. Spatial correlation has not been effectively incorporated into prediction models, therefore exhibiting poor performance in PM2.5 prediction tasks. Additionally, determining the manner by which to expand longer-term prediction tasks is still challenging. In this paper, to allow for spatiotemporal correlations, a spatiotemporal convolutional recursive long short-term memory (CR-LSTM) neural network model is proposed for predicting the PM2.5 concentrations in long-term prediction tasks by combining a convolutional long short-term memory (ConvLSTM) neural network and a recursive strategy. Herein, the ConvLSTM network was used to capture the complex spatiotemporal correlations and to predict the future PM2.5 concentrations; the recursive strategy was used for expanding the long-term prediction tasks. The CR-LSTM model was used to realize the prediction of the future 24 h of PM2.5 concentrations for 12 air quality monitoring stations in Beijing by configuring both the appropriate time lag derived from the temporal correlations and the spatial neighborhood, including the hourly historical PM2.5 concentrations, the daily mean meteorological data, and the annual nighttime light and normalized difference vegetation index (NDVI). The results showed that the proposed CR-LSTM model achieved better performance (coefficient of determination (R2) = 0.74; root mean square error (RMSE) = 18.96 μg/m3) than other common models, such as multiple linear regression (MLR), support vector regression (SVR), the conventional LSTM model, the LSTM extended (LSTME) model, and the temporal sliding LSTM extended (TS-LSTME) model. The proposed CR-LSTM model, implementing a combination of geographical rules, recursive strategy, and deep learning, shows improved performance in longer-term prediction tasks.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper primarily addresses two key issues in air pollution prediction: 1. **Ineffective Utilization of Spatial Correlation**: Existing deep learning-based air pollution prediction models typically only consider the temporal correlation of air quality monitoring stations or the nonlinear relationship between PM2.5 concentration and explanatory variables, without effectively incorporating spatial correlation into the prediction model. This leads to poor performance in PM2.5 concentration prediction tasks. 2. **Difficulty in Extending Long-term Prediction Tasks**: Even though existing models based on Long Short-Term Memory (LSTM) networks are suitable for short-term prediction tasks, they generally perform poorly in long-term prediction tasks (such as predictions over 24 hours). ### Solution To address the above issues, the authors propose a Spatio-Temporal Convolutional Recurrent Long Short-Term Memory (CR-LSTM) model based on Convolutional LSTM (ConvLSTM) neural networks and a recursive strategy. This model captures complex spatio-temporal correlations and achieves long-term prediction tasks by combining ConvLSTM networks and a recursive strategy. ### Main Contributions - **Handling Spatio-Temporal Correlations**: Capturing complex spatio-temporal correlations using ConvLSTM networks. - **Application of Recursive Strategy**: Extending long-term prediction tasks by determining an appropriate recursive period through the analysis of temporal correlations. - **Model Performance Validation**: Validating the proposed CR-LSTM model's superiority over other common models, including Multiple Linear Regression (MLR), Support Vector Regression (SVR), and traditional LSTM models, through a 24-hour PM2.5 concentration prediction task at 12 air quality monitoring stations in Beijing. ### Conclusion This study proposes an innovative approach, the CR-LSTM model, which combines geographical rules, recursive strategies, and deep learning techniques, significantly improving the performance of long-term air pollution prediction tasks. It has broad application prospects in atmospheric environmental science.