Abstract:In recent years, the air quality in China has become a matter of serious concern. Among the available indicators for evaluating air quality, PM2.5 is one of the most important. It comprises a complex mixture of extremely small particles and liquid droplets emitted into the air, whose diameters are no more than 2.5 μm. Environments with a high PM2.5 index are extremely harmful to human health. Once inhaled, these particles can affect the heart and lungs and cause serious health problems. Air pollution is closely related to meteorological conditions such as wind speed, wind direction, atmospheric stability, temperature, and air humidity. With the development of various machine learning methods, deep learning models based on neural networks are increasingly applied in air pollution research. In this study, the temperature, humidity, wind velocity data at different pressure altitudes from 8 locations around Beijing and average of PM2.5 data in Beijing were analyzed and normalized. Multi-dimensional data was ideal for research applications using machine learning methods. and three neural network models were built, including the back propagation (BP), convolutional neural network (CNN), and long short-term memory (LSTM) models, and trained them using the meteorological and PM2.5 data.The results indicate that the accuracies of the back propagation and convolutional neural network models in predicting the PM2.5 pollution level in the next hour is much lower than that of the long short-term memory model. The PM2.5 pollution index predicted for the next hour by the long short-term memory model is very close to the actual value. This result reveals the strong relationship between the PM2.5 pollution index of Beijing and the local meteorological conditions. The long short-term memory model is trained using meteorological data from different pressure altitudes, and found it to be more accurate in predicting pollution levels when using near-surface meteorological data than that obtained from multiple altitudes.

Evaluation of Different Machine Learning Approaches to Forecasting PM2.5 Mass Concentrations

Evaluation of Different Machine Learning Approaches in Forecasting PM2.5 Mass Concentrations

Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai

PM 2.5 concentration forecasting: Development of integrated multivariate variational mode decomposition with kernel Ridge regression and weighted mean of vectors optimization

PM2.5 Concentration Forecasting: Development of Integrated Multivariate Variational Mode Decomposition with Kernel Ridge Regression and Weighted Mean of Vectors Optimization

Machine-learning-based Model and Simulation Analysis of PM2. 5 Concentration Prediction in Beijing

Time series-based PM2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models

PM2.5 concentrations forecasting in Beijing through deep learning with different inputs, model structures and forecast time

A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities

Data-driven predictive modeling of PM2.5 concentrations using machine learning and deep learning techniques: a case study of Delhi, India

Evaluation of Time Series Forecasting Models for Estimation of PM2.5 Levels in Air

A hybrid model for enhanced forecasting of PM2.5 spatiotemporal concentrations with high resolution and accuracy

Novel MIA-LSTM Deep Learning Hybrid Model with Data Preprocessing for Forecasting of PM2.5

Application of machine learning algorithms to improve numerical simulation prediction of PM2.5 and chemical components

Multi-step Forecast of PM2.5 and PM10 Concentrations Using Convolutional Neural Network Integrated with Spatial–temporal Attention and Residual Learning

A Machine Learning Method to Estimate PM2.5 Concentrations Across China with Remote Sensing, Meteorological and Land Use Information.

PM2.5 Prediction Based on Random Forest, XGBoost, and Deep Learning Using Multisource Remote Sensing Data

Predicting PM2.5 levels and exceedance days using machine learning methods

A Hybrid CNN-LSTM Model for Forecasting Particulate Matter (PM2.5)

Long Short-Term Memory based PM2.5 Concentration Prediction Method

Development of a Data-Driven Three-Dimensional PM2.5 Forecast Model Based on Machine Learning Algorithms