Abstract: Air pollution, such as PM2.5 (particulate matter with an aerodynamic equivalent diameter of less than 2.5 μm), PM10 (particulate matter with an aerodynamic equivalent diameter of less than 10 μm), NOx, and SOx, is a global concern because it may cause many chronic and fatal diseases, especially in developing countries. To better address air pollution problems, an important step is the timely and accurate prediction of air quality. Traditional methods are mainly based on meteorological data, regression model data, remote sensing data and different retrieval methods. Numerous studies on deep learning methods have suggested that these approaches may be able to perform accurate predictions for complex systems. In this paper, a long short-term memory (LSTM) approach for predicting air quality is proposed; moreover, meteorological data are used and Chinese social media is investigated as a proxy for public perceptions and responses for air quality prediction. We gathered daily air quality data, meteorological data and Weibo check-in data for Beijing, China from January 1, 2015 to December 31, 2016. The average sentiment of the related Weibo posts was selected as the public response proxy. The performance of our proposed model is evaluated based on real data. The root-mean-square error (RMSE) and the mean absolute error (MAE) indicated that our method presented better prediction results than traditional methods in terms of the PM2.5, PM10, O3, NO2, SO2 and CO concentrations. We focused on the prediction performance during the 2015 China Victory Day Parade period, during which social and political factors played an important role in air quality predictions. The results indicated that the proposed method, which incorporates public response data, was especially suitable for predicting the air quality in extreme short-term social events and provides a timely social measurement and feedback for environmental problems.
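
The abstract evaluates predictions with RMSE and MAE. As a minimal sketch of how these two metrics are computed from their standard definitions (not the authors' evaluation code), the example below uses hypothetical daily PM2.5 values; the function names `rmse` and `mae` and the sample numbers are assumptions made for illustration only.

```python
import math


def rmse(observed, predicted):
    """Root-mean-square error: square root of the mean squared difference."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(observed, predicted):
    """Mean absolute error: mean of the absolute differences."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    absolute = [abs(o - p) for o, p in zip(observed, predicted)]
    return sum(absolute) / len(absolute)


# Hypothetical daily PM2.5 concentrations (illustrative values, not from the paper).
observed = [35.0, 80.0, 120.0, 60.0]
predicted = [40.0, 70.0, 110.0, 65.0]

print("RMSE:", rmse(observed, predicted))
print("MAE:", mae(observed, predicted))
```

Because RMSE squares each error before averaging, it penalizes large misses more heavily than MAE, which is one reason the two are commonly reported together.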

Application of Data Mining to the Analysis of Meteorological Data for Air Quality Prediction: A Case Study in Shenyang

Spatiotemporal variation in the impact of meteorological conditions on PM2.5 pollution in China from 2000 to 2017

Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai

Relevance Analysis and Short-Term Prediction of PM2.5 Concentrations in Beijing Based on Multi-Source Data

Association of PM2.5 Pollution with the Pattern of Human Activity: A Case Study of a Developed City in Eastern China

Effects of Ginkgo biloba extract on acute cerebral ischemia in rats analyzed by magnetic resonance spectroscopy.

A Case Analysis of Dust Weather and Prediction of PM10 Concentration Based on Machine Learning at the Tibetan Plateau

Reliability Assessment of PM2.5 Concentration Monitoring Data: A Case Study of China

Hybrid Data Mining Forecasting System Based on Multi-Objective Optimization and Selection Model for Air pollutants

Decision intelligence-driven predictive modelling of air quality index in surface mining

Prediction of PM2.5 Concentration Using Spatiotemporal Data with Machine Learning Models

A Long Short-Term Memory Approach to Predicting Air Quality Based on Social Media Data

Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain

Machine learning and deep learning modeling and simulation for predicting PM2.5 concentrations

Predicting ambient PM2.5 concentrations via time series models in Anhui Province, China

Prediction into the future: A novel intelligent approach for PM2.5 forecasting in the ambient air of open-pit mining

Automatic detection of targets against cluttered backgrounds using a fractal-oriented statistical analysis and Radon transform

Time series-based PM2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models

Evaluating drivers of PM2.5 air pollution at urban scales using interpretable machine learning

Association rule mining of air quality through an improved Apriori algorithm: A case study in 244 Chinese cities

Spatiotemporal dynamics and exposure analysis of daily PM2.5 using a remote sensing-based machine learning model and multi-time meteorological parameters