Abstract:Fine-grained city-scale outdoor air pollution maps provide important environmental information for both city managers and residents. Installing portable sensors on vehicles (e.g., taxis, Ubers) provides a low-cost, easy-maintenance, and high-coverage approach to collecting data for air pollution estimation. However, as non-dedicated platforms, vehicles like taxis usually prefer gathering at busy areas of a city where it is more likely to pick up riders. This leaves many parts of the city unsensed or less-sensed. In addition, due to the natural changes in a city and the movements of the vehicles, the sensed and unsensed areas change overtime. Consequently, challenges of air pollution estimation with data collected by non-dedicated mobile platforms are twofold: i. data coverage is sparse; ii. data coverage changes over time. Therefore, the major research question is: how can we derive accurate and robust fine-grained field (e.g., air pollution) estimation given dynamic and sparse data collected from uncontrollable mobile sensing platforms? This paper presents adaptive HMSS, an adaptive hybrid model-enabled sensing system for fine-grained air pollution estimation with dynamic and sparse data collected from uncontrollable mobile sensing platforms, which is achieved by combining the advantages of a physics guided model and a data driven model. To address the challenge of sparse coverage, the physical understanding of the spatiotemporal correlation for air pollution distribution in the physics guided model is utilized to infer values at unsensed sparse areas. Meanwhile, the data driven model is adopted to estimate the air pollution influential factors (e.g., buildings) not included in the physics guided mode!. To address the challenge of time-varying coverage, an adaptive model combination algorithm is designed to enable the system bias to either of the two models according to the amount of data collection and uncertainty of the model. To evaluate the system performance, we deployed 47 air pollution sensing devices on taxis and fixed locations in 2 cities for both controlled and uncontrolled experiments for over two weeks. The results show that with a resolution of 500 m by 500 m by 1 hour, our system achieves up to 3.2 x error reduction when compared to the baseline approaches.

A Data-Driven Supervised Machine Learning Approach to Estimating Global Ambient Air Pollution Concentrations With Associated Prediction Intervals

A Framework for Scalable Ambient Air Pollution Concentration Estimation

Data-Driven Air Quality Characterization for Urban Environments: A Case Study

Predicting air quality via multimodal AI and satellite imagery

Air Quality Forecasting Using Machine Learning: A Global perspective with Relevance to Low-Resource Settings

Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe

Urban Air Pollution Forecasting: a Machine Learning Approach leveraging Satellite Observations and Meteorological Forecasts

Estimation of Air Pollution with Remote Sensing Data: Revealing Greenhouse Gas Emissions from Space

Air Pollution Monitoring and Prediction using Machine Learning Algorithms

Guiding the Data Learning Process with Physical Model in Air Pollution Inference.

Machine Learning for a Low-cost Air Pollution Network

Multi-Site and Multi-Pollutant Air Quality Data Modeling

High spatio-temporal resolution predictions of PM 2.5 using low-cost sensor data

A comparison of statistical and machine learning models for spatio-temporal prediction of ambient air pollutant concentrations in Scotland

Modeling fine-grained spatio-temporal pollution maps with low-cost sensors

A comparison of statistical and machine learning methods for creating national daily maps of ambient PM$_{2.5}$ concentration

Adaptive Hybrid Model-Enabled Sensing System (HMSS) for Mobile Fine-Grained Air Pollution Estimation

Estimating hourly PM2.5 concentrations at the neighborhood scale using a low-cost air sensor network: A Los Angeles case study

Pragmatic estimation of a spatio-temporal air quality model with irregular monitoring data

Estimating Ambient Air Pollution Using Structural Properties of Road Networks

Data-driven Air Quality Characterisation for Urban Environments: a Case Study