Environmental Insights: Democratizing Access to Ambient Air Pollution Data and Predictive Analytics with an Open-Source Python Package

Liam J Berrisford, Ronaldo Menezes
2024-03-06
Abstract: Ambient air pollution is a pervasive issue with wide-ranging effects on human health, ecosystem vitality, and economic structures. Utilizing data on ambient air pollution concentrations, researchers can perform comprehensive analyses to uncover the multifaceted impacts of air pollution across society. To this end, we introduce Environmental Insights, an open-source Python package designed to democratize access to air pollution concentration data. This tool enables users to easily retrieve historical air pollution data and employ a Machine Learning model for forecasting potential future conditions. Moreover, Environmental Insights includes a suite of tools aimed at facilitating the dissemination of analytical findings and enhancing user engagement through dynamic visualizations. This comprehensive approach ensures that the package caters to the diverse needs of individuals looking to explore and understand air pollution trends and their implications.
Physics and Society, Machine Learning
What problem does this paper attempt to address?
This paper introduces "Environmental Insights", an open-source Python package that aims to widen access to ambient air pollution data and predictive analysis. The problem it targets is that, despite numerous studies on predicting air pollution concentrations, connecting those predictions to the actual needs of stakeholders remains a challenge. The package addresses this by offering historical data downloads, machine learning predictions of future conditions, and dynamic visualization tools, so that non-experts can understand and take part in the study of air pollution trends.

The main contribution is a user-friendly platform that lets users access high-resolution air pollution data and make predictions, together with a set of tools for disseminating analysis results and increasing public engagement. The platform focuses on lowering the entry barrier for individuals and communities with limited resources or technical knowledge, which in turn supports public calls for air quality improvement. The paper also discusses applications of air pollution data in health assessment, ecosystem conservation, economic impact analysis, and fundamental research.

Existing software tools can analyze air pollution data but lack lightweight, user-friendly predictive models. "Environmental Insights" therefore uses machine learning to speed up prediction, so that it runs on a regular laptop and requires no specialized knowledge, allowing more stakeholders to take part in air pollution interventions. In short, the paper addresses how an open-source Python package can make air pollution data accessible and understandable to more people, let them perform predictive analysis, and help them participate actively in decisions and actions related to air pollution.