Rule based Complex Event Processing for an Air Quality Monitoring System in Smart City

Shashi Shekhar Kumar,Ritesh Chandra,Sonali Agarwal
2024-03-16
Abstract:In recent years, smart city-based development has gained momentum due to its versatile nature in architecture and planning for the systematic habitation of human beings. According to World Health Organization (WHO) report, air pollution causes serious respiratory diseases. Hence, it becomes necessary to real-time monitoring of air quality to minimize effect by taking time-bound decisions by the stakeholders. The air pollution comprises various compositions such as NH3, O3, SO2, NO2, etc., and their concentrations vary from location to location.The research work proposes an integrated framework for monitoring air quality using rule-based Complex Event Processing (CEP) and SPARQL queries. CEP works with the data stream based on predefined rules to detect the complex pattern, which helps in decision support for stakeholders. Initially, the dataset was collected from the Central Pollution Control Board (CPCB) of India and this data was then preprocessed and passed through Apache Kafka. Then a knowledge graph developed based on the air quality paradigm. Consequently, convert preprocessed data into Resource Description Framework (RDF) data, and integrate with Knowledge graph which is ingested to CEP engine using Apache Jena for enhancing the decision support . Simultaneously, rules are extracted using a decision tree, and some ground truth parameters of CPCB are added and ingested to the CEP engine to determine the complex patterns. Consequently, the SPARQL query is used on real-time RDF dataset for fetching the condition of air quality as good, poor, severe, hazardous etc based on complex events detection. For validating the proposed approach various chunks of RDF are used for the deployment of events to the CEP engine, and its performance is examined over time while performing simple and complex queries.
Computers and Society,Databases
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in the smart city environment, how to realize real - time air quality monitoring through the rule - based Complex Event Processing (CEP) system, so as to help relevant stakeholders make timely decisions, thereby reducing the impact of air pollution on human health. Specifically, the paper focuses on the following aspects: 1. **Real - time air quality monitoring**: By collecting and processing air quality data from the Central Pollution Control Board (CPCB), real - time monitoring of the concentrations of air pollutants (such as NH₃, O₃, SO₂, NO₂, etc.) is achieved. 2. **Complex event detection**: Using rule - driven complex event processing technology, complex patterns are detected from the data stream to support decision - making. 3. **Knowledge graph and SPARQL query**: The pre - processed data is converted into RDF format, and combined with the knowledge graph for complex event detection. SPARQL queries are used to obtain air quality conditions (such as good, poor, serious, dangerous, etc.). 4. **Performance evaluation**: Verify the performance of the proposed framework when processing simple and complex queries, ensuring that it can efficiently process large amounts of data and provide real - time insights. ### Key problems and solutions - **Problem**: How to effectively monitor and analyze air quality data in the city in real - time, so as to take measures to deal with air pollution quickly? - **Solutions**: - **Data collection and pre - processing**: Collect air quality data from CPCB and use Apache Kafka for stream processing. - **Rule generation**: Generate rules through the decision tree algorithm and CPCB standard parameters for detecting complex events. - **Event processing**: Use the Siddhi CEP engine to process the data stream and extract useful information. - **Knowledge graph and query**: Convert the data into RDF format, combine with the knowledge graph for complex event detection, and use SPARQL query to obtain air quality status. ### Main contributions 1. Designed an integrated framework for deploying events from the data stream to the CEP engine. 2. Extracted rules verified by the decision tree and CPCB standard parameters. 3. Achieved efficient processing of simple and complex queries. 4. Provided real - time insights on air quality data streams from a purely environmental perspective. 5. Evaluated the event processing performance based on the Siddhi CEP engine, especially the execution time on different data blocks. 6. Built a CEP - based air quality monitoring decision support system (DSS). Through these methods, the paper aims to improve the efficiency and accuracy of air quality monitoring in smart cities, thereby better protecting the health of residents.