LLM-based event abstraction and integration for IoT-sourced logs

Mohsen Shirali,Mohammadreza Fani Sani,Zahra Ahmadi,Estefania Serral
2024-09-05
Abstract:The continuous flow of data collected by Internet of Things (IoT) devices, has revolutionised our ability to understand and interact with the world across various applications. However, this data must be prepared and transformed into event data before analysis can begin. In this paper, we shed light on the potential of leveraging Large Language Models (LLMs) in event abstraction and integration. Our approach aims to create event records from raw sensor readings and merge the logs from multiple IoT sources into a single event log suitable for further Process Mining applications. We demonstrate the capabilities of LLMs in event abstraction considering a case study for IoT application in elderly care and longitudinal health monitoring. The results, showing on average an accuracy of 90% in detecting high-level activities. These results highlight LLMs' promising potential in addressing event abstraction and integration challenges, effectively bridging the existing gap.
Databases,Emerging Technologies,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: how to use large - language models (LLMs) to automatically process and integrate raw sensor data from Internet of Things (IoT) devices to generate event logs suitable for further analysis (such as process mining). Specifically, the paper focuses on abstracting low - level sensor readings into high - level event records and merging logs from multiple IoT sources into a unified event log. ### Main problems 1. **Event abstraction**: Extract meaningful event information from raw sensor data. 2. **Log integration**: Merge logs from multiple different IoT sources into a comprehensive view for more comprehensive analysis. ### Background and challenges - **Unstructured nature of raw data**: The raw sensor data generated by IoT devices is usually unstructured and uninformative, and is not suitable for direct use in data analysis. - **Limitations of manual processing**: Traditional event log generation methods usually require human intervention, which is not only time - consuming but also error - prone, especially when dealing with multi - source heterogeneous data. - **Complexity of multi - modal data**: The data sources in IoT systems are diverse, and the formats and granularities are inconsistent, which increases the difficulty of data integration. ### Solutions The paper proposes an LLMs - based method to automatically handle these challenges: - **Event abstraction**: Convert low - level sensor readings into high - level activity events through LLMs. - **Log integration**: Use LLMs to merge logs from multiple IoT sources into a unified event log for subsequent process mining and other data analysis applications. ### Experimental verification To verify the effectiveness of this method, the paper uses an actual IoT data set, which contains the daily activity records of a 60 - year - old woman in an independent living apartment for 146 days. The experimental results show that the average accuracy rate of LLMs in detecting high - order activities is 90%, which proves its potential in event abstraction and integration tasks. ### Key contributions 1. **Automated event detection and labeling**: Use LLMs to automatically detect and label low - level sensor data to generate abstract events. 2. **Real - time event log generation**: Develop a data - flow - based event abstraction and generation method, which is suitable for online applications. 3. **Multi - source log integration**: Merge information from multiple sources into a unified event log to enhance the analysis ability. Through these contributions, the paper shows the great potential of LLMs in simplifying the event log generation process, improving efficiency and accessibility, thereby enhancing the application value of IoT data in data - driven analysis.