Improving process discovery by filtering noises based on event dependency

Dongjin Yu, Ke Ni, Zhongyang Li, Shengyi Zhang, Xiaoxiao Sun, Wenjie Hou, Yuke Ying
DOI: https://doi.org/10.3233/ida-230118
IF: 1.7
2024-02-03
Intelligent Data Analysis
Abstract: Process discovery techniques analyze process logs to extract models that characterize the behavior of business processes. In real-life logs, however, noise is present and adversely affects the extraction, which reduces the understandability of the discovered models. In this paper, we propose a novel double-granularity filtering method, executed at both the event and trace levels, that detects noise by analyzing the directly-following and parallel relations between events. Based on the probability of an event occurring in a sequence, infrequent behaviors and redundant events in the logs can be filtered out. In addition, missing events in parallel blocks are detected to further improve filtering performance. Experiments on synthetic logs and five real-life datasets demonstrate that our method significantly outperforms other state-of-the-art methods.
computer science, artificial intelligence
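
The abstract mentions filtering infrequent behavior based on directly-following relations and the probability of an event occurring in a sequence. The sketch below illustrates that general idea only; it is not the authors' algorithm. It assumes a log is a list of traces, each a list of activity names, and it works at the trace level alone: it does not handle event-level filtering, parallel relations, or missing-event detection. The function names and the `threshold` parameter are hypothetical.

```python
from collections import Counter


def directly_follows_probabilities(log):
    """Estimate P(b | a): among occurrences of `a` that have a successor,
    the fraction that are directly followed by `b`."""
    pair_counts = Counter()
    source_counts = Counter()
    for trace in log:
        for a, b in zip(trace, trace[1:]):
            pair_counts[(a, b)] += 1
            source_counts[a] += 1
    return {(a, b): n / source_counts[a] for (a, b), n in pair_counts.items()}


def filter_infrequent_traces(log, threshold=0.2):
    """Keep only traces whose every directly-follows step has an
    estimated probability of at least `threshold` (an assumed cutoff)."""
    probs = directly_follows_probabilities(log)
    return [
        trace
        for trace in log
        if all(probs[(a, b)] >= threshold for a, b in zip(trace, trace[1:]))
    ]


# Toy log: nine regular traces and one with an unusual ordering.
log = [["a", "b", "c"]] * 9 + [["a", "c", "b"]]
filtered = filter_infrequent_traces(log, threshold=0.2)
```

On this toy log, the step from `a` to `c` occurs in only one of the ten cases where `a` has a successor, so the trace containing it falls below the assumed cutoff and is dropped. A fixed global threshold is the simplest possible choice here; the paper's method may well use a different criterion.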