Data- & compute-efficient deviance mining via active learning and fast ensembles

Francesco Folino, Gianluigi Folino, Massimo Guarascio, Luigi Pontieri
DOI: https://doi.org/10.1007/s10844-024-00841-4
2024-01-24
Journal of Intelligent Information Systems
Abstract: Detecting deviant traces in business process logs is crucial for modern organizations, given the harmful impact of deviant behaviours (e.g., attacks or faults). However, training a Deviance Prediction Model (DPM) solely with supervised learning methods is impractical in scenarios where only a few examples are labelled. To address this challenge, we propose an Active-Learning-based approach that leverages multiple DPMs and a temporal ensembling method that can train and merge them in a few training epochs. Our method needs expert supervision only for a few unlabelled traces exhibiting high prediction uncertainty. Tests on real data (of either complete or ongoing process instances) confirm the effectiveness of the proposed approach.
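The selection step mentioned in the abstract (asking an expert to label only the traces with high prediction uncertainty) can be illustrated with a minimal sketch. The abstract does not say how uncertainty is measured or how the DPMs' outputs are merged, so everything here is an assumption made for illustration: the ensemble is merged by averaging per-trace deviance probabilities, uncertainty is the binary entropy of that average, and the function names (`ensemble_mean`, `binary_entropy`, `select_for_labelling`) are hypothetical. The temporal ensembling used to train the DPMs is not shown.

```python
import math
from typing import List, Sequence


def ensemble_mean(probs_per_model: Sequence[Sequence[float]]) -> List[float]:
    """Average the deviance probabilities predicted by each ensemble member.

    Assumption: plain averaging; the paper may merge models differently.
    """
    n_models = len(probs_per_model)
    n_traces = len(probs_per_model[0])
    return [sum(m[i] for m in probs_per_model) / n_models for i in range(n_traces)]


def binary_entropy(p: float) -> float:
    """Entropy (in bits) of a Bernoulli(p) prediction; 0 when fully certain."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def select_for_labelling(
    probs_per_model: Sequence[Sequence[float]], budget: int
) -> List[int]:
    """Return the indices of the `budget` most uncertain unlabelled traces."""
    mean = ensemble_mean(probs_per_model)
    ranked = sorted(
        range(len(mean)), key=lambda i: binary_entropy(mean[i]), reverse=True
    )
    return ranked[:budget]


# Hypothetical predictions: three DPMs scoring four unlabelled traces.
example_probs = [
    [0.95, 0.55, 0.10, 0.30],
    [0.90, 0.45, 0.05, 0.35],
    [0.99, 0.50, 0.15, 0.25],
]
print(select_for_labelling(example_probs, budget=2))  # → [1, 3]
```

In this toy input, traces 1 and 3 have averaged probabilities closest to 0.5, so they are the ones that would be sent to the expert; traces 0 and 2 are predicted confidently and stay unlabelled.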
computer science, information systems, artificial intelligence