PM4Py-GPU: a High-Performance General-Purpose Library for Process Mining

Alessandro Berti,Minh Phan Nghia,Wil M.P. van der Aalst
DOI: https://doi.org/10.48550/arXiv.2204.04898
2022-04-11
Abstract:Open-source process mining provides many algorithms for the analysis of event data which could be used to analyze mainstream processes (e.g., O2C, P2P, CRM). However, compared to commercial tools, they lack the performance and struggle to analyze large amounts of data. This paper presents PM4Py-GPU, a Python process mining library based on the NVIDIA RAPIDS framework. Thanks to the dataframe columnar storage and the high level of parallelism, a significant speed-up is achieved on classic process mining computations and processing activities.
Databases
What problem does this paper attempt to address?