Advancing Object-Centric Process Mining with Multi-Dimensional Data Operations

Shahrzad Khayatbashi, Najmeh Miri, Amin Jalali
2024-11-30
Abstract: Analyzing process data at varying levels of granularity is important to derive actionable insights and support informed decision-making. Object-Centric Event Data (OCED) enhances process mining by capturing interactions among multiple objects within events, leading to the discovery of more detailed and realistic, yet more complex, process models. The lack of methods to adjust the granularity of the analysis prevents users from leveraging the full potential of Object-Centric Process Mining (OCPM). To address this gap, we propose four operations: drill-down, roll-up, unfold, and fold, which enable changing the granularity of analysis when working with Object-Centric Event Logs (OCEL). These operations allow analysts to seamlessly transition between detailed and aggregated process models, facilitating the discovery of insights that require varying levels of abstraction. We formally define these operations and implement them in an open-source Python library. To validate their utility, we applied the approach to real-world OCEL data extracted from a learning management system, covering a four-year period and approximately 400 students. Our evaluation demonstrates significant improvements in precision and fitness metrics for models discovered before and after applying these operations. This approach can empower analysts to perform more flexible and comprehensive process exploration, unlocking actionable insights through adaptable granularity adjustments.
Databases
What problem does this paper attempt to address?
This paper addresses the inability to flexibly adjust the analysis granularity when analyzing process data in Object-Centric Process Mining (OCPM). Specifically, existing OCPM methods usually analyze at a relatively high level of abstraction, which limits the user's ability to discover more detailed and realistic process models at different granularity levels.

#### Main problems:

1. **Lack of methods for adjusting analysis granularity**: Most existing OCPM algorithms operate at a relatively high level of abstraction, making it difficult to reveal certain details, such as the direct relationships between different tests.
2. **Need for dynamic adjustment of analysis granularity**: To better understand complex data environments and match the granularity to specific analysis goals, new operations are needed.

#### Proposed solutions:

To solve these problems, the authors propose four operations: drill-down, roll-up, unfold, and fold. These operations change the analysis granularity of Object-Centric Event Logs (OCEL), letting users switch seamlessly between detailed low-level views and high-level overviews and thereby discover insights that require different levels of abstraction.

- **Drill-down**: Combine an object type with specific attribute values to distinguish different kinds of objects. For example, subdivide the "test" object type into "ECG test" and "blood test".
- **Roll-up**: Aggregate the subdivided object types back into the original object type. For example, aggregate "ECG test" and "blood test" back into "test".
- **Unfold**: Expand an event type into a combination of event type and object type, so that activities are distinguished by the object types they involve. For example, expand "order test" into "order ECG test" and "order blood test".
- **Fold**: Aggregate the expanded event types back into the original event type. For example, aggregate "order ECG test" and "order blood test" back into "order test".

Through these operations, users can explore object-centric event logs more flexibly and reveal process patterns hidden at different granularity levels. The paper also validates the operations on a real-world case, reporting significant improvements in the precision and fitness of the discovered process models.

#### Summary:

This paper addresses the lack of granularity adjustment in OCPM by introducing new multi-dimensional data analysis operations, enabling users to explore and understand complex process data more flexibly.
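
The four operations can be illustrated with a minimal Python sketch on a toy log. This is not the API of the authors' library: the dictionary layout, the function signatures, the `base_type` bookkeeping field, and the bracketed naming of unfolded event types are all assumptions made for illustration, and the formal definitions in the paper may differ in detail.

```python
from copy import deepcopy

# Toy object-centric event log (hypothetical layout, not the OCEL standard):
# objects carry a type and attributes; events carry a type and related object ids.
log = {
    "objects": {
        "t1": {"type": "test", "attrs": {"kind": "ECG"}},
        "t2": {"type": "test", "attrs": {"kind": "blood"}},
        "p1": {"type": "patient", "attrs": {}},
    },
    "events": [
        {"id": "e1", "type": "order test", "objects": ["p1", "t1"]},
        {"id": "e2", "type": "order test", "objects": ["p1", "t2"]},
    ],
}


def drill_down(log, object_type, attribute):
    """Split object_type into finer types, one per value of the given attribute."""
    out = deepcopy(log)
    for obj in out["objects"].values():
        if obj["type"] == object_type and attribute in obj["attrs"]:
            obj["base_type"] = obj["type"]  # assumed bookkeeping so roll_up can undo this
            obj["type"] = f'{obj["attrs"][attribute]} {object_type}'
    return out


def roll_up(log, object_type):
    """Merge previously drilled-down types back into object_type."""
    out = deepcopy(log)
    for obj in out["objects"].values():
        if obj.get("base_type") == object_type:
            obj["type"] = obj.pop("base_type")
    return out


def unfold(log, event_type, object_type):
    """Specialize event_type for events that involve an object of object_type."""
    out = deepcopy(log)
    for ev in out["events"]:
        related = {out["objects"][o]["type"] for o in ev["objects"]}
        if ev["type"] == event_type and object_type in related:
            ev["base_type"] = ev["type"]  # assumed bookkeeping so fold can undo this
            ev["type"] = f"{event_type} [{object_type}]"
    return out


def fold(log, event_type):
    """Merge previously unfolded event types back into event_type."""
    out = deepcopy(log)
    for ev in out["events"]:
        if ev.get("base_type") == event_type:
            ev["type"] = ev.pop("base_type")
    return out


# Refine "test" by its "kind" attribute, then unfold "order test" per refined type.
detailed = drill_down(log, "test", "kind")
unfolded = unfold(unfold(detailed, "order test", "ECG test"), "order test", "blood test")
event_types = [ev["type"] for ev in unfolded["events"]]

# Fold and roll-up act as inverses here, restoring the original log.
restored = roll_up(fold(unfolded, "order test"), "test")
```

In this sketch each operation returns a modified copy rather than mutating its input, so the detailed and aggregated views can be kept side by side and passed independently to a discovery algorithm.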