Cross-course Process Mining of Student Clickstream Data -- Aggregation and Group Comparison

Tobias Hildebrandt,Lars Mehnen
2024-09-04
Abstract:This paper introduces novel methods for preparing and analyzing student interaction data extracted from course management systems like Moodle to facilitate process mining, like the creation of graphs that show the process flow. Such graphs can get very complex as Moodle courses can contain hundreds of different activities, which makes it difficult to compare the paths of different student cohorts. Moreover, existing research often confines its focus to individual courses, overlooking potential patterns that may transcend course boundaries. Our research addresses these challenges by implementing an automated dataflow that directly queries data from the Moodle database via SQL, offering the flexibility of filtering on individual courses if needed. In addition to analyzing individual Moodle activities, we explore patterns at an aggregated course section level. Furthermore, we present a method for standardizing section labels across courses, facilitating cross-course analysis to uncover broader usage patterns. Our findings reveal, among other insights, that higher-performing students demonstrate a propensity to engage more frequently with available activities and exhibit more dynamic movement between objects. While these patterns are discernible when analyzing individual course activity-events, they become more pronounced when aggregated to the section level and analyzed across multiple courses.
Computers and Society,Human-Computer Interaction
What problem does this paper attempt to address?
This paper attempts to solve the following problems: 1. **How to prepare LMS click - stream data for cross - course process analysis?** - This research aims to develop an automated workflow to extract and process student interaction data from the Moodle database for cross - course process mining. This includes standardizing activity names in different courses and aggregating data to the course section level, thereby simplifying cross - course comparisons. 2. **What are the differences in LMS path characteristics between high - performing students and low - performing students?** - The study found that high - performing students participate in available activities more frequently and move more dynamically between objects. These patterns are more evident when aggregated to the course section level and analyzed across multiple courses. Specifically, high - performing students are more likely to engage in self - study before attending class, while low - performing students are more likely to skip the self - study part and directly enter the classroom. 3. **Are there distinguishable cross - course student learning path patterns? Do these patterns vary by academic performance?** - The research revealed patterns in student learning paths through cross - course analysis and found that these patterns do indeed vary by academic performance. For example, high - performing students jump between course sections more frequently, showing more complex path behaviors. ### Specific Problem Analysis #### 1. Data Preparation and Cross - Course Analysis To achieve cross - course process analysis, researchers proposed an automated workflow to directly query data from the Moodle database and filter data for specific courses through SQL statements. In addition, they explored methods of aggregating data at the course section level to simplify complexity and reveal broader usage patterns. By standardizing section labels in different courses, researchers were able to conduct cross - course analysis, thereby obtaining a more comprehensive view of student participation and learning behaviors. #### 2. Differences in Student Path Characteristics By comparing the learning paths of high - performing and low - performing students, the study found: - High - performing students participate in various activities more frequently and move more flexibly between different objects. - These patterns are already visible at the individual event level, but are more significant when aggregated to the course section level and analyzed across multiple courses. - For example, in some courses, high - performing students are more likely to engage in self - study (such as the "self study" section) before class, while low - performing students are more likely to skip these sections and directly enter the classroom (such as the "class" section). #### 3. Cross - Course Learning Path Patterns Through cross - course analysis, researchers found some common learning path patterns: - High - performing students jump between course sections more frequently, showing more complex path behaviors. - For example, high - performing students switch back and forth between the self - study section and the classroom more frequently, while low - performing students show this behavior less often. - In addition, as the course content deepens, the interaction frequency of all students with advanced sections decreases, but high - performing students still maintain a relatively high level of participation. ### Conclusion This research has successfully addressed the challenges of cross - course process mining by developing a new toolchain and method. The research results show that high - performing students exhibit more positive and diverse patterns in course participation and learning paths, and these patterns are particularly evident in cross - course analysis. This finding provides valuable insights for educators, helping to optimize instructional design and improve students' learning outcomes.