Implementation of Random Forest Algorithm for Graduation Prediction

Fajar Riskiyono,Deni Mahdiana
DOI: https://doi.org/10.33395/sinkron.v8i3.13750
2024-07-10
SinkrOn
Abstract:University also has responsibility for the period of study taken by students in accordance with the level of education taken. The prediction of student study duration is designed to support the study program in guiding students to graduate on time. In this problem, data mining techniques can be applied to make predictions, namely by using the Random Forest classification method. The stages used in this study are data collecting, namely collecting student data, the data selection stage of 300 students with 5 (five) input data attributes including personal data (gender, age, marital status, job status) and academic data (grade) and 1 (one) attribute as an output containing choices about on time and late. The next stage is preprocessing with the aim of eliminating duplication, noise, and missing values, the stage of data transformation by normalizing age attributes (young and old), grade (large and small). Then the data split stage 3 times, namely 50/50, 40/60, and 30/60, the modeling stage with random forest, and finally, the evaluation stage by analyzing the confusion matrix consisting of accuracy, precision, and recall. The results of the study show that the proposed model can do well with predictions, that is, with the same results for all three data splits. The test value is 100% accuracy, 100% recall, and 100% precision. With this value, the success rate for predicting the timeliness of student graduation will be more accurate
What problem does this paper attempt to address?