A Method for Predicting the Impact of Labor Education at Institutions of Higher Education Based on a Two-Stage Clustering Algorithm

Lei Zhang
DOI: https://doi.org/10.1142/s0129156425400208
2024-01-01
International Journal of High Speed Electronics and Systems
Abstract: To help universities develop more accurate teaching plans and methods based on students’ characteristics and needs, and thereby improve the effectiveness of labor education, this paper proposes a method for predicting the effectiveness of labor education in universities based on a two-stage clustering algorithm. After an in-depth analysis of university labor education data, principal component analysis (PCA) is used to select the 16 most representative variables from the original features. A self-organizing map (SOM) neural network then performs preliminary clustering, which determines the optimal number of clusters to be 8. Next, the K-means algorithm performs fine clustering starting from the initial centers provided by the SOM, which significantly improves the stability and accuracy of the clustering. Outliers in the clustering results are also processed to reduce their impact on clustering quality. Finally, the processed data is fed into an improved XGBoost prediction model, which achieves a true positive rate (TPR) of over 85% on the ROC curve while maintaining a low false positive rate (FPR), significantly outperforming the comparison methods. These results validate the proposed method's feature selection, clustering optimization, and prediction model construction, and confirm that it can accurately predict the labor education effectiveness of students in different grades, providing strong support for higher education institutions seeking to optimize teaching plans and improve educational outcomes.
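The two-stage clustering step described in the abstract (a SOM for preliminary clustering, then K-means started from the SOM's centers) can be sketched roughly as follows. This is a minimal illustration in Python with NumPy, not the author's implementation. The 2x4 SOM grid (chosen only so that it yields 8 units), the learning-rate and neighborhood schedules, the synthetic 16-feature data, and the function names `train_som`, `kmeans_from_centers`, and `inertia` are all assumptions made for the example. PCA, outlier handling, and the XGBoost model are not shown.

```python
import numpy as np


def train_som(X, rows=2, cols=4, epochs=20, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small self-organizing map; returns (rows*cols, n_features) weights."""
    rng = np.random.default_rng(seed)
    # Grid coordinates of each SOM unit, used by the neighborhood function.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialize the unit weights from randomly chosen samples.
    weights = X[rng.choice(len(X), size=rows * cols, replace=False)].astype(float)
    n_steps = epochs * len(X)
    step = 0
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                    # linearly decaying learning rate
            sigma = max(sigma0 * (1.0 - frac), 0.1)    # shrinking neighborhood radius
            bmu = np.argmin(((weights - X[i]) ** 2).sum(axis=1))  # best-matching unit
            grid_d2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
            h = np.exp(-grid_d2 / (2.0 * sigma ** 2))  # Gaussian neighborhood weights
            weights += lr * h[:, None] * (X[i] - weights)
            step += 1
    return weights


def assign(X, centers):
    """Index of the nearest center for every sample."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def kmeans_from_centers(X, init_centers, n_iter=100):
    """Lloyd's K-means started from the given centers instead of random ones."""
    centers = init_centers.copy()
    for _ in range(n_iter):
        labels = assign(X, centers)
        new_centers = centers.copy()
        for k in range(len(centers)):
            members = X[labels == k]
            if len(members):                 # keep the old center if a cluster is empty
                new_centers[k] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(X, centers), centers


def inertia(X, centers):
    """Sum of squared distances from each sample to its nearest center."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).sum()


# Synthetic stand-in data (assumption): 8 groups of 50 samples with 16 features.
rng = np.random.default_rng(42)
true_centers = rng.normal(0.0, 5.0, size=(8, 16))
X = np.vstack([c + rng.normal(0.0, 1.0, size=(50, 16)) for c in true_centers])

som_weights = train_som(X)                              # stage 1: coarse clustering
labels, centers = kmeans_from_centers(X, som_weights)   # stage 2: fine clustering

print("inertia with SOM centers:  ", inertia(X, som_weights))
print("inertia after K-means step:", inertia(X, centers))
```

Seeding K-means with the SOM weights removes the dependence on random initialization, which is one plausible reading of the stability improvement the abstract reports. Because Lloyd's iterations never increase the within-cluster sum of squares, the refined centers fit the data at least as tightly as the SOM units they started from.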