A Confident Learning-Based Support Vector Machine for Robust Ground Classification in Noisy Label Environments

Xin-Yue Zhang,Xiao-Ping Zhang,Hong-Gan Yu,Quan-Sheng Liu
DOI: https://doi.org/10.1016/j.tust.2024.106128
IF: 6.9
2025-01-01
Tunnelling and Underground Space Technology
Abstract:Geological labels obtained from field exploration have potential errors due to technique limitations and subjective interference, leading to noisy labels when developing ground-machine interaction models for TBM tunneling. The present study proposes a novel confident learning-based support vector machine (CL-SVM) to eliminate label noise, thereby improving the accuracy and credibility of ground classification. The proposed model optimizes confidence values for each label and recognizes those with low confidence values as potential noise. Its effectiveness and superiority are confirmed through a noise test. The results indicate that the maximum acceptable noise ratio of the CL-SVM is 35%, while that of the conventional SVM is only 10%. In addition, the CLSVM consistently emerges as a superior performer compared to the SVM in noisy label environments. The CLSVM is further verified through its application on a class-imbalanced dataset collected from a metro tunnel project in Wuhan, China. Here, the accuracy metric F1-score for the most noise-interfered class is significantly improved from 0.7273 to 0.88. To enhance the model's practical value, a confidence criterion is established to evaluate the credibility of individual predictions, which requires reliable predictions to have higher confidence values than specified thresholds. Without prior knowledge of true sample labels, this criterion distinguishes mispredictions from correct predictions with a remarkable precision of 99.08%. In summary, the proposed CLSVM exhibits significantly better robustness to noisy labels than conventional models, demonstrating great potential for ground perception in tunnel projects.
What problem does this paper attempt to address?