Prediction of Cervical Lymph Nodes Recurrence after Radiotherapy for Early Nasopharyngeal Carcinoma Via Unsupervised Diagnostic Feature Learning and Supervised Ensemble Classifier Learning

Zhenkun Lu, Haohan Wei, Fengyu Ye, Sheng Li, Qinghua Huang
DOI: https://doi.org/10.1016/j.bspc.2024.106075
IF: 5.1
2024-01-01
Biomedical Signal Processing and Control
Abstract: Purpose: Nasopharyngeal carcinoma is one of the most prevalent malignant tumors in Guangdong, China. Predicting the recurrence of early-stage nasopharyngeal carcinoma after radiotherapy is clinically important. Our objective was to develop a novel classification model for predicting recurrence following radiotherapy for early nasopharyngeal carcinoma. Methods and Materials: This paper combines unsupervised diagnostic feature learning (a biclustering algorithm) with AdaBoost ensemble learning to create a novel classification model. Notably, this is the first time a biclustering algorithm has been applied to the nasopharyngeal carcinoma dataset. First, the Borderline-SMOTE oversampling algorithm was employed to address the dataset's class imbalance. Next, the biclustering algorithm, which is based on an improved multi-objective genetic algorithm (NSGA-II), was used to search for biclusters exhibiting consistent representation patterns. The attributes of these biclusters were assessed and used as diagnostic rules for constructing weak classifiers. Finally, AdaBoost ensemble learning was used to combine the weak classifiers into a strong classifier. Results: Under 10-fold cross-validation, the model achieved an accuracy of 80.33%, sensitivity of 79.55%, specificity of 80.43%, and GMean of 79.99%, with an AUC of 0.904. Conclusions: The presented classification model outperformed alternative classification models in classification accuracy and generalization on the nasopharyngeal carcinoma dataset. Consequently, it can help medical professionals predict the likelihood of recurrence in the cervical lymph nodes following radiotherapy for early-stage nasopharyngeal carcinoma.
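The reported metrics can be related to one another with a short sketch. The abstract does not define GMean, so this assumes the common definition: the geometric mean of sensitivity and specificity. The function names and the confusion-matrix counts in the usage lines are hypothetical and are not taken from the paper.

```python
import math


def g_mean(sensitivity, specificity):
    """Geometric mean of sensitivity and specificity (assumed definition of GMean)."""
    return math.sqrt(sensitivity * specificity)


def binary_metrics(tp, fn, tn, fp):
    """Compute accuracy, sensitivity, specificity and G-mean from confusion-matrix counts.

    tp/fn count recurrence cases predicted correctly/incorrectly;
    tn/fp count non-recurrence cases predicted correctly/incorrectly.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "g_mean": g_mean(sensitivity, specificity),
    }


# Check the abstract's figures against the assumed definition.
reported = g_mean(0.7955, 0.8043)
print(f"G-mean from reported sensitivity and specificity: {reported:.4f}")

# Hypothetical counts, for illustration only (not data from the paper).
example = binary_metrics(tp=8, fn=2, tn=9, fp=1)
print(example)
```

Under this assumed definition, the reported sensitivity (79.55%) and specificity (80.43%) give a G-mean that rounds to 79.99%, which agrees with the value in the abstract. G-mean is often preferred over plain accuracy on imbalanced data because it falls sharply when either class is predicted poorly.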