Raman Spectroscopy and Machine Learning for the Classification of Esophageal Squamous Carcinoma
Wenhua Huang, Qixin Shang, Xin Xiao, Hanlu Zhang, Yimin Gu, Lin Yang, Guidong Shi, Yushang Yang, Yang Hu, Yong Yuan, Aifang Ji, Longqi Chen
DOI: https://doi.org/10.1016/j.saa.2022.121654
IF: 4.831
2022-01-01
Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy
Abstract: Early diagnosis of esophageal squamous cell carcinoma (ESCC), a common malignant tumor with a low overall survival rate due to metastasis and recurrence, is critical for effective treatment and improved prognosis. Raman spectroscopy, an advanced detection technology for esophageal cancer, was developed to improve diagnostic sensitivity, specificity, and accuracy. This study proposed a novel, effective, and noninvasive Raman spectroscopy technique to differentiate and classify ESCC cell lines. Seven ESCC cell lines and tissues of an ESCC patient with staging of T3N1M0 and T3N2M0 at low and high differentiation levels were investigated through Raman spectroscopy. Raman spectral data were analyzed with four machine learning algorithms: principal component analysis (PCA)-linear discriminant analysis (LDA), PCA-eXtreme gradient boosting (XGB), PCA-support vector machine (SVM), and PCA-(LDA, XGB, SVM)-stacked Gradient Boosting Machine (GBM). All four algorithms were able to distinguish ESCC cell subtypes from normal esophageal cells. The PCA-XGB model achieved an overall predictive accuracy of 85% for classifying ESCC and adjacent tissues. Moreover, an overall predictive accuracy of 90.3% was achieved in distinguishing low-differentiation from high-differentiation ESCC tissues of the same stage when the PCA-LDA, XGB, and SVM models were combined. This study illustrated the Raman spectral traits of ESCC cell lines and esophageal tissues related to clinical pathological diagnosis. Future studies should investigate the role of Raman spectral features in ESCC pathogenesis.
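
The four model families named in the abstract can be illustrated with a minimal scikit-learn sketch. This is not the authors' implementation. The data are synthetic stand-ins for Raman spectra, the number of principal components and all hyperparameters are arbitrary assumptions, and scikit-learn's GradientBoostingClassifier is swapped in for XGBoost so that the example depends on one library only.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import GradientBoostingClassifier, StackingClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for spectra: 200 samples x 300 "wavenumber" features.
# Real Raman data would need baseline correction and normalization first.
X, y = make_classification(n_samples=200, n_features=300, n_informative=20,
                           n_classes=2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0)


def pca_then(clf, n_components=10):
    """Standardize, reduce with PCA, then classify (n_components is assumed)."""
    return make_pipeline(StandardScaler(),
                         PCA(n_components=n_components, random_state=0),
                         clf)


base_models = {
    "pca_lda": pca_then(LinearDiscriminantAnalysis()),
    # Stand-in for XGBoost: scikit-learn's own gradient boosting.
    "pca_gb": pca_then(GradientBoostingClassifier(random_state=0)),
    "pca_svm": pca_then(SVC(kernel="rbf", probability=True, random_state=0)),
}

# Stacking: out-of-fold predictions of the three base models become the
# inputs of a gradient boosting meta-learner.
stacked = StackingClassifier(
    estimators=list(base_models.items()),
    final_estimator=GradientBoostingClassifier(random_state=0),
    cv=5,
)

accuracies = {}
for name, model in {**base_models, "stacked_gbm": stacked}.items():
    model.fit(X_train, y_train)
    accuracies[name] = model.score(X_test, y_test)  # held-out accuracy

for name, acc in accuracies.items():
    print(f"{name}: {acc:.3f}")
```

The accuracies printed here describe the synthetic data only and say nothing about the 85% and 90.3% figures reported in the abstract.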