Feature fusion combined with tissue Raman spectroscopy to screen cervical cancer
Bo Yang,Cheng Chen,Fangfang Chen,Cailing Ma,Chen Chen,Huiting Zhang,Rui Gao,Shuailei Zhang,Xiaoyi Lv
DOI: https://doi.org/10.1002/jrs.6246
IF: 2.727
2021-01-01
Journal of Raman Spectroscopy
Abstract:In this experiment, we collected 45 samples of cervicitis, 29 samples of low-grade squamous intraepithelial lesion (LSIL), 44 samples of high-grade squamous intraepithelial lesion (HSIL), 39 samples of cervical squamous cell carcinoma, and 38 cases of cervical adenocarcinoma. After preprocessing of the Raman spectral data, partial least squares (PLS) was used to reduce the dimensionality, and then extreme gradient boosting (XGBoost) was used for feature selection to obtain the first 30-dimensional features. The preprocessed Raman spectral data also used a fast Fourier transform (FFT) to obtain amplitude information, and then PLS and XGBoost were used to obtain the first 30-dimensional features. Finally, K nearest neighbor (KNN), extreme learning machine (ELM), artificial bee colony support vector machine (ABC-SVM), support vector machine optimized by the cuckoo search algorithm (CS-SVM), particle swarm optimization coupled with support vector machine (PSO-SVM), and the convolutional neural network combined with long- and short-term memory (CNN-LSTM) classification models were established. In the raw Raman spectral features experiments, the classification accuracies of KNN, ELM, ABC-SVM, CS-SVM, PSO-SVM, and CNN-LTSM were 60.76%, 65.81%, 76.21%, 77.66%, 73.50%, and 69.19%, respectively. In the feature fusion experiments, the classification accuracies were 60.91%, 67.84%, 77.64%, 78.49%, 75.54%, and 70.72%, respectively. The experimental results show that feature fusion can further improve model performance regardless of whether using linear classification models or nonlinear classification models. Therefore, it provides a new strategy for extracting features and screening multiple cervical pathological tissues in the future.