Multi-scale sequential feature selection for disease classification using Raman spectroscopy data

Yue Wei, Hechang Chen, Bo Yu, Chengyou Jia, Xianling Cong, Lele Cong
DOI: https://doi.org/10.1016/j.compbiomed.2023.107053
Abstract: Raman spectroscopy (RS) optical technology promises fast, non-destructive, single-step application in medical disease diagnosis. However, achieving clinically relevant performance levels remains challenging due to the inability to search for significant Raman signals at different scales. Here we propose a multi-scale sequential feature selection method that captures both global sequential features and local peak features for disease classification using RS data. Specifically, we use a long short-term memory (LSTM) network module to extract global sequential features from the Raman spectra, as it can capture the long-term dependencies present in Raman spectral sequences. Meanwhile, an attention mechanism is employed to select local peak features, which were previously ignored and are key to distinguishing different diseases. Experimental results on three public and in-house datasets demonstrate the superiority of our model over state-of-the-art methods for RS classification. In particular, our model achieves an accuracy of 97.9 ± 0.2% on the COVID-19 dataset, 76.3 ± 0.4% on the H-IV dataset, and 96.8 ± 1.9% on the H-V dataset.
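The combination described in the abstract, an LSTM for global sequential features followed by attention over positions in the spectrum, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify layer sizes, the attention form, or how the two parts are wired together, so the dot-product attention scoring, the function names (`init_params`, `lstm_forward`, `classify_spectrum`), and all dimensions below are assumptions chosen only for illustration. The weights are random and untrained.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - np.max(x))  # subtract max for numerical stability
    return e / e.sum()

def init_params(input_dim, hidden_dim, num_classes, seed=0):
    # Hypothetical parameter layout; random, untrained weights.
    rng = np.random.default_rng(seed)
    scale = 0.1
    return {
        "W": rng.normal(0, scale, (4 * hidden_dim, input_dim + hidden_dim)),
        "b": np.zeros(4 * hidden_dim),
        "v": rng.normal(0, scale, hidden_dim),  # attention scoring vector (assumed form)
        "W_out": rng.normal(0, scale, (num_classes, hidden_dim)),
        "b_out": np.zeros(num_classes),
    }

def lstm_forward(x_seq, params, hidden_dim):
    # Standard LSTM recurrence; returns one hidden state per spectral position.
    h = np.zeros(hidden_dim)
    c = np.zeros(hidden_dim)
    states = []
    for x_t in x_seq:
        z = params["W"] @ np.concatenate([x_t, h]) + params["b"]
        i, f, o, g = np.split(z, 4)
        i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
        c = f * c + i * g
        h = o * np.tanh(c)
        states.append(h)
    return np.stack(states)  # shape (T, hidden_dim)

def classify_spectrum(spectrum, params, hidden_dim):
    # Treat the 1-D spectrum as a sequence of scalar intensities.
    x_seq = spectrum.reshape(-1, 1)
    states = lstm_forward(x_seq, params, hidden_dim)
    # Attention weights over positions: high weights mark the positions
    # (e.g. peaks) that contribute most to the pooled representation.
    attn = softmax(states @ params["v"])
    context = attn @ states  # weighted sum of hidden states, shape (hidden_dim,)
    probs = softmax(params["W_out"] @ context + params["b_out"])
    return probs, attn

# Synthetic spectrum with a single Gaussian peak (illustrative data only).
positions = np.arange(64)
spectrum = np.exp(-0.5 * ((positions - 20) / 2.0) ** 2)
params = init_params(input_dim=1, hidden_dim=8, num_classes=3)
probs, attn = classify_spectrum(spectrum, params, hidden_dim=8)
print(probs.shape, attn.shape)
```

In this sketch the attention weights `attn` have one entry per spectral position, so after training they could be inspected to see which regions of the spectrum drive a prediction. Whether the paper's attention module is interpreted this way is not stated in the abstract.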