An Analysis Model of Protein Mass Spectrometry Data and Its Application

Pingan He,Longao Hou,Hong Tao,Qi Dai,Yuhua Yao
DOI: https://doi.org/10.2174/1574893614666191202150844
2020-01-01
Current Bioinformatics
Abstract:Backgroud: The impact of cancer in society created the necessity of new and faster theoretical models for the early diagnosis of cancer. Methods: In this work, a mass spectrometry (MS) data analysis method based on the star-like graph of protein and support vector machine (SVM) was proposed and applied to the ovarian cancer early classification in the MS data set. Firstly, the MS data is reduced and transformed into the corresponding protein sequence. Then, the topological indexes of the star-like graph are calculated to describe each MS data of the cancer sample. Finally, the SVM model is suggested to classify the MS data. Results: Using independent training and testing experiments 10 times to evaluate the ovarian cancer detection models, the average prediction accuracy, sensitivity, and specificity of the model were 96.45%, 96.88%, and 95.67%, respectively, for [0,1] normalization data, and 94.43%, 96.25%, and 91.11% for [-1,1] normalization data. Conclusion: The model combined with the SELDI-TOF-MS technology has a prospect in early clinical detection and diagnosis of ovarian cancer.
What problem does this paper attempt to address?