Peptide Identification Based on Fuzzy Classification and Clustering

Xijun Liang,Zhonghang Xia,Xinnan Niu,Andrew J Link,Liping Pang,Fang-Xiang Wu,Hongwei Zhang
DOI: https://doi.org/10.1186/1477-5956-11-s1-s10
2013-01-01
Proteome Science
Abstract:Background The sequence database searching has been the dominant method for peptide identification, in which a large number of peptide spectra generated from LC/MS/MS experiments are searched using a search engine against theoretical fragmentation spectra derived from a protein sequences database or a spectral library. Selecting trustworthy peptide spectrum matches (PSMs) remains a challenge. Results A novel scoring method named FC-Ranker is developed to assign a nonnegative weight to each target PSM based on the possibility of its being correct. Particularly, the scores of PSMs are updated by using a fuzzy SVM classification model and a fuzzy silhouette index iteratively. Trustworthy PSMs will be assigned high scores when the algorithm stops. Conclusions Our experimental studies show that FC-Ranker outperforms other post-database search algorithms over a variety of datasets, and it can be extended to solve a general classification problem with uncertain labels.
What problem does this paper attempt to address?