Quality Assessment of MS/MS Spectra using Variable Selection and Support Vector Machine

Hanchang Sun,Jiyang Zhang,Hui Liu,Wei Zhang,Changming Xu,Tengjiao Wang,Hongwei Xie
DOI: https://doi.org/10.1016/j.egypro.2011.10.525
2011-01-01
Energy Procedia
Abstract:High-throughput proteomics experiments produce large amounts of MS/MS data, but many are of too low quality to be utilized. Filtering out the low quality MS/MS spectra is one of the strategies to increase computational speed of database searching. We investigate the variables proposed in previous literatures, and collect 63 features for our study, including 8 new variables proposed in this paper. MRMR is used to select important variable set for 4 kinds of mass spectrometer platforms: Thermo LTQ-FT, LTQ LCQ, and Waters/Micromass QTOF. A Support Vector Machine based method is used to assess the quality of MS/MS spectra. Datasets from different mass spectrometers are applied to test the accuracy of our method. In order to prove the capability of our method, we compare it with msmsEval on the test datasets, and the results show that our method achieves higher accuracy and better performance than msmsEval. The programs can be obtained from the authors by request. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of the Organizers of 2011 International Conference on Energy and Environmental Science.
What problem does this paper attempt to address?