Protein inference method based on probabilistic graphical model

Can ZHAO,Qiong DUAN,Zengyou HE
DOI: https://doi.org/10.11992/tis.201603051
2016-01-01
Abstract:Proteomics is an emerging discipline that focuses on the large-scale study of proteins expressed inan or?ganism. An explicit goal of proteomics is the prompt and accurate identification of all proteins in a cell or tissue. Generally, protein identification can be divided into two parts:peptide identification and protein inference. In pep?tide identification, the peptide sequence is identified from raw tandem mass spectrometry , while the goal of protein inference is to identify which of these identified proteins is truly present in the sample. Because of the inherent un?certainty of MS data and the complexity of the proteome, there are several challenges in protein identification. In this article, we propose a novel method based on the probabilistic graphical model ( PGMPi) that introduces the in?fluence of tandem mass spectrometry. This method transforms the protein inference problem into a probabilistic graphical model problem to be solved, in which the maximum posteriori probabilities of proteins are identified in or?der to identify the protein set that is actually present in the sample. PGMPi can not only achieve efficient perform?ance in terms of identification, but also introduces only one parameter, which ensures the algorithm's stability. The experimental results demonstrate that our method is superior to existing state-of-the-art protein inference algo?rithms.
What problem does this paper attempt to address?