A new algorithm for essential proteins identification based on the integration of protein complex co-expression information and edge clustering coefficient

Jiawei Luo,Juan Wu
DOI: https://doi.org/10.1504/IJDMB.2015.069654
2015-01-01
International Journal of Data Mining and Bioinformatics
Abstract:Essential proteins provide valuable information for the development of biology and medical research from the system level. The accuracy of topological centrality only based methods is deeply affected by noise in the network. Therefore, exploring efficient methods for identifying essential proteins would be of great value. Using biological features to identify essential proteins is efficient in reducing the noise in PPI network. In this paper, based on the consideration that essential proteins evolve slowly and play a central role within a network, a new algorithm, named CED, is proposed. CED mainly employs gene expression level, protein complex information and edge clustering coefficient to predict essential proteins. The performance of CED is validated based on the yeast Protein-Protein Interaction PPI network obtained from DIP database and BioGRID database. The prediction accuracy of CED outperforms other seven algorithms when applied to the two databases.
What problem does this paper attempt to address?