A self-learning graph clustering approach for protein complexes detection

Jia ZHU, Xing-cheng WU, Xue-qin LIN, Dan-yang XIAO, Jing XIAO, Jin HUANG, Chao-bo HE
DOI: https://doi.org/10.7641/CTA.2017.60581
2017-01-01
Abstract: A protein complex is a group of two or more associated polypeptide chains that plays an essential role in biological processes. Given a graph representing protein-protein interaction (PPI) data, it is important but non-trivial to find the protein complexes in it, that is, the subsets of proteins that are closely coupled, particularly as PPI networks have grown greatly in size in recent years. In this paper, we propose a graph-based clustering approach that adopts symmetric non-negative matrix factorization, which can effectively detect densely connected subgraphs in complex networks. We compare the performance of our approach with state-of-the-art approaches on three PPI networks against a well-known set of benchmark complexes. The experimental results show that our approach significantly outperforms the other methods on all three PPI networks, which differ in data size and density.
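To make the core idea concrete, the following is a minimal sketch of symmetric non-negative matrix factorization applied to a small graph. It is not the authors' implementation: the abstract does not specify the update rule, the self-learning component, or how complexes are read off the factor, so everything here is an assumption made for illustration. The sketch approximates the adjacency matrix A by H Hᵀ with H ≥ 0 using a damped multiplicative update, and assigns each node to the column of H with the largest weight. The function names `symnmf`, `hard_labels`, and `two_clique_graph`, the damping value `beta = 0.5`, and the toy graph are all hypothetical.

```python
import random


def symnmf(adj, k, iters=500, beta=0.5, seed=0, eps=1e-12):
    """Approximate adj ~= H H^T with H >= 0 (illustrative sketch only).

    Uses a damped multiplicative update, applied to all entries at once:
        H <- H * (1 - beta + beta * (A H) / (H H^T H))
    """
    n = len(adj)
    rng = random.Random(seed)
    # Assumption: random positive initialisation; the paper may differ.
    h = [[rng.random() for _ in range(k)] for _ in range(n)]
    for _ in range(iters):
        # gram = H^T H, a k x k matrix
        gram = [[sum(h[i][a] * h[i][b] for i in range(n)) for b in range(k)]
                for a in range(k)]
        # ah = A H, an n x k matrix
        ah = [[sum(adj[i][j] * h[j][c] for j in range(n)) for c in range(k)]
              for i in range(n)]
        # hhh = H (H^T H), an n x k matrix
        hhh = [[sum(h[i][a] * gram[a][c] for a in range(k)) for c in range(k)]
               for i in range(n)]
        h = [[h[i][c] * (1 - beta + beta * ah[i][c] / (hhh[i][c] + eps))
              for c in range(k)]
             for i in range(n)]
    return h


def hard_labels(h):
    """Assign each node to the column of H where it has the largest weight."""
    return [max(range(len(row)), key=row.__getitem__) for row in h]


def two_clique_graph():
    """Toy graph: two 4-node cliques joined by the single edge (3, 4)."""
    edges = [(i, j)
             for block in (range(0, 4), range(4, 8))
             for i in block for j in block if i < j]
    edges.append((3, 4))
    adj = [[0.0] * 8 for _ in range(8)]
    for i, j in edges:
        adj[i][j] = adj[j][i] = 1.0
    return adj


adj = two_clique_graph()
h = symnmf(adj, k=2)
print(hard_labels(h))
```

On this toy graph, each densely connected clique is expected to load on its own column of H. Taking a hard argmax per node is a simplification: proteins can belong to more than one complex, so a real pipeline would more likely threshold the entries of H to allow overlapping clusters.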