A novel graph clustering method with a greedy heuristic search algorithm for mining protein complexes from dynamic and static PPI networks

Rongquan Wang,Caixia Wang,Guixia Liu
DOI: https://doi.org/10.1016/j.ins.2020.02.063
IF: 8.1
2020-06-01
Information Sciences
Abstract:Discovering protein complexes from protein-protein interaction (PPI) networks is one of the main tasks in bioinformatics. However, most of the state-of-the-art methods still face some challenges, such as the inability to discover overlapping protein complexes, failure to consider the inherent structure of real protein complexes, and not utilizing biological information. Based on the above mentioned aspects, we present a novel graph clustering method with a greedy heuristic search algorithm for mining protein complexes using a new clustering model in dynamic and static weighted PPI networks (named MPC-C). First, MPC-C constructs dynamic and static weighted PPI networks by combining biological and topological information. Second, initial clusters are obtained using core and multifunctional proteins and then we propose a greedy heuristic search algorithm to expand each initial cluster and form candidate protein complexes in dynamic and static weighted PPI networks. Finally, unreliable and highly overlapping protein complexes are discarded. To demonstrate the performance of MPC-C, we tested this method on five PPI networks and compared it with nine other effective methods. The experimental results showed that MPC-C outperformed state-of-the-art methods in terms of various computational and biologically relevant metrics.
computer science, information systems
What problem does this paper attempt to address?