Abstract:

Background: Identifying protein complexes is crucial to understanding the principles of cellular organization and functional mechanisms. Because much evidence indicates that subgraphs with high density or high modularity in a protein-protein interaction (PPI) network usually correspond to protein complexes, PPI-network-based detection methods have focused on either a subgraph's density or its modularity. However, a dense subgraph may have low modularity, and a subgraph with high modularity may have low density, so a protein complex may appear in the PPI network as a subgraph with low modularity or with low density. Density-based methods have difficulty mining low-density complexes, and modularity-based methods have difficulty mining low-modularity complexes, so both kinds of method are limited in identifying protein complexes of varying density and modularity.

Results: To identify protein complexes of varying density and modularity, including those with low density but high modularity and those with low modularity but high density, we define a novel subgraph fitness, $f_\rho = (\text{density})^{\rho} \cdot (\text{modularity})^{1-\rho}$, and propose a novel algorithm, LF-PIN, that identifies protein complexes by expanding seed edges into subgraphs with locally maximal fitness. Experimental results on S. cerevisiae show that LF-PIN identifies known protein complexes more effectively when the fitness is determined by both density and modularity ($0 < \rho < 1$) than when the fitness equals density ($\rho = 1$) or modularity ($\rho = 0$). Compared with seven competing protein complex detection methods (CMC, Core-Attachment, CPM, DPClus, HC-PIN, MCL, and NFC) on S. cerevisiae and E. coli, LF-PIN performs better in terms of matching known complexes and functional enrichment. Moreover, LF-PIN performs better at identifying protein complexes with low density or low modularity.

Conclusions: By considering both density and modularity, LF-PIN outperforms protein complex detection methods that consider only density or only modularity, especially in identifying known protein complexes with low density or low modularity.
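The fitness definition and the seed-edge expansion described in the Results can be illustrated with a minimal Python sketch. The abstract does not define density, modularity, or the exact expansion rule, so everything here is an assumption made for illustration: density is taken as the fraction of possible edges present in the subgraph, modularity as the fraction of the subgraph's edge endpoints that stay inside it, and expansion as a greedy step that adds the neighbouring node giving the largest fitness gain. The function names and `toy_graph` are hypothetical and are not taken from LF-PIN itself.

```python
from itertools import combinations


def internal_edges(graph, nodes):
    """Count edges with both endpoints inside `nodes`."""
    return sum(1 for u, v in combinations(nodes, 2) if v in graph[u])


def density(graph, nodes):
    """Assumed definition: present edges / possible edges in the subgraph."""
    n = len(nodes)
    if n < 2:
        return 0.0
    return 2.0 * internal_edges(graph, nodes) / (n * (n - 1))


def modularity(graph, nodes):
    """Assumed definition: share of the subgraph's edge endpoints that are internal."""
    inside = 2 * internal_edges(graph, nodes)
    boundary = sum(1 for u in nodes for v in graph[u] if v not in nodes)
    total = inside + boundary
    return inside / total if total else 0.0


def fitness(graph, nodes, rho):
    """f_rho = density^rho * modularity^(1 - rho), as stated in the abstract."""
    return density(graph, nodes) ** rho * modularity(graph, nodes) ** (1 - rho)


def expand_seed_edge(graph, seed, rho):
    """Greedy expansion (an assumed rule): grow until no neighbour raises fitness."""
    cluster = set(seed)
    best = fitness(graph, cluster, rho)
    while True:
        neighbours = {v for u in cluster for v in graph[u]} - cluster
        choice, choice_fit = None, best
        for v in sorted(neighbours):  # sorted only to make tie-breaking deterministic
            f = fitness(graph, cluster | {v}, rho)
            if f > choice_fit:
                choice, choice_fit = v, f
        if choice is None:  # local maximum of the fitness reached
            return cluster, best
        cluster.add(choice)
        best = choice_fit


# Hypothetical toy network: two triangles joined by the bridge c-d.
toy_graph = {
    "a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"},
    "d": {"c", "e", "f"}, "e": {"d", "f"}, "f": {"d", "e"},
}
cluster, score = expand_seed_edge(toy_graph, ("a", "b"), rho=0.5)
```

On this toy network the seed edge a-b grows into the triangle {a, b, c} and then stops, because adding the bridge node d lowers both the assumed density and the assumed modularity. Setting `rho` to 1 or 0 reduces the fitness to pure density or pure modularity, the two baselines the abstract compares against.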


Identifying Protein Complexes Based on Density and Modularity in Protein-Protein Interaction Network
