Abstract:Background: Experimental methods for the identification of essential proteins are always costly, time-consuming, and laborious. It is a challenging task to find protein essentiality only through experiments. With the development of high throughput technologies, a vast amount of protein-protein interactions are available, which enable the identification of essential proteins from the network level. Many computational methods for such task have been proposed based on the topological properties of protein-protein interaction (PPI) networks. However, the currently available PPI networks for each species are not complete, i.e. false negatives, and very noisy, i.e. high false positives, network topology-based centrality measures are often very sensitive to such noise. Therefore, exploring robust methods for identifying essential proteins would be of great value.Method: In this paper, a new essential protein discovery method, named CoEWC (Co-Expression Weighted by Clustering coefficient), has been proposed. CoEWC is based on the integration of the topological properties of PPI network and the co-expression of interacting proteins. The aim of CoEWC is to capture the common features of essential proteins in both date hubs and party hubs. The performance of CoEWC is validated based on the PPI network of Saccharomyces cerevisiae. Experimental results show that CoEWC significantly outperforms the classical centrality measures, and that it also outperforms PeC, a newly proposed essential protein discovery method which outperforms 15 other centrality measures on the PPI network of Saccharomyces cerevisiae. Especially, when predicting no more than 500 proteins, even more than 50% improvements are obtained by CoEWC over degree centrality (DC), a better centrality measure for identifying protein essentiality.Conclusions: We demonstrate that more robust essential protein discovery method can be developed by integrating the topological properties of PPI network and the co-expression of interacting proteins. The proposed centrality measure, CoEWC, is effective for the discovery of essential proteins.

Two new methods for identifying proteins based on the domain protein complexes and topological properties

United Complex Centrality for Identification of Essential Proteins from PPI Networks.

Identification Of Essential Proteins Based On A New Combination Of Local Interaction Density And Protein Complexes

Identification of Essential Proteins by Using Complexes and Interaction Network.

A mixed clustering coefficient centrality for identifying essential proteins

A New Integration-Centric Algorithm of Identifying Essential Proteins Based on Topology Structure of Protein-Protein Interaction Network and Complex Information

A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data

Discovering Essential Proteins Based on PPI Network and Protein Complex.

Effective Identification of Essential Proteins Based on Priori Knowledge, Network Topology and Gene Expressions.

Identification of Essential Proteins Based on Edge Clustering Coefficient

An Efficient Method to Identify Essential Proteins for Different Species by Integrating Protein Subcellular Localization Information

Prediction of Essential Proteins Based on Local Interaction Density

A New Method For Predicting Essential Proteins Based On Participation Degree In Protein Complex And Subgraph Density

A new algorithm for essential proteins identification based on the integration of protein complex co-expression information and edge clustering coefficient

Identifying Essential Proteins Based on Sub-Network Partition and Prioritization by Integrating Subcellular Localization Information

United Neighborhood Closeness Centrality and Orthology for Predicting Essential Proteins

A New Method For The Discovery Of Essential Proteins

A New Method for Predicting Essential Proteins Based on Topology Potential

Prediction of Essential Proteins by Integration of PPI Network Topology and Protein Complexes Information

A New Method for Identification of Essential Proteins by Information Entropy of Protein Complex and Subcellular Localization

A New Method For Identifying Essential Proteins Based On Edge Clustering Coefficient