Abstract:Detection of the modular structure of biological networks is of interest to researchers adopting a systems perspective for the analysis of omics data. Computational systems biology has provided a rich array of methods for network clustering. To date, the majority of approaches address this task through a network node classification based on topological or external quantifiable properties of network nodes. Conversely, numerical properties of network edges are underused, even though the information content which can be associated with network edges has augmented due to steady advances in molecular biology technology over the last decade. Properly accounting for network edges in the development of clustering approaches can become crucial to improve quantitative interpretation of omics data, finally resulting in more biologically plausible models. In this study, we present a novel technique for network module detection, named WG-Cluster (Weighted Graph CLUSTERing). WG-Cluster's notable features, compared to current approaches, lie in: (1) the simultaneous exploitation of network node and edge weights to improve the biological interpretability of the connected components detected, (2) the assessment of their statistical significance, and (3) the identification of emerging topological properties in the detected connected components. WG-Cluster utilizes three major steps: (i) an unsupervised version of k-means edge-based algorithm detects sub-graphs with similar edge weights, (ii) a fast-greedy algorithm detects connected components which are then scored and selected according to the statistical significance of their scores, and (iii) an analysis of the convolution between sub-graph mean edge weight and connected component score provides a summarizing view of the connected components. WG-Cluster can be applied to directed and undirected networks of different types of interacting entities and scales up to large omics data sets. Here, we show that WG-Cluster can be successfully used in the differential analysis of physical protein-protein interaction (PPI) networks. Specifically, applying WG-Cluster to a PPI network weighted by measurements of differential gene expression permits to explore the changes in network topology under two distinct (normal vs. tumor) conditions. WG-Cluster code is available at https://sites.google.com/site/paolaleccapersonalpage/.

Identifying edge clusters in networks via edge graphlet degree vectors (edge-GDVs) and edge-GDV-similarities

Detecting modules in biological networks by edge weight clustering and entropy significance

Uncovering Biological Network Function via Graphlet Degree Signatures

Graphlet-based measures are suitable for biological network comparison

Functional geometry of protein-protein interaction networks

Clustering Edges in Directed Graphs

Discriminative topological features reveal biological network mechanisms

Entropy-Based Graph Clustering of PPI Networks for Predicting Overlapping Functional Modules of Proteins

Comparison and Evaluation of Network Clustering Algorithms Applied to Genetic Interaction Networks

Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance

CASCADE: a novel quasi all paths-based network analysis algorithm for clustering biological interactions

Classification in biological networks with hypergraphlet kernels

The probability of edge existence due to node degree: a baseline for network-based predictions

Complexes Detection in Biological Networks via Diversified Dense Subgraphs Mining

Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining

On the impact of data integration and edge enrichment in mining significant signals from biological networks

Clustering on the Edge: Learning Structure in Graphs

Identification of Essential Proteins Based on Edge Clustering Coefficient

GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks

The networked partial correlation and its application to the analysis of genetic interactions

Effectiveness and efficiency: label-aware hierarchical subgraph learning for protein-protein interaction