Abstract: Objective: We propose a novel method for comparing directed networks by decomposing each network into small modules. This network subgraph approach is distinct from the network motif approach because it does not depend on null model assumptions.
Method: We developed an alignment-free algorithm, the Subgraph Identification Algorithm (SIA), which generates all subgraphs that have five connected nodes (5-node subgraphs). There are 9,364 such modules. We then applied SIA to 17 cancer networks and measured the similarity between two networks using the Jensen-Shannon entropy (HJS).
Results: We identified and examined the biological meaning of 5-node regulatory modules and of the pairs of cancer networks with the smallest HJS values. The two pairs of networks that show similar patterns are (i) endometrial cancer and hepatocellular carcinoma and (ii) breast cancer and pathways in cancer. Some published studies provide experimental data supporting the 5-node regulatory modules.
Conclusion: Our method is an alignment-free approach that measures the topological similarity of 5-node regulatory modules and compares two directed networks based on their topology. These modules capture complex interactions among multiple genes that cannot be detected by existing methods that consider only single-gene relations. We analyzed the biological relevance of the regulatory modules and used the subgraph method to identify modules that share the same topology across two of the 17 cancer networks. We validated our findings using evidence from the literature.
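
The comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' SIA implementation. It is a brute-force stand-in that enumerates connected k-node subgraphs of a small directed network, groups them by topology, and compares two networks through the Jensen-Shannon divergence of the resulting frequency distributions. Several points are assumptions made for illustration: subgraphs are taken to be induced and weakly connected, HJS is taken to be the standard Jensen-Shannon divergence with base-2 logarithms, and the function names and toy networks are hypothetical. The example uses k = 3 to keep the enumeration small.

```python
from itertools import combinations, permutations
from collections import Counter
from math import log2

def is_weakly_connected(nodes, edges):
    """True if `nodes` form one component when edge direction is ignored."""
    nodes = set(nodes)
    adj = {n: set() for n in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    start = next(iter(nodes))
    seen, stack = {start}, [start]
    while stack:
        for nxt in adj[stack.pop()] - seen:
            seen.add(nxt)
            stack.append(nxt)
    return seen == nodes

def canonical_form(nodes, edges):
    """Smallest relabelled edge list over all node orderings (brute force)."""
    best = None
    for perm in permutations(nodes):
        index = {n: i for i, n in enumerate(perm)}
        form = tuple(sorted((index[u], index[v]) for u, v in edges))
        if best is None or form < best:
            best = form
    return best

def subgraph_counts(edges, k):
    """Count connected induced k-node subgraphs, grouped by topology."""
    edge_set = set(edges)
    all_nodes = sorted({n for e in edge_set for n in e})
    counts = Counter()
    for nodes in combinations(all_nodes, k):
        chosen = set(nodes)
        induced = [(u, v) for u, v in edge_set if u in chosen and v in chosen]
        if is_weakly_connected(nodes, induced):
            counts[canonical_form(nodes, induced)] += 1
    return counts

def jensen_shannon(counts_a, counts_b):
    """Jensen-Shannon divergence (base 2) between two count distributions."""
    total_a, total_b = sum(counts_a.values()), sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("each network needs at least one counted subgraph")
    h = 0.0
    for key in set(counts_a) | set(counts_b):
        p = counts_a.get(key, 0) / total_a
        q = counts_b.get(key, 0) / total_b
        m = (p + q) / 2
        if p > 0:
            h += 0.5 * p * log2(p / m)
        if q > 0:
            h += 0.5 * q * log2(q / m)
    return h

# Hypothetical toy networks: a regulatory chain and a single-regulator fan-out.
chain = [("A", "B"), ("B", "C"), ("C", "D")]
fan_out = [("A", "B"), ("A", "C"), ("A", "D")]

print(jensen_shannon(subgraph_counts(chain, 3), subgraph_counts(chain, 3)))
print(jensen_shannon(subgraph_counts(chain, 3), subgraph_counts(fan_out, 3)))
```

With base-2 logarithms the divergence lies between 0 (identical module distributions) and 1 (no shared module topology), so smaller values indicate more similar networks. The permutation-based canonical form is only practical for very small k; it is used here to keep the sketch self-contained.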

Aligning graphs and finding substructures by a cavity approach

Solving Maximum Clique Problem for Protein Structure Similarity

Parallel maximal common subgraphs with labels for molecular biology

Maximum Cliques in Protein Structure Comparison

Cavity approach for the approximation of spectral density of graphs with heterogeneous structures

On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types

Comparing Graph Representations of Protein Structure for Mining Family-Specific Residue-Based Packing Motifs

Network Subgraph-based Method: Alignment-free Technique for Molecular Network Analysis

The Algorithmic Phase Transition of Random Graph Alignment Problem

Finding Largest Common Substructures of Molecules in Quadratic Time

Structure alignment via Delaunay tetrahedralization

Spectral Alignment of Graphs

CavDetect: A DBSCAN Algorithm based Novel Cavity Detection Model on Protein Structure

Multiple Network Alignment on Quantum Computers

Finding the Hierarchy of Dense Subgraphs using Nucleus Decompositions

Aligning random graphs with a sub-tree similarity message-passing algorithm

Biclustering Protein Complex Interactions with a Biclique Finding Algorithm

Detecting highly overlapping community structure by greedy clique expansion

A linear delay algorithm for enumerating all connected induced subgraphs

CLEVER: Clique-Enumerating Variant Finder