Network mutual information measures for graph similarity

Helcio Felippe,Federico Battiston,Alec Kirkley
DOI: https://doi.org/10.1038/s42005-024-01830-3
2024-10-14
Abstract:A wide range of tasks in network analysis, such as clustering network populations or identifying anomalies in temporal graph streams, require a measure of the similarity between two graphs. To provide a meaningful data summary for downstream scientific analyses, the graph similarity measures used for these tasks must be principled, interpretable, and capable of distinguishing meaningful overlapping network structure from statistical noise at different scales of interest. Here we derive a family of graph mutual information measures that satisfy these criteria and are constructed using only fundamental information theoretic principles. Our measures capture the information shared among networks according to different encodings of their structural information, with our mesoscale mutual information measure allowing for network comparison under any specified network coarse-graining. We test our measures in a range of applications on real and synthetic network data, finding that they effectively highlight intuitive aspects of network similarity across scales in a variety of systems.
Physics and Society,Social and Information Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively measure the similarity between two graphs in network analysis. Specifically, the paper focuses on constructing a graph similarity measurement method that can meet the following conditions: 1. **Principle - based**: It is constructed based on basic information - theoretic principles to ensure a solid theoretical foundation for the method. 2. **Interpretability**: It can clearly explain why two graphs are considered similar or dissimilar. 3. **Discriminative ability**: It can distinguish meaningful overlapping network structures from statistical noise at different scales, especially for meso - scale and macro - scale network structures. To achieve these goals, the paper proposes a class of graph similarity measurement methods based on mutual information, called "Network Mutual Information" (NMI). These methods capture the shared information between graphs through different encoding schemes, including: - **Standard NMI**: It measures the similarity of graphs based on the edge - overlap situation. - **Degree - Corrected NMI (DC - NMI)**: It takes into account the distribution of node degrees and pays more attention to the overlap of node neighborhoods. - **Mesoscale NMI (MesoNMI)**: It compares the mesoscale structures of graphs, such as community structures, under specified network coarsening. These methods can not only effectively distinguish the similarity of network structures at different scales but also provide intuitive and meaningful results in practical applications. The paper verifies the effectiveness of these methods through a series of experiments, especially in the applications of synthetic networks and multi - layer networks.