Incremental Measurement of Structural Entropy for Dynamic Graphs

Runze Yang,Hao Peng,Chunyang Liu,Angsheng Li
2024-06-26
Abstract:Structural entropy is a metric that measures the amount of information embedded in graph structure data under a strategy of hierarchical abstracting. To measure the structural entropy of a dynamic graph, we need to decode the optimal encoding tree corresponding to the best community partitioning for each snapshot. However, the current methods do not support dynamic encoding tree updating and incremental structural entropy computation. To address this issue, we propose Incre-2dSE, a novel incremental measurement framework that dynamically adjusts the community partitioning and efficiently computes the updated structural entropy for each updated graph. Specifically, Incre-2dSE includes incremental algorithms based on two dynamic adjustment strategies for two-dimensional encoding trees, i.e., the naive adjustment strategy and the node-shifting adjustment strategy, which support theoretical analysis of updated structural entropy and incrementally optimize community partitioning towards a lower structural entropy. We conduct extensive experiments on 3 artificial datasets generated by Hawkes Process and 3 real-world datasets. Experimental results confirm that our incremental algorithms effectively capture the dynamic evolution of the communities, reduce time consumption, and provide great interpretability.
Information Theory
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of incremental measurement of structural entropy in dynamic graphs. Specifically, structural entropy is a method to measure the amount of embedded information in graph structure data, achieved through a hierarchical abstraction strategy. To measure the structural entropy of dynamic graphs, it is necessary to decode the optimal coding tree corresponding to the best community partition at each time point. However, existing methods do not support dynamic coding tree updates and incremental calculation of structural entropy. To solve this problem, the authors propose a new incremental measurement framework—**Incre-2dSE**, which can dynamically adjust community partitions and efficiently calculate the structural entropy after each graph update. ### Main Contributions 1. **Propose Two Dynamic Adjustment Strategies**: For the 2-dimensional coding tree, two dynamic adjustment strategies are proposed—the Naive Adjustment Strategy and the Node-Shifting Adjustment Strategy, avoiding the need to rebuild the coding tree with each graph update. 2. **Design Incremental Framework**: An incremental framework (Incre-2dSE) is designed to efficiently measure the updated 2-dimensional structural entropy with low time complexity. 3. **Extend to Weighted Graphs**: The proposed method is extended to undirected weighted graphs, and an incremental calculation method for 1-dimensional structural entropy of directed weighted graphs is provided. 4. **Experimental Validation**: Extensive experiments are conducted on both synthetic and real-world datasets to verify the effectiveness and efficiency of the method, especially in real-time monitoring of community partition quality and community optimization. ### Method Overview - **Naive Adjustment Strategy**: Keeps the old community partition unchanged, supporting theoretical structural entropy analysis. When a new node connects to an existing node, the new node is assigned to the community of its neighbors rather than any arbitrary community. - **Node-Shifting Adjustment Strategy**: Dynamically adjusts community partitions by moving nodes to their Optimal Preferred Community (OPC) to minimize structural entropy. It supports handling multiple incremental edges and deleting existing edges simultaneously. ### Experimental Results Experimental results show that the proposed incremental algorithm can effectively capture the dynamic evolution characteristics of communities, significantly reduce time consumption, and have good interpretability. All source code and data are publicly available on GitHub for researchers to reference and use.