Discovering overlapping communities in multi-layer directed networks

Huan Qing
2024-07-23
Abstract:This article explores the challenging problem of detecting overlapping communities in multi-layer directed networks. Our goal is to understand the underlying asymmetric overlapping community structure by analyzing the mixed memberships of nodes. We introduce a new model, the multi-layer mixed membership stochastic co-block model (multi-layer MM-ScBM), to model multi-layer directed networks in which nodes can belong to multiple communities. We develop a spectral procedure to estimate nodes' memberships in both sending and receiving patterns. Our method uses a successive projection algorithm on a few leading eigenvectors of two debiased aggregation matrices. To our knowledge, this is the first work to detect asymmetric overlapping communities in multi-layer directed networks. We demonstrate the consistent estimation properties of our method by providing per-node error rates under the multi-layer MM-ScBM framework. Our theoretical analysis reveals that increasing the overall sparsity, the number of nodes, or the number of layers can improve the accuracy of overlapping community detection. Extensive numerical experiments are conducted to validate these theoretical findings. We also apply our method to one real-world multi-layer directed network, gaining insightful results.
Social and Information Networks,Methodology
What problem does this paper attempt to address?
The paper attempts to address the problem of detecting overlapping communities in multi-layer directed networks. Specifically, the paper proposes a new model—the multi-layer mixed membership stochastic co-block model (multi-layer MM-ScBM)—to model multi-layer directed networks where nodes can belong to multiple communities. The main contributions of the paper include: 1. **Proposing a flexible and interpretable statistical model**: The multi-layer MM-ScBM allows nodes in multi-layer directed networks to belong to multiple communities. 2. **Proposing a spectral method**: Estimating the mixed membership of nodes by running a vertex search algorithm on the leading eigenvectors of two debiased aggregated matrices. 3. **Theoretical analysis**: By deriving node error bounds, the consistency of the proposed method is demonstrated, and the impact of increasing overall sparsity, the number of nodes, or the number of layers on the accuracy of overlapping community detection is revealed. These theoretical findings are validated through synthetic data and a real-world multi-layer directed network (FAO multi-layer trade network), showing that the proposed method has significant advantages in detecting overlapping communities.