Network community detection using higher-order structures

X Yu,J Zhu
DOI: https://doi.org/10.1093/biomet/asae014
IF: 3.0279
2024-03-30
Biometrika
Abstract:Summary In many real-world networks, it is often observed that subgraphs or higher-order structures of certain configurations, e.g., triangles and by-fans, are overly abundant compared to standard randomly generated networks (Milo et al., 2002). However, statistical models accounting for this phenomenon are limited, especially when community structure is of interest. This limitation is coupled with a lack of community detection methods that leverage subgraphs or higher-order structures. In this paper, we propose a new community detection method that effectively uses higher-order structures in a network. Furthermore, for the community detection accuracy, under an edge-dependent network model that consists of both community and triangle structures, we develop a finite-sample error bound characterized by the expected triangle degree, which leads to the consistency of the proposed method. To the best of our knowledge, this is the first statistical error bound and consistency result for community detection of a single network considering a network model with dependent edges. We also show, in both simulation studies and a real-world data example, that our method unveils network communities that are otherwise invisible to methods that ignore higher-order structures.
statistics & probability,mathematical & computational biology,biology
What problem does this paper attempt to address?