An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph Discovery

Xiaojia Xu, Haoyu Liu, Xiaowei Lv, Yongcai Wang, Deying Li
2024-08-26
Abstract: Detecting locally densest, non-overlapping, near-clique subgraphs is a crucial problem for community search in social networks. Because a vertex may be involved in multiple overlapping local cliques, detecting locally densest sub-structures under h-clique density, i.e., the locally h-clique densest subgraph (LhCDS), has attracted great interest. This paper investigates the LhCDS detection problem and proposes an efficient and exact algorithm to list the top-k non-overlapping, locally h-clique dense, and compact subgraphs. In particular, we jointly consider the h-clique compact number and LhCDS, and design a new "Iterative Propose-Prune-and-Verify" pipeline (IPPV) for top-k LhCDS detection. (1) In the proposal part, we derive initial bounds for h-clique compact numbers, prove their validity, and extend a convex programming method to tighten the bounds so that LhCDS candidates are proposed without missing any. (2) A tentative graph decomposition method is then proposed to handle the challenging case where a clique spans multiple subgraphs during graph decomposition. (3) To deal with the difficulty of verification, both a basic and a fast verification method are proposed; the fast method constructs a smaller-scale flow network to improve efficiency while preserving verification correctness. The verified LhCDSes are returned, while candidates that remain unverified reenter the IPPV pipeline. (4) We further extend the proposed methods to the detection of locally densest subgraphs defined by more general patterns. We prove the exactness and low complexity of the proposed algorithm. Extensive experiments on real datasets show the effectiveness and high efficiency of IPPV.
Data Structures and Algorithms
What problem does this paper attempt to address?
The paper addresses the problem of efficiently and exactly detecting locally densest, non-overlapping, near-clique subgraphs in a graph. Specifically, it focuses on the Locally h-Clique Densest Subgraph (LhCDS) problem: for a given integer h, list the top k non-overlapping subgraphs that are locally densest with respect to h-clique density and compact. The paper proposes a new iterative framework called IPPV (Iterative Propose-Prune-and-Verify), which proceeds through the following steps:

1. **Propose Initial Bounds**: Derive initial upper and lower bounds on the h-clique compact numbers from the structure of the graph, then tighten them with an extended convex programming method so that no LhCDS candidate is missed.
2. **Graph Decomposition Method**: Use a tentative graph decomposition to handle cases where a clique spans multiple subgraphs.
3. **Verification Algorithm**: Verify candidates with either a basic method or a fast method; the fast method constructs a smaller flow network to improve efficiency while preserving verification correctness. Verified LhCDSes are returned, and candidates that remain unverified reenter the pipeline.

Additionally, the paper extends the methods to more general pattern densities, proves that the proposed algorithm is exact and of low complexity, and demonstrates the effectiveness and efficiency of IPPV through extensive experiments on real-world datasets.
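To make the quantity being optimized concrete, the sketch below computes h-clique density, assuming the usual definition: the number of h-cliques in the subgraph induced by a vertex set, divided by the number of vertices in that set. It is a brute-force illustration only and is not the IPPV algorithm; the helper names (`build_adjacency`, `count_h_cliques`, `h_clique_density`) and the toy graph are hypothetical and do not come from the paper.

```python
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {vertex: set(neighbors)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def count_h_cliques(adj, vertices, h):
    """Count h-cliques in the subgraph induced by `vertices` (brute force, small graphs only)."""
    candidates = sorted(vertices)
    return sum(
        1
        for combo in combinations(candidates, h)
        # A vertex subset is a clique when every pair in it is adjacent.
        if all(v in adj[u] for u, v in combinations(combo, 2))
    )


def h_clique_density(adj, vertices, h):
    """Assumed definition: (# h-cliques in the induced subgraph) / (# vertices)."""
    vertices = set(vertices)
    if not vertices:
        return 0.0
    return count_h_cliques(adj, vertices, h) / len(vertices)


# Toy graph (hypothetical): a 4-clique on {0, 1, 2, 3} with a path 3-4-5 attached.
EDGES = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4), (4, 5)]
ADJ = build_adjacency(EDGES)

dense_part = h_clique_density(ADJ, {0, 1, 2, 3}, 3)  # 4 triangles / 4 vertices = 1.0
whole_graph = h_clique_density(ADJ, set(ADJ), 3)     # 4 triangles / 6 vertices
```

On this toy graph the 4-clique has a higher triangle density than the whole graph, which is the kind of locally dense region an LhCDS method aims to isolate. Enumerating all h-subsets is exponential in h and serves only to illustrate the definition.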