Core Decomposition on Uncertain Graphs Revisited

Qiangqiang Dai, Rong-Hua Li, Guoren Wang, Rui Mao, Zhiwei Zhang, Ye Yuan
DOI: https://doi.org/10.1109/tkde.2021.3088504
IF: 9.235
2021-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Core decomposition on uncertain graphs is a fundamental problem in graph analysis. Given an uncertain graph $\mathcal{G}$, the core decomposition problem is to determine all $(k,\eta)$-cores in $\mathcal{G}$, where a $(k,\eta)$-core is a maximal subgraph of $\mathcal{G}$ in which each node has an $\eta$-degree no less than $k$. The $\eta$-degree of a node $v$ is defined as the maximum integer $r$ such that the probability that $v$ has a degree no less than $r$ is at least the threshold $\eta \in [0,1]$. The state-of-the-art algorithm for this problem is based on a peeling technique that iteratively removes the nodes with the smallest $\eta$-degrees and dynamically updates their neighbors' $\eta$-degrees. Unfortunately, we find that this peeling algorithm with dynamic $\eta$-degree updating is incorrect, because the recursive floating-point division operations in the updating procedure are inaccurate. To compute the $(k,\eta)$-cores correctly, we first propose a bottom-up algorithm based on an on-demand $\eta$-degree computation strategy. To further improve efficiency, we also develop a top-down algorithm with several nontrivial optimization techniques. Neither algorithm involves any floating-point division, so correctness can be guaranteed. In addition, we develop parallel variants of all the proposed algorithms. Finally, we conduct extensive experiments to evaluate the proposed algorithms on five large real-life datasets. The results show that our algorithms are at least three orders of magnitude faster than the existing exact algorithms on large uncertain graphs. The results also demonstrate the high scalability and parallel performance of the proposed algorithms.
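To make the definitions in the abstract concrete, the following is a minimal sketch, not the authors' bottom-up or top-down algorithm. It assumes edges exist independently with given probabilities, computes a node's $\eta$-degree from its degree distribution using only additions and multiplications (no division), and then peels nodes naively, recomputing every $\eta$-degree from scratch in each round. The names `eta_degree` and `eta_core_numbers` and the adjacency-dictionary layout are hypothetical choices for illustration.

```python
from typing import Dict, List


def eta_degree(probs: List[float], eta: float) -> int:
    """Largest r such that P(deg >= r) >= eta, assuming independent edges."""
    # dist[j] = probability that exactly j of the edges processed so far exist.
    dist = [1.0]
    for p in probs:
        nxt = [0.0] * (len(dist) + 1)
        for j, q in enumerate(dist):
            nxt[j] += q * (1.0 - p)   # this edge is absent
            nxt[j + 1] += q * p       # this edge is present
        dist = nxt
    # Accumulate the tail P(deg >= r) from the largest r downward.
    tail = 0.0
    for r in range(len(dist) - 1, 0, -1):
        tail += dist[r]
        if tail >= eta:
            return r
    return 0  # P(deg >= 0) = 1, so 0 always qualifies


def eta_core_numbers(adj: Dict[int, Dict[int, float]], eta: float) -> Dict[int, int]:
    """Naive peeling: recompute all eta-degrees each round (illustrative, slow)."""
    alive = set(adj)
    core: Dict[int, int] = {}
    k = 0
    while alive:
        deg = {
            v: eta_degree([p for u, p in adj[v].items() if u in alive], eta)
            for v in alive
        }
        v = min(alive, key=lambda x: (deg[x], x))  # smallest eta-degree first
        k = max(k, deg[v])  # core values never decrease along the peel order
        core[v] = k
        alive.remove(v)
    return core


# Hypothetical example: a certain triangle {0, 1, 2} plus an unlikely pendant edge 0-3.
graph = {
    0: {1: 1.0, 2: 1.0, 3: 0.4},
    1: {0: 1.0, 2: 1.0},
    2: {0: 1.0, 1: 1.0},
    3: {0: 0.4},
}
cores = eta_core_numbers(graph, eta=0.5)
```

Recomputing each distribution from scratch avoids the division step that an incremental update would need when an edge is removed, at the cost of much more work per round than a practical algorithm would accept.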
computer science, information systems, artificial intelligence, engineering, electrical & electronic