Hub-Accelerator: Fast and Exact Shortest Path Computation in Large Social Networks

Ruoming Jin, Ning Ruan, Bo You, Haixun Wang
DOI: https://doi.org/10.48550/arXiv.1305.0507
2013-05-03
Abstract: Shortest path computation is one of the most fundamental operations for managing and analyzing large social networks. Though existing techniques are quite effective for finding the shortest path on large but sparse road networks, social graphs have quite different characteristics: they are generally non-spatial, non-weighted, scale-free, and they exhibit small-world properties in addition to their massive size. In particular, the existence of hubs, those vertices with a large number of connections, explodes the search space, making the shortest path computation surprisingly challenging. In this paper, we introduce a set of novel techniques centered around hubs, collectively referred to as the Hub-Accelerator framework, to compute the k-degree shortest path (finding the shortest path between two vertices if their distance is within k). These techniques enable us to significantly reduce the search space by either greatly limiting the expansion scope of hubs (using the novel distance-preserving Hub-Network concept) or completely pruning away the hubs in the online search (using the Hub²-Labeling approach). The Hub-Accelerator approaches are more than two orders of magnitude faster than BFS and the state-of-the-art approximate shortest path method Sketch for the shortest path computation. The Hub-Network approach does not introduce additional index cost with light pre-computation cost; the index size and index construction cost of Hub²-Labeling are also moderate and better than or comparable to the approximation indexing Sketch method.
Social and Information Networks, Databases, Physics and Society
What problem does this paper attempt to address?
This paper addresses the problem of computing shortest paths in large-scale social networks. Social networks differ from traditional road networks: they are non-spatial, unweighted, and scale-free, and they exhibit small-world properties. In particular, they contain hubs, that is, vertices with a very large number of connections. Expanding a hub during a search pulls in a large share of the graph, which makes shortest path computation in social networks surprisingly hard. The paper proposes the Hub-Accelerator framework, a set of techniques centered on hubs that speed up shortest path computation by significantly reducing the search space. According to the abstract, these approaches are more than two orders of magnitude faster than BFS (breadth-first search) and the state-of-the-art approximate shortest path method Sketch. The core problem can be summarized as follows: how to efficiently compute the k-degree shortest path in a large-scale social network, that is, to find the shortest path between two vertices when their distance is at most k. This problem matters in social network applications, such as showing how you are linked to a given user on LinkedIn or how you are connected to other users on Facebook. Such features need a fast shortest path query whenever the distance between two users is within a given threshold (for example, 6 steps).
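To make the k-degree shortest path query concrete, here is a minimal Python sketch of the plain BFS baseline that the paper compares against, with the search cut off at k hops. It is not the Hub-Network or Hub²-Labeling technique. The function name `k_degree_shortest_path`, the adjacency-dictionary graph representation, and the toy graph are assumptions made for illustration only.

```python
from collections import deque


def k_degree_shortest_path(adj, source, target, k):
    """Return a shortest path from source to target if its length is at most k.

    adj maps each vertex to an iterable of its neighbors (unweighted graph).
    Returns a list of vertices, or None when the distance exceeds k.
    Baseline BFS only; illustrative, not the paper's method.
    """
    if source == target:
        return [source]
    parent = {source: None}          # also serves as the visited set
    frontier = deque([(source, 0)])  # (vertex, distance from source)
    while frontier:
        vertex, dist = frontier.popleft()
        if dist == k:
            continue  # expanding further would exceed k hops
        for neighbor in adj.get(vertex, ()):
            if neighbor in parent:
                continue
            parent[neighbor] = vertex
            if neighbor == target:
                # Walk parent pointers back to the source.
                path = [target]
                while parent[path[-1]] is not None:
                    path.append(parent[path[-1]])
                return path[::-1]
            frontier.append((neighbor, dist + 1))
    return None


# Hypothetical toy graph: "h" plays the role of a (very small) hub.
toy_graph = {
    "a": ["b", "h"],
    "b": ["a", "c"],
    "c": ["b", "d"],
    "d": ["c"],
    "h": ["a", "x", "y"],
    "x": ["h"],
    "y": ["h"],
}

print(k_degree_shortest_path(toy_graph, "a", "d", 3))  # ['a', 'b', 'c', 'd']
print(k_degree_shortest_path(toy_graph, "a", "d", 2))  # None
```

In this baseline every neighbor of a visited hub is enqueued, which is the search-space blow-up the summary above describes; on a real social graph a hub may have a very large number of neighbors rather than three.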