Efficient querying k-trusses on temporal graphs
Yuting Tan,Chunhua Wang,Junfeng Zhou,Ming Du,Guohao Sun,Weiguo Zheng
DOI: https://doi.org/10.1016/j.ipm.2024.104014
IF: 7.466
2024-12-12
Information Processing & Management
Abstract: Real-world entities and their relationships can be modeled using temporal graphs, where each edge is associated with a timestamp denoting the time of an interaction between two entities. The k-truss is one of the typical dense subgraph models and has received considerable attention. Given a query time interval and an integer k (k ≥ 3), querying the k-truss on a temporal graph G aims to return the k-truss of the snapshot graph corresponding to the query time interval. Although k-truss computation has been studied extensively on simple graphs without timestamps, no existing work supports efficient k-truss querying on temporal graphs. To address this problem, we propose a time-interval-based index, PHT. PHT maintains each edge together with its effective k-truss intervals, which supports efficient query processing. When answering a k-truss query based on PHT, we only need to verify whether edges drawn from the k-truss subgraphs of all possible snapshot graphs qualify for the query interval, thereby avoiding the expensive truss-decomposition operation. However, the number of qualified edges is usually much smaller than the total number of verified edges. To improve query efficiency, we further propose the SIHT index. SIHT differs from PHT in two respects. First, SIHT maintains only the shortest intervals. Second, in SIHT, each shortest interval is associated with a set of edges. When answering a k-truss query based on SIHT, we only need to find the shortest intervals contained in the given interval, from which we directly obtain the qualified edges of the result. In this way, we avoid processing useless edges. Experiments on real-world datasets indicate that the PHT-based query algorithm significantly outperforms the online algorithm, and that the SIHT-based algorithm is about one to two orders of magnitude faster than the PHT-based query algorithm when querying k-trusses on temporal graphs.
computer science, information systems; information science & library science
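For illustration only, the online baseline that the abstract contrasts with PHT and SIHT can be sketched as follows: build the snapshot graph for the query interval, then peel edges whose triangle support is below k - 2. This is a minimal sketch under stated assumptions, not the authors' implementation. The function names (`snapshot_edges`, `k_truss`, `query_k_truss`) and the `(u, v, t)` triple representation of temporal edges are hypothetical, and the naive fixed-point peeling loop is chosen for clarity rather than efficiency.

```python
from collections import defaultdict


def snapshot_edges(temporal_edges, t_start, t_end):
    """Collect the undirected, de-duplicated edges with timestamp in [t_start, t_end].

    Assumption: temporal_edges is an iterable of (u, v, t) triples.
    """
    edges = set()
    for u, v, t in temporal_edges:
        if u != v and t_start <= t <= t_end:
            edges.add(frozenset((u, v)))
    return edges


def k_truss(edges, k):
    """Return the edges of the k-truss: the maximal subgraph in which
    every edge lies in at least k - 2 triangles."""
    adj = defaultdict(set)
    for e in edges:
        u, v = tuple(e)
        adj[u].add(v)
        adj[v].add(u)

    alive = set(edges)
    changed = True
    while changed:  # repeat until no edge is removed in a full pass
        changed = False
        for e in list(alive):
            u, v = tuple(e)
            # Support = number of common neighbours over surviving edges.
            if len(adj[u] & adj[v]) < k - 2:
                alive.discard(e)
                adj[u].discard(v)
                adj[v].discard(u)
                changed = True
    return alive


def query_k_truss(temporal_edges, t_start, t_end, k):
    """Online query: snapshot construction followed by truss peeling."""
    return k_truss(snapshot_edges(temporal_edges, t_start, t_end), k)
```

Every call to `query_k_truss` repeats the peeling from scratch for the requested interval, which is the cost that an interval-based index is meant to avoid.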