MOSS-5: A Fast Method of Approximating Counts of 5-Node Graphlets in Large Graphs

Pinghui Wang,Junzhou Zhao,Xiangliang Zhang,Zhenguo Li,Jiefeng Cheng,John C. S. Lui,Don Towsley,Jing Tao,Xiaohong Guan
DOI: https://doi.org/10.1109/tkde.2017.2756836
IF: 9.235
2018-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Despite recent efforts in counting 3-node and 4-node graphlets, little attention has been paid to characterizing 5-node graphlets. In this paper, we develop a computationally efficient sampling method to estimate 5-node graphlet counts. We not only provide a fast sampling method and unbiased estimators of graphlet counts, but also derive simple yet exact formulas for the variances of the estimators which are of great value in practice-the variances can be used to bound the estimates' errors and determine the smallest necessary sampling budget for a desired accuracy. We conduct experiments on a variety of real-world datasets, and the results show that our method is several orders of magnitude faster than the state-of-the-art methods with the same accuracy.
What problem does this paper attempt to address?