Randomized Spectral Co-Clustering for Large-Scale Directed Networks

Xiao Guo,Yixuan Qiu,Hai Zhang,Xiangyu Chang
DOI: https://doi.org/10.48550/arxiv.2004.12164
2023-01-01
Abstract:Directed networks are broadly used to represent asymmetric relationshipsamong units. Co-clustering aims to cluster the senders and receivers ofdirected networks simultaneously. In particular, the well-known spectralclustering algorithm could be modified as the spectral co-clustering toco-cluster directed networks. However, large-scale networks pose greatcomputational challenges to it. In this paper, we leverage sketching techniquesand derive two randomized spectral co-clustering algorithms, onerandom-projection-based and the other random-sampling-based, toaccelerate the co-clustering of large-scale directed networks. We theoreticallyanalyze the resulting algorithms under two generative models – the stochasticco-block model and the degree-corrected stochastic co-block model, andestablish their approximation error rates and misclustering error rates,indicating better bounds than the state-of-the-art results of co-clusteringliterature. Numerically, we design and conduct simulations to support ourtheoretical results and test the efficiency of the algorithms on real networkswith up to millions of nodes. A publicly available R package is developed for better usability and reproducibility of the proposed methods.
What problem does this paper attempt to address?