An evolutionary clustering algorithm of the heterogeneous information network based on embedding technology

Limin CHEN,Jing YANG,Jianpei ZHANG
DOI: https://doi.org/10.3969/j.issn.2014-0026.201410026
2015-01-01
Abstract:In order to cluster dynamic heterogeneous information networks, a fast evolutionary clustering algorithm for dynamic heterogeneous information networks with star schema is proposed in this paper by using the sparsity of heterogeneous information networks. First, the heterogeneous information network is transformed into multiple com?patible bipartite graphs from the point of view of compatibility and a temporal smoothing bipartite graph is construc?ted so that it can represent the relation between the nodes at a time and the time before it. Next, the approximate commute time embedding for each temporal smoothing bipartite graph is computed via random mapping and a linear time solver, thereby the multiple embedding subsets for target dataset are obtained. Finally, the sum of the weighted distances is computed by using all the indicators in embedding subsets to indicate the identical object and all the centers of the clusters with identical label. The clusters of the heterogeneous information network can be acquired by k-means. This proposed algorithm is validated with higher accuracy rate and faster computation speed in dividing dynamic heterogeneous information networks.
What problem does this paper attempt to address?