Enhanced Affinity Propagation Clustering on Heterogeneous Information Network

Debinal Rajan, Shouyong Jiang, Dewei Yi, Wei Pang, George M. Coghill
DOI: https://doi.org/10.1007/978-3-031-55568-8_28
2024-01-01
Abstract: Real-world data sets with multi-typed objects and multi-typed relations can be structured as heterogeneous information networks (HINs). Clustering is one of the most significant processes in HIN analysis, since it provides useful insights into hidden patterns of objects and their complex relational structure. However, grouping multi-relational target objects without losing their rich semantics, and without knowing the number of clusters in advance, is a challenging task. Hence, we use meta-path concepts to compute the similarity between each pair of objects, exploring the different relations to preserve their semantics. Subsequently, we employ the Affinity Propagation (AP) clustering approach, which can automatically generate clusters and assign a corresponding exemplar (cluster center) to each object based on the similarity matrix. The basic motivation for using the AP algorithm is its effectiveness, scalability, and speed in detecting communities or clusters in networked data; yet it has not been applied to HINs. However, the performance of the AP algorithm depends on two parameters, i) the preference p and ii) the damping factor, and unsuitable values can cause the algorithm to fail to converge and to produce unsatisfactory clustering results. Although some existing methods have been developed to handle this issue, they still face two challenges: i) slow convergence and ii) the high computational cost of finding the optimal clustering. In this paper, we present an enhanced AP (EAP) clustering approach that overcomes this issue by updating the parameter values based on different strategies, improving AP performance on an HIN data set. The experimental results show that the proposed method accelerates the algorithm's convergence to the optimal clustering compared with the other methods.
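To make the roles of the two parameters concrete, the following is a minimal sketch of plain Affinity Propagation in pure Python. It is the standard message-passing scheme, not the EAP method of the paper, whose parameter-update strategies are not described in the abstract. The function name `affinity_propagation`, the arguments `max_iter` and `stable_iters`, and the toy one-dimensional data are assumptions made for illustration; in the paper's setting the similarity matrix would come from meta-path-based similarities instead.

```python
def affinity_propagation(S, preference, damping=0.5, max_iter=200, stable_iters=15):
    """Plain AP on an n x n similarity matrix S (list of lists, larger = more similar).

    Returns (exemplars, labels); labels[i] is the index of the exemplar chosen for i.
    Illustrative sketch only, not the enhanced variant described in the abstract.
    """
    n = len(S)
    # The preference p sits on the diagonal: a higher p encourages more exemplars.
    S = [[preference if i == k else S[i][k] for k in range(n)] for i in range(n)]
    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)
    exemplars, previous, stable = [], None, 0
    for _ in range(max_iter):
        # Responsibility: how well k suits i compared with i's best rival candidate.
        for i in range(n):
            scores = [A[i][k] + S[i][k] for k in range(n)]
            for k in range(n):
                rival = max(scores[j] for j in range(n) if j != k)
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - rival)
        # Availability: how much support candidate k has gathered from other points.
        for k in range(n):
            pos = [max(0.0, R[i][k]) for i in range(n)]
            support = sum(pos) - pos[k]  # positive responsibilities from i != k
            for i in range(n):
                new = support if i == k else min(0.0, R[k][k] + support - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new
        # A point is an exemplar when its self-responsibility plus self-availability is positive.
        exemplars = [k for k in range(n) if R[k][k] + A[k][k] > 0]
        stable = stable + 1 if exemplars and exemplars == previous else 0
        if stable >= stable_iters:  # exemplar set unchanged for several rounds
            break
        previous = exemplars
    if not exemplars:
        return [], [-1] * n
    labels = [i if i in exemplars else max(exemplars, key=lambda k: S[i][k])
              for i in range(n)]
    return exemplars, labels


# Toy data (an assumption for illustration): two well-separated groups on a line,
# with negative squared distance standing in for a meta-path similarity.
xs = [0.0, 0.1, 0.25, 5.0, 5.2, 5.3]
sim = [[-(a - b) ** 2 for b in xs] for a in xs]
exemplars, labels = affinity_propagation(sim, preference=-1.0, damping=0.5)
print(exemplars, labels)
```

The damping factor blends each new message with its previous value, which is what suppresses oscillation, while the preference controls how many exemplars emerge. Both are fixed by hand in this sketch; the sensitivity of the result to those choices is the issue the abstract sets out to address.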