Similarity-Based Classification in Partially Labeled Networks

Qian-Ming Zhang,Ming-Sheng Shang,Linyuan Lue
DOI: https://doi.org/10.1142/s012918311001549x
2010-01-01
International Journal of Modern Physics C
Abstract:Two main difficulties in the problem of classification in partially labeled networks are the sparsity of the known labeled nodes and inconsistency of label information. To address these two difficulties, we propose a similarity-based method, where the basic assumption is that two nodes are more likely to be categorized into the same class if they are more similar. In this paper, we introduce ten similarity indices defined based on the network structure. Empirical results on the co-purchase network of political books show that the similarity-based method can, to some extent, overcome these two difficulties and give higher accurate classification than the relational neighbors method, especially when the labeled nodes are sparse. Furthermore, we find that when the information of known labeled nodes is sufficient, the indices considering only local information can perform as good as those global indices while having much lower computational complexity.
What problem does this paper attempt to address?