Semi-supervised Classification Based on Clustering Adjusted Similarity

Xia Chen,Chang Lu,Qiaoyu Tan,Guoxian Yu
DOI: https://doi.org/10.1080/1206212x.2017.1329262
2017-01-01
International Journal of Computers and Applications
Abstract:Graph plays crucially important roles in graph-based semi-supervised learning (SSL). Most SSL methods construct a single graph over all instances to explore the manifold structure of instances, and then enforce the smoothness constraint over such graph. However, instances in the real world are not always evenly distributed. Some instances from different classes but close to decision boundary may be close to each other, and thus they are easy to be misclassified. To mitigate this issue, we propose an approach called semi-supervised classification based on clustering adjusted similarity (SSC-CAS). SSC-CAS firstly takes advantage of clustering on both labeled and unlabeled instances to explore the global structure and discrimination of instances, and then quantifies the similarity between pairwise cluster centers. Second, it adjusts the similarity between pairwise instances by multiplying the similarity between centers of clusters they belong to. In this way, if two instances are from different clusters, the similarity between them is reduced; otherwise, unchanged. After that, SSC-CAS performs graph-based semi-supervised classification on the graph constructed by the adjusted similarity. Empirical study on both synthetic and UCI data-sets demonstrates that SSC-CAS not only has better performance than other related comparing methods, but also is robust to the input parameters.
What problem does this paper attempt to address?