Abstract: The cluster assumption, which states that "similar instances should share the same label," is a basic assumption in semi-supervised classification learning and underlies many successful semi-supervised classification methods. It is rarely noticed that adopting the cluster assumption implicitly assumes that every instance has a crisp class label assignment. In real applications, however, there are cases where it is difficult to say that an instance definitely belongs to one class and not to neighboring classes. In such cases, it is more appropriate to assume that "similar instances should share similar label memberships" rather than a crisp label assignment. Here "label memberships" can be represented as a vector in which each element corresponds to a class, and the value of the element expresses the likelihood that the instance belongs to that class. Adopting this modified cluster assumption, in this paper we propose a new semi-supervised classification method, semi-supervised classification based on class membership (SSCCM). Specifically, we solve for the decision function and the label memberships of the instances simultaneously, and require that an instance and its "local weighted mean" (LWM) share the same label membership vector, where the LWM is a robust image of the instance constructed as the weighted mean of its neighboring instances. We formulate the problem as a unified objective function over the labeled data, the unlabeled data, and their LWMs, based on the square loss function, and solve it with an alternating iterative strategy in which each step has a closed-form solution and convergence is guaranteed. The solution provides both a decision function and a label membership function for classification. Their classification results can be used to verify each other, and the reliability of semi-supervised classification learning might be enhanced by checking the consistency between the two predictions. Experiments show that SSCCM obtains encouraging results compared to state-of-the-art semi-supervised classification methods.
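The abstract describes the LWM only as the weighted mean of an instance's neighboring instances, without stating how neighbors or weights are chosen. The following is a minimal sketch of one way such an LWM could be computed, not the paper's definition. It assumes that neighbors are the k nearest points in Euclidean distance (excluding the instance itself) and that weights come from a Gaussian kernel normalized to sum to one. The function name `local_weighted_means` and the parameters `k` and `sigma` are hypothetical.

```python
import numpy as np


def local_weighted_means(X, k=3, sigma=1.0):
    """Return one local weighted mean (LWM) per row of X.

    Assumptions (not taken from the abstract): neighbors are the k nearest
    points in Euclidean distance, excluding the point itself, and weights
    follow a Gaussian kernel normalized to sum to 1. Requires k < len(X).
    """
    X = np.asarray(X, dtype=float)

    # Pairwise squared Euclidean distances, shape (n, n).
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diff ** 2, axis=-1)

    # Exclude each point from its own neighborhood.
    np.fill_diagonal(d2, np.inf)

    # Indices and squared distances of the k nearest neighbors, shape (n, k).
    idx = np.argsort(d2, axis=1)[:, :k]
    nd2 = np.take_along_axis(d2, idx, axis=1)

    # Gaussian weights; subtracting the row minimum avoids underflow and
    # leaves the normalized weights unchanged.
    shifted = nd2 - nd2.min(axis=1, keepdims=True)
    w = np.exp(-shifted / (2.0 * sigma ** 2))
    w /= w.sum(axis=1, keepdims=True)

    # Weighted mean of the neighbors of every point, shape (n, d).
    return np.einsum("nk,nkd->nd", w, X[idx])


X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [10.0, 10.0]])
lwm = local_weighted_means(X, k=2)
```

Because each LWM is a convex combination of nearby instances, it is less sensitive to a single noisy point than the instance itself, which is consistent with the abstract's description of the LWM as a "robust image" of the instance.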

Local Homogeneous Consistent Safe Semi-Supervised Clustering

Locating High-Density Clusters with Noisy Queries

Semi-supervised clustering based on spectral clustering

New semi-supervised classification method based on modified cluster assumption

K-GBS3FCM -- KNN Graph-Based Safe Semi-Supervised Fuzzy C-Means

Graph-based Semi-supervised Local Clustering with Few Labeled Nodes

Semi-supervised Clustering Using Incomplete Prior Knowledge

A semi-supervised clustering approach using labeled data

Semi-supervised Clustering Guided by Pairwise Constraints and Local Density Structures

Using Cluster Information to Improve Label Propagation

A Semi-Supervised Color Image Segmentation Method

A Group-Based Distance Learning Method for Semisupervised Fuzzy Clustering

Stratification-based Semi-supervised Clustering Algorithm for Arbitrary Shaped Datasets

Semi-Supervised Clustering for Financial Risk Analysis

Towards Making Unlabeled Data Never Hurt

An Efficient Semi-Supervised Clustering Algorithm with Sequential Constraints

Safe semi-supervised learning: a brief introduction

Open-Domain Semi-Supervised Learning via Glocal Cluster Structure Exploitation

An efficient semi-supervised balanced cut with hard pairwise constraints and partial labels

Strongly Local P-Norm-cut Algorithms for Semi-Supervised Learning and Local Graph Clustering

Semisupervised Classification with Cluster Regularization