Abstract:The cluster assumption, which assumes that "similar instances should share the same label," is a basic assumption in semi-supervised classification learning, and has been found very useful in many successful semi-supervised classification methods. It is rarely noticed that when the cluster assumption is adopted, there is an implicit assumption that every instance should have a crisp class label assignment. In real applications, however, there are cases where it is difficult to tell that an instance definitely belongs to one class and does not belong to other neighboring classes. In such cases, it is more adequate to assume that "similar instances should share similar label memberships" rather than sharing a crisp label assignment. Here "label memberships" can be represented as a vector, where each element corresponds to a class, and the value at the element expresses the likelihood of the concerned instance belonging to the class. By adopting this modified cluster assumption, in this paper we propose a new semi-supervised classification method, that is, semi-supervised classification based on class membership (SSCCM). Specifically, we try to solve the decision function and adequate label memberships for instances simultaneously, and constrain that an instance and its "local weighted mean" (LWM) share the same label membership vector, where the LWM is a robust image of the instance, constructed by calculating the weighted mean of its neighboring instances. We formulate the problem in a unified objective function for the labeled, unlabeled data and their LWMs based on the square loss function, and take an alternating iterative strategy to solve it, in which each step generates a closed-form solution, and the convergence is guaranteed. The solution will provide both the decision function and the label membership function for classification, their classification results can verify each other, and the reliability of semi-supervised classification learning might be enhanced by checking the consistency between those two predictions. Experiments show that SSCCM obtains encouraging results compared to state-of-the-art semi-supervised classification methods.

A Probabilistic Approach Towards an Unbiased Semi-Supervised Cluster Tree.

Semisupervised Prior Free Rare Category Detection with Mixed Criteria

Dual-Classifier Collaborative Method Based on Semi-Supervised Active Learning

Using Cluster Information to Improve Label Propagation

Semi-supervised clustering based on spectral clustering

Semi-supervised Hierarchical Clustering Analysis for High Dimensional Data

A semi-supervised approach to growing classification trees

Stratification-based Semi-supervised Clustering Algorithm for Arbitrary Shaped Datasets

Semi-Supervised Clustering for Financial Risk Analysis

New semi-supervised classification method based on modified cluster assumption.

Generating Unbiased Pseudo-labels via a Theoretically Guaranteed Chebyshev Constraint to Unify Semi-supervised Classification and Regression

Comparison of semi-supervised and supervised approaches for classification of e-nose datasets: Case studies of tomato juices

Mixed-Integer Linear Optimization for Semi-Supervised Optimal Classification Trees

An efficient semi-supervised balanced cut with hard pairwise constraints and partial labels

Statistical Inference for Cluster Trees

Two Novel Kernel-Based Semi-Supervised Clustering Methods By Seeding

A Cluster-Based Semisupervised Ensemble for Multiclass Classification.

Semi-supervised Hierarchical Optimization-Based Affinity Propagation Algorithm and Its Applications

A Semi-Supervised Graph Neural Network with Confidence Discrimination

Semi-supervised Clustering Guided by Pairwise Constraints and Local Density Structures

A new semi-supervised clustering algorithm for probability density functions and applications