Abstract: The cluster assumption, which states that "similar instances should share the same label," is a basic assumption in semi-supervised classification and underlies many successful semi-supervised classification methods. It is rarely noticed that adopting the cluster assumption implicitly assumes that every instance has a crisp class label assignment. In real applications, however, it can be difficult to say that an instance definitely belongs to one class and not to neighboring classes. In such cases, it is more appropriate to assume that "similar instances should share similar label memberships" rather than a crisp label assignment. Here the "label memberships" of an instance can be represented as a vector in which each element corresponds to a class and its value expresses the likelihood that the instance belongs to that class. Adopting this modified cluster assumption, we propose a new semi-supervised classification method, semi-supervised classification based on class membership (SSCCM). Specifically, we solve for the decision function and adequate label memberships of the instances simultaneously, and constrain an instance and its "local weighted mean" (LWM) to share the same label membership vector, where the LWM is a robust image of the instance constructed as the weighted mean of its neighboring instances. We formulate the problem as a unified objective function over the labeled data, the unlabeled data, and their LWMs based on the square loss function, and solve it with an alternating iterative strategy in which each step has a closed-form solution and convergence is guaranteed.
The solution provides both a decision function and a label membership function for classification. Their classification results can verify each other, and the reliability of semi-supervised classification might be enhanced by checking the consistency between the two predictions. Experiments show that SSCCM obtains encouraging results compared with state-of-the-art semi-supervised classification methods.
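The abstract describes the LWM only as the weighted mean of an instance's neighboring instances, so the following is a minimal sketch of one way such a quantity could be computed, not the authors' implementation. The function name `local_weighted_means`, the use of k nearest neighbors, the Gaussian weighting, and the parameters `k` and `sigma` are all assumptions made for illustration.

```python
import numpy as np


def local_weighted_means(X, k=5, sigma=1.0):
    """Return an array whose i-th row is a weighted mean of the k nearest
    neighbours of X[i] (the instance itself is excluded).

    Assumptions (not taken from the abstract): neighbours are chosen by
    Euclidean distance and weighted by a Gaussian kernel of width sigma.
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    if n < 2:
        raise ValueError("need at least two instances to form a neighbourhood")
    k = min(k, n - 1)

    # Pairwise squared Euclidean distances, shape (n, n).
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)
    np.fill_diagonal(d2, np.inf)  # an instance is not its own neighbour

    # Indices and squared distances of the k nearest neighbours of each row.
    idx = np.argsort(d2, axis=1)[:, :k]
    nd2 = np.take_along_axis(d2, idx, axis=1)

    # Gaussian weights, normalised so each row sums to one.
    w = np.exp(-nd2 / (2.0 * sigma ** 2))
    w_sum = w.sum(axis=1, keepdims=True)
    # If every weight underflowed to zero, fall back to uniform weights.
    safe_sum = np.where(w_sum > 0, w_sum, 1.0)
    w = np.where(w_sum > 0, w / safe_sum, 1.0 / k)

    # Weighted mean of the neighbours: sum_j w[i, j] * X[idx[i, j]].
    return np.einsum("ik,ikd->id", w, X[idx])


if __name__ == "__main__":
    X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [10.0, 10.0]])
    print(local_weighted_means(X, k=2, sigma=1.0))
```

Under the modified cluster assumption, each row of the result would then be constrained to share the label membership vector of the corresponding instance. How the weights are actually defined in SSCCM is not stated in the abstract.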

Semi-supervised Classification Forests

Conformalized Semi-supervised Random Forest for Classification and Abnormality Detection

A survey on semi-supervised learning

A semi-supervised approach to growing classification trees

Mixed-Integer Linear Optimization for Semi-Supervised Optimal Classification Trees

On Discriminative Semi-Supervised Classification.

Semi-supervised Learning for Biomedical Image Segmentation Via Forest Oriented Super Pixels (Voxels).

Mixed-Integer Linear Optimization for Cardinality-Constrained Random Forests

Muffled Semi-Supervised Learning

Semi-supervised Node Splitting for Random Forest Construction

SSXCS: Semi-supervised learning classifier system

Building semi-supervised decision trees with semi-cart algorithm

Semi-Unsupervised Learning: Clustering and Classifying using Ultra-Sparse Labels

A semi-supervised hierarchical classifier based on local information

Semi-supervised Classification for Natural Language Processing

New semi-supervised classification method based on modified cluster assumption.

Semi-Supervised Classification with Universum

Semi-Supervised Graph Classification: A Hierarchical Graph Perspective

Supervised Classification: Quite a Brief Overview

Semi-supervised Multi-label Learning by Solving a Sylvester Equation.

Fractionally-Supervised Classification