A Review of Semi Supervised Learning Theories and Recent Advances

Enmei Tu,Jie Yang
DOI: https://doi.org/10.48550/arXiv.1905.11590
2019-05-28
Abstract:Semi-supervised learning, which has emerged from the beginning of this century, is a new type of learning method between traditional supervised learning and unsupervised learning. The main idea of semi-supervised learning is to introduce unlabeled samples into the model training process to avoid performance (or model) degeneration due to insufficiency of labeled samples. Semi-supervised learning has been applied successfully in many fields. This paper reviews the development process and main theories of semi-supervised learning, as well as its recent advances and importance in solving real-world problems demonstrated by typical application examples.
Machine Learning
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is how to avoid the performance degradation of traditional supervised learning when the number of labeled samples is very small by introducing unlabeled samples in model training. Specifically, semi - supervised learning aims to use a large amount of easily - obtained unlabeled data to enhance the generalization ability and learning efficiency of the model, especially in cases where the cost of obtaining labeled samples is high or it is difficult to obtain a large number, such as in fields like medical diagnosis, destructive experiments (car - crash experiments, rocket launches, etc.). By reviewing the development history, main theories and the latest progress of semi - supervised learning, and combining with the analysis of application examples, the paper shows the important role and potential of semi - supervised learning in solving practical problems.