Safe semi-supervised learning: a brief introduction

Yu-Feng Li,De-Ming Liang
DOI: https://doi.org/10.1007/s11704-019-8452-2
IF: 2.6688
2019-06-18
Frontiers of Computer Science
Abstract:Semi-supervised learning constructs the predictive model by learning from a few labeled training examples and a large pool of unlabeled ones. It has a wide range of application scenarios and has attracted much attention in the past decades. However, it is noteworthy that although the learning performance is expected to be improved by exploiting unlabeled data, some empirical studies show that there are situations where the use of unlabeled data may degenerate the performance. Thus, it is advisable to be able to exploit unlabeled data safely. This article reviews some research progress of safe semi-supervised learning, focusing on three types of safeness issue: data quality, where the training data is risky or of low-quality; model uncertainty, where the learning algorithm fails to handle the uncertainty during training; measure diversity, where the safe performance could be adapted to diverse measures.
computer science, information systems, theory & methods, software engineering
What problem does this paper attempt to address?