Semisupervised Prior Free Rare Category Detection with Mixed Criteria

Ding Tu,Ling Chen,Xiaokang Yu,Gencai Chen
DOI: https://doi.org/10.1109/tcyb.2016.2626295
IF: 11.8
2018-01-01
IEEE Transactions on Cybernetics
Abstract:Rare category detection aims to find interesting and statistically significant anomalies and incorporates ideas from active learning and semisupervised learning. The challenge of rare category detection is to find the rare classes of the anomalies in a data set where the data distribution is skewed. Most existing rare category detection methods suppose that the user knows the specific number of all classes in advance, which cannot be satisfied in most real scenarios. In this paper, we propose a new rare category detection framework composed of active learning and semisupervised hierarchical density-based clustering. The advantage of our method is that it is prior free and can benefit the rare category detecting process with the labeled data. In addition, the proposed framework can handle tasks with non-linear mappings, which increases the ability to find rare classes when the class boundary is sophisticated. Compared to existing methods, better results are achieved by our method on both real and synthetic data sets in the experiment.
What problem does this paper attempt to address?