Combination of Competitive EM Algorithm and Support Vector Machine for Active Learning

X Yi, CS Zhang, BB Zhang
2003-01-01
Abstract: In this paper, we combine Competitive EM (CEM) and SVM for pool-based active learning. The method has two stages. In the Discovering Confident Regions stage, CEM is applied to discover the structure of the probability distribution of the unlabeled data in the pool. In the Querying Informative Data stage, the SVM Active Learning algorithm finds informative data points near the decision boundaries and uses them to adjust the position of the decision hyperplane. Experiments on two data sets (the USPS data set and the breast cancer data set in the UCI repository) show that our algorithm efficiently queries informative points near the decision boundaries into a secondary pool and prevents the position of the learner's separating hyperplane from changing suddenly during the learning procedure. With the help of the information about the unlabeled data discovered in the first stage, it performs very stably in the second stage.
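The two-stage idea can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not confirm: the mixture is taken as already fitted (CEM itself is not implemented), a point counts as "confident" when its largest mixture posterior exceeds a hypothetical `threshold`, the remaining points form the secondary pool, and the query rule picks the secondary-pool point closest to a given linear decision function. The data are one-dimensional toy values, and all function names and parameters are illustrative.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian N(mean, var) at x."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def split_pool(pool, components, threshold=0.9):
    """Stage 1 stand-in: split pool indices by mixture posterior confidence.

    `components` is a list of (weight, mean, var) tuples, assumed to come
    from an already fitted mixture (CEM in the paper; not implemented here).
    The posterior threshold is an assumption, not the paper's criterion.
    """
    confident, secondary = [], []
    for i, x in enumerate(pool):
        dens = [w * gaussian_pdf(x, m, v) for (w, m, v) in components]
        top_posterior = max(dens) / sum(dens)
        if top_posterior >= threshold:
            confident.append(i)   # clearly inside one component
        else:
            secondary.append(i)   # ambiguous, likely near a boundary
    return confident, secondary


def query_index(pool, candidates, w, b):
    """Stage 2 stand-in: pick the candidate closest to the hyperplane w*x + b = 0."""
    return min(candidates, key=lambda i: abs(w * pool[i] + b))


# Toy pool of unlabeled 1-D points.
pool = [-3.0, -2.5, -0.4, 0.1, 0.6, 2.4, 3.1]
# Hypothetical fitted mixture: two equally weighted unit-variance components.
components = [(0.5, -2.5, 1.0), (0.5, 2.5, 1.0)]

confident, secondary = split_pool(pool, components)
# Hypothetical current SVM parameters (w, b) for a linear decision function.
chosen = query_index(pool, secondary, w=1.0, b=-0.3)
```

In a full loop, the chosen point would be labeled by the oracle, added to the training set, and the SVM retrained before the next query. Restricting queries to the secondary pool is one plausible reading of how the first stage keeps the hyperplane from shifting abruptly.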