A Parallelized Semi-Supervised Na(i)ve Bayes Classifier

JIANG Kai,GAO Yang
DOI: https://doi.org/10.3778/j.issn.1673-9418.2012.10.006
2012-01-01
Abstract:Nowadays TBs or even PBs data burst out every day,but there are so few labeled instances for training.For these two problems,this paper combines a semi-supervised Nave Bayes algorithm and the Map-Reduce programming model,and proposes a new algorithm called parallelized semi-supervised Nave Bayes(PSNB) algorithm.Experimental results show that the proposed algorithm can tackle with massive data efficiently,and use the unlabeled instances to improve the performance of the classifier.
What problem does this paper attempt to address?