Web-scale classification with naive bayes.

Congle Zhang,Gui-Rong Xue,Yong Yu,Hongyuan Zha
DOI: https://doi.org/10.1145/1526709.1526867
2009-01-01
Abstract:Traditional Naive Bayes Classifier performs miserably on web-scale taxonomies. In this paper, we investigate the reasons behind such bad performance. We discover that the low performance are not completely caused by the intrinsic limitations of Naive Bayes, but mainly comes from two largely ignored problems: contradiction pair problem and discriminative evidence cancelation problem. We propose modifications that can alleviate the two problems while preserving the advantages of Naive Bayes. The experimental results show our modified Naive Bayes can significantly improve the performance on real web-scale taxonomies.
What problem does this paper attempt to address?