Binarization approaches to email categorization

Yunqing Xia,Kam-Fai Wong
DOI: https://doi.org/10.1007/11940098_50
2006-01-01
Abstract:Email categorization becomes very popular today in personal information management. However, most n-way classification methods suffer from feature unevenness problem, namely, features learned from training samples distribute unevenly in various folders. We argue that the binarization approaches can handle this problem effectively. In this paper, three binarization techniques are implemented, i.e. one-against-rest, one-against-one and some-against-rest, using two assembling techniques, i.e. round robin and elimination. Experiments on email categorization prove that significant improvement has been achieved in these binarization approaches over an n-way baseline classifier.
What problem does this paper attempt to address?