Efficient Text Categorization Using A Min-Max Modular Support Vector Machine

Feng-Yao Liu,Kai-An Wang,Bao-Liang Lu,Masao Utiyama,Hitoshi Isahara
DOI: https://doi.org/10.1007/1-4020-4043-1_2
2006-01-01
Abstract:The min-max modular support vector machine (M-3-SVM) has been proposed for solving large-scale and complex multiclass classification problems. In this paper, we apply the M-3-SVM to multilabel text categorization and introduce two task decomposition strategies into M-3-SVMs. A multilabel classification task can be split up into a set of two-class classification tasks. These two-class tasks are to discriminate class C from non-class C. If these two class tasks are still hard to be learned, we can further divide them into a set of two-class tasks as small as needed and fast training of SVMs on massive multilabel texts can be easily implemented in a massively parallel way. Furthermore, we proposed a new task decomposition strategy called hyperplane task decomposition to improve generalization performance. The experimental results indicate that the new method has better generalization performance than traditional SVMs and previous M-3-SVMs using random task decomposition, and is much faster than traditional SVMs.
What problem does this paper attempt to address?