Reduction Of Training Datasets Via Fuzzy Entropy For Support Vector Machines

Zhongdong Wu, Jianping Yu, Weixin Xie, Xinbo Gao
DOI: https://doi.org/10.1109/ICSMC.2004.1400685
2004-01-01
Abstract: Support vector machines (SVMs) are currently state-of-the-art models for many classification problems, but their training algorithm has a complexity that is at least quadratic in the number of examples. Hence, it is hard to solve real-life problems with more than a few hundred thousand examples using SVMs. This paper proposes a new heuristic method based on fuzzy entropy. When the original training set contains only a few support vectors, the method can effectively pre-select a boundary subset that contains the overwhelming majority of the support vectors. Substituting this boundary subset for the original training set greatly reduces training time while leaving the classification ability of the SVM unaffected. Compared with other analogous methods, the merit of our method is that it requires no parameters to determine the border of the subset. Preliminary experimental results indicate that the approach is efficient and practical.
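The abstract does not spell out how the fuzzy memberships or the boundary rule are computed, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a two-class problem, a hypothetical membership function (a sigmoid of the scaled difference in squared distance to the two class centroids), the standard Shannon-type per-sample fuzzy entropy, and a hypothetical cut-off at the mean entropy. All function names are illustrative. Samples whose membership is most ambiguous (entropy near 1) lie near the class boundary and are kept as candidate support vectors.

```python
import math


def fuzzy_entropy(mu, eps=1e-12):
    """Shannon-type fuzzy entropy of one membership value, in bits."""
    mu = min(max(mu, eps), 1.0 - eps)  # avoid log(0)
    return -(mu * math.log2(mu) + (1.0 - mu) * math.log2(1.0 - mu))


def centroid(points):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def select_boundary_subset(X, y):
    """Return (indices of kept samples, per-sample entropies).

    ASSUMPTIONS (not taken from the paper): centroid-based sigmoid
    membership and a mean-entropy cut-off.
    """
    labels = sorted(set(y))
    if len(labels) != 2:
        raise ValueError("this sketch handles exactly two classes")
    c0 = centroid([x for x, t in zip(X, y) if t == labels[0]])
    c1 = centroid([x for x, t in zip(X, y) if t == labels[1]])
    scale = sq_dist(c0, c1)
    if scale == 0.0:
        raise ValueError("class centroids coincide")

    entropies = []
    for x in X:
        # Positive z: x is closer to c0, so membership in class 0 is high.
        z = (sq_dist(x, c1) - sq_dist(x, c0)) / scale
        z = max(-50.0, min(50.0, z))  # keep math.exp from overflowing
        mu0 = 1.0 / (1.0 + math.exp(-z))
        entropies.append(fuzzy_entropy(mu0))

    # Hypothetical rule: keep samples that are more ambiguous than average.
    threshold = sum(entropies) / len(entropies)
    keep = [i for i, h in enumerate(entropies) if h > threshold]
    return keep, entropies


if __name__ == "__main__":
    X = [[-5.0, 0.0], [-4.0, 0.0], [-0.5, 0.0],
         [0.5, 0.0], [4.0, 0.0], [5.0, 0.0]]
    y = [0, 0, 0, 1, 1, 1]
    keep, H = select_boundary_subset(X, y)
    print("kept indices:", keep)
```

In this toy data set only the two points closest to the midpoint between the classes exceed the mean entropy, so an SVM would then be trained on that reduced subset instead of all six points. The sketch uses only the standard library; the same logic vectorizes directly with an array library.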