K-SVM: an Effective SVM Algorithm Based on K-means Clustering
Yukai Yao,Yang Liu,Yongqing Yu,Hong Xu,Weiming Lv,Zhao Li,Xiaoyun Chen
DOI: https://doi.org/10.4304/jcp.8.10.2632-2639
2013-01-01
Journal of Computers
Abstract:Support Vector Machine (SVM) is one of the most popular and effective data mining algorithms which can be used to resolve classification or regression problems, and has attracted much attention these years. SVM could find the optimal separating hyperplane between classes, which afford outstanding generalization ability with it. Usually all the labeled records are used as training set. However, the optimal separating hyperplane only depends on a few crucial samples (Support Vectors, SVs), we needn't train SVM model on the whole training set. In this paper a novel SVM model based on K-means clustering is presented, in which only a small subset of the original training set is selected to constitute the final training set, and the SVM classifier is built through training on these selected samples. This greatly decrease the scale of the training set, and effectively saves the training and predicting cost of SVM, meanwhile guarantees its generalization performance.