One-class support vector machines for large-scale data sets

Zhibo Xiao, Huangang Wang, Yingchao Xiao, Wenli Xu
DOI: https://doi.org/10.3969/j.issn.1001-0505.2013.S1.043
2013-01-01
Abstract: A method for training a one-class support vector machine (OCSVM) on large-scale data sets is proposed. Based on the k-nearest neighbor principle, the method selects inner points that represent the distribution characteristics of the original large-scale data set, and then generates edge points from the selected inner points. The two kinds of points are combined into a new data set on which the OCSVM is trained. The new data set is much smaller than the original one while preserving its distribution characteristics. The method thus effectively addresses the problems OCSVM faces on large-scale data sets, such as long training time, complicated models, and slow prediction. Experiments on typical data sets illustrate the effectiveness of the proposed method.
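The abstract does not spell out how inner points are selected or how edge points are generated, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes that inner points are picked by a greedy k-nearest-neighbor cover (keep a point, discard its k nearest neighbors as already represented), and that an edge point is generated by stepping an inner point away from the mean of its k nearest neighbors. The function names, the parameter `step`, and the grid demo data are all hypothetical.

```python
import math


def k_nearest(points, idx, k):
    """Indices of the k nearest neighbours of points[idx] (brute force)."""
    others = [j for j in range(len(points)) if j != idx]
    others.sort(key=lambda j: math.dist(points[idx], points[j]))
    return others[:k]


def select_inner_points(points, k):
    """ASSUMPTION: greedy cover. Keep a point, then mark its k nearest
    neighbours as represented so they are not selected themselves."""
    covered = set()
    inner = []
    for i in range(len(points)):
        if i in covered:
            continue
        inner.append(i)
        covered.add(i)
        covered.update(k_nearest(points, i, k))
    return inner


def generate_edge_points(points, inner, k, step=1.0):
    """ASSUMPTION: for each inner point, step away from the mean of its
    k nearest neighbours. Points whose neighbourhood is symmetric
    (zero shift) produce no edge point."""
    edge = []
    for i in inner:
        nbrs = k_nearest(points, i, k)
        if not nbrs:
            continue
        dim = len(points[i])
        mean = [sum(points[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
        shift = [points[i][d] - mean[d] for d in range(dim)]
        if math.hypot(*shift) == 0:
            continue
        edge.append(tuple(points[i][d] + step * shift[d] for d in range(dim)))
    return edge


# Hypothetical demo data: a 10 x 10 grid standing in for a large data set.
points = [(float(x), float(y)) for x in range(10) for y in range(10)]
inner = select_inner_points(points, k=5)
edge = generate_edge_points(points, inner, k=5)

# Reduced training set: selected inner points plus generated edge points.
reduced = [points[i] for i in inner] + edge
print(len(points), len(inner), len(edge))
```

The reduced set could then be passed to any OCSVM trainer, for example scikit-learn's `OneClassSVM`, in place of the full data. The brute-force neighbor search here is quadratic in the number of points; a real large-scale setting would likely need a spatial index or approximate neighbor search.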