Abstract:Support Vector Machine (SVM) is a well-known classification technique which has achieved excellent performance in many nonlinear and high dimensional pattern recognition fields. However, due to the high time complexity of training SVM model, it’s difficult to implement it for large-scale data sets. One of the most promising solutions is to reduce the training data used for establishing the optimal classification hyperplane by means of selecting relevant support vectors which are the only factors affecting the classification rule. Thus, instance selection method is an efficient pre-processing technique to reduce the computational complexity and storage requirements of the learning process. In this manuscript, considering the geometry-distribution of data sets, we propose a Half Shell Extraction (HSE) algorithm which falls into the condensation category of instance selection methods. Moreover, fuzzy distance metric based on locality sensitive hash is employed to accelerate the instance selection process. Empirically, an experimental study involving various of data sets is carried out to compare the proposed algorithm with five competitive algorithms, and the results obtained show that the proposed algorithm consistently outperforms the other algorithms in terms of accuracy, reduction capability and runtime.

Reduction Of Training Datasets Via Fuzzy Entropy For Support Vector Machines

Fuzzy Time Series Analysis Method Based on Least Squares Support Vector Machines

A Novel Svm Modeling Approach For Highly Imbalanced And Overlapping Classification

Feature Selection and Parameter Optimization for Support Vector Machines: A New Approach Based on Genetic Algorithm with Feature Chromosomes.

A Type-2 Fuzzy Logic Ensemble SVM Classifier Based on Feature Weighting

Fuzzy Support Vector Machine For Regression Estimation

Fast instance selection method for SVM training based on fuzzy distance metric

Improved Kernel Method for Clustering Based on Fuzzy 1-SVM

M-estimator Based Support Vector Machine and Its Application

A Fast Training Method for OC-SVM Based on the Random Sampling Lemma

An Incremental Updating Method for Support Vector Machines

A New Method of Sample Reduction for Support Vector Classification

Fuzzy support vector clustering

A hybrid method for speeding SVM training

Fuzzy Chance Constrained Support Vector Machine

A Study of Modelling Non-Stationary Time Series Using Support Vector Machines with Fuzzy Segmentation Information

A Geometric Approach To Train Svm On Very Large Data Sets

Approximate Approach to Train SVM on Very Large Data Sets

An Efficient Instance Selection Algorithm to Reconstruct Training Set for Support Vector Machine

Hybrid SVM algorithm oriented to classifying imbalanced datasets

Fast extraction strategy of support vector machines