Abstract:Anomaly detection plays an important role in industry, especially in ensuring system safety and product quality. Due to the unavailability of anomalous data in many practical cases, anomaly detection is usually solved by one-class classification (OCC) methods using only normal data. As a classical OCC method, one-class support vector machine (OCSVM) is a popular discriminative approach for anomaly detection, which detects abnormal data points by establishing a decision boundary in the kernel space. However, the performance of OCSVM heavily relies on kernel parameters, whose selection is not trivial for anomaly detection problems. Moreover, for some uneven and complex data distributions, different data regions may have quite different densities and shapes, making it difficult for OCSVM to obtain good boundaries in all regions using a global kernel parameter. To address the above two issues, in this article, we propose a hybrid algorithm incorporating vector quantization and OCSVM (VQ-OCSVM). Specifically, vector quantization is used to extract distribution information of normal data, and the results are used to construct an explicit mapping function to map data into a high-dimensional feature space. Then, OCSVM is performed in the feature space to build a classifier. By introducing the explicit mapping into OCSVM, the proposed method can effectively bypass the kernel parameter selection problem of the classical OCSVM method. Furthermore, the constructed mapping carries the data distribution information, and the VQ-OCSVM model can be regarded as an integration of generative learning and discriminative learning. The complementary properties of these two paradigms make the proposed VQ-OCSVM algorithm have better generalization capacity for complex data distribution. Both qualitative and quantitative experimental results demonstrate the effectiveness and advantages of the proposed method.

One-class support vector machines for large-scale data sets

A Fast Training Method for OC-SVM Based on the Random Sampling Lemma

MMSVC: an Efficient Unsupervised Learning Approach for Large-Scale Datasets.

A Multi-Class Large Margin Classifier

Recursive Training Algorithm for One-Class Support Vector Machine Based on Active Set Method

An On-Line Learning Approach with Support Vector Dormain Classifier

Fast Incremental Learning Method for One-Class Support Vector Machine

Approximate Approach to Train SVM on Very Large Data Sets

Novel One-Vs-rest Classifier Based on SVM and Multi-Spheres

A Class-Incremental Learning Method Based on One Class Support Vector Machine

A Hybrid Algorithm Incorporating Vector Quantization and One-Class Support Vector Machine for Industrial Anomaly Detection

Large-scale support vector machine classification with redundant data reduction

A Weighted One-Class Support Vector Machine

Distributed Online One-Class Support Vector Machine for Anomaly Detection over Networks

An Online Incremental Learning Support Vector Machine for Large-Scale Data

A Geometric Approach To Train Svm On Very Large Data Sets

Robust One-Class SVM for Fault Detection

Multi-View Scaling Support Vector Machines for Classification and Feature Selection.

An Efficient Algorithm for a Class of Large-Scale Support Vector Machines Exploiting Hidden Sparsity.

Mini-batch Quasi-Newton Optimization for Large Scale Linear Support Vector Regression

A sparse semismooth Newton based augmented Lagrangian method for large-scale support vector machines