Abstract: Support Vector Machine (SVM) regression is an important technique in data mining. SVM training is expensive, and its cost is dominated by (i) kernel value computation and (ii) a search operation that finds extreme training data points for adjusting the regression function in every training iteration. Existing training algorithms for SVM regression do not scale to large datasets because (i) each training iteration repeatedly performs expensive kernel value computations, which is inefficient and requires holding the whole training dataset in memory, and (ii) the search operation in each training iteration considers the whole search space, which is very expensive. In this article, we significantly improve the scalability and efficiency of SVM regression by exploiting the high performance of Graphics Processing Units (GPUs) and solid state drives (SSDs). Our key ideas are as follows. (i) To reduce the cost of repeated kernel value computations and avoid holding the whole training dataset in GPU memory, we precompute all the kernel values and store them in CPU memory extended by the SSD. Combined with an efficient retrieval strategy, reusing the precomputed kernel values is much faster than computing them on the fly. This also alleviates the restriction that the training dataset must fit into GPU memory, and hence makes our algorithm scalable to large datasets, especially those with very high dimensionality. (ii) To speed up the frequently used search operation, we design an algorithm that minimizes the search space and the number of accesses to GPU global memory; this optimized search algorithm also avoids branch divergence among GPU threads (one cause of poor performance), achieving high utilization of GPU resources. Together, our proposed techniques form a scalable solution for SVM regression, which we call SIGMA.
Our extensive experimental results show that SIGMA is highly efficient and can handle very large datasets that the state-of-the-art GPU-based algorithm cannot. On datasets small enough for that algorithm to handle, SIGMA consistently outperforms it by an order of magnitude and achieves speedups of up to 86 times.
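
The two ideas summarized in the abstract (precomputing kernel values in storage outside the compute device, and restricting the search for extreme points to a reduced candidate set) can be illustrated with a small CPU-only sketch. This is not the SIGMA implementation: the RBF kernel, the on-disk row layout, and the helper names `precompute_kernel_rows`, `read_kernel_row`, and `find_extreme` are all assumptions made for illustration, and a temporary file stands in for SSD-backed storage.

```python
import math
import os
import tempfile
from array import array


def rbf(x, y, gamma):
    """Gaussian (RBF) kernel value between two points (kernel choice is an assumption)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


def precompute_kernel_rows(points, gamma, path):
    """Write the full n x n kernel matrix to disk, one row of doubles at a time."""
    with open(path, "wb") as f:
        for x in points:
            array("d", (rbf(x, y, gamma) for y in points)).tofile(f)


def read_kernel_row(path, i, n):
    """Fetch row i of the stored kernel matrix with a single seek and read."""
    row = array("d")
    with open(path, "rb") as f:
        f.seek(i * n * row.itemsize)
        row.fromfile(f, n)
    return row


def find_extreme(scores, candidates):
    """Return the candidate index with the largest score.

    Scanning only `candidates` stands in for a reduced search space;
    how SIGMA chooses that space is not described in the abstract.
    """
    return max(candidates, key=lambda i: scores[i])


# Hypothetical toy data: three 2-D training points.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
path = os.path.join(tempfile.mkdtemp(), "kernel.bin")
precompute_kernel_rows(points, gamma=0.5, path=path)

# A training iteration would reuse stored rows instead of recomputing them.
row1 = read_kernel_row(path, 1, len(points))
best = find_extreme([0.1, 0.9, 0.5], candidates=[0, 2])
```

Storing the matrix row by row means a training iteration that needs the kernel values for one data point can obtain them with one contiguous read, which is the access pattern that fast storage serves well. The cost is quadratic storage in the number of training points.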
