Abstract:Support Vector Machine (SVM) regression is an important technique in data mining. The SVM training is expensive and its cost is dominated by: (i) the kernel value computation, and (ii) a search operation which finds extreme training data points for adjusting the regression function in every training iteration. Existing training algorithms for SVM regression are not scalable to large datasets because: (i) each training iteration repeatedly performs expensive kernel value computations, which is inefficient and requires holding the whole training dataset in memory; (ii) the search operation used in each training iteration considers the whole search space which is very expensive. In this article, we significantly improve the scalability and efficiency of SVM regression by exploiting the high performance of Graphics Processing Units (GPUs) and solid state drives (SSDs). Our key ideas are as follows. (i) To reduce the cost of repeated kernel value computations and avoid holding the whole training dataset in the GPU memory, we precompute all the kernel values and store them in the CPU memory extended by the SSD; together with an efficient strategy to read the precomputed kernel values, reusing precomputed kernel values with an efficient retrieval is much faster than computing them on-the-fly. This also alleviates the restriction that the training dataset has to fit into the GPU memory, and hence makes our algorithm scalable to large datasets, especially for large datasets with very high dimensionality. (ii) To enhance the performance of the frequently used search operation, we design an algorithm that minimizes the search space and the number of accesses to the GPU global memory; this optimized search algorithm also avoids branch divergence (one of the causes for poor performance) among GPU threads to achieve high utilization of the GPU resources. Our proposed techniques together form a scalable solution to the SVM regression which we call SIGMA. Our extensive experimental results show that SIGMA is highly efficient and can handle very large datasets which the state-of-the-art GPU-based algorithm cannot handle. On the datasets of size that the state-of-the-art GPU-based algorithm can handle, SIGMA consistently outperforms the state-of-the-art GPU-based algorithm by an order of magnitude and achieves up to 86 times speedup.

GPU Acceleration of Interior Point Methods in Large Scale SVM Training

Scalable and Fast SVM Regression Using Modern Hardware.

MASCOT: Fast and Highly Scalable SVM Cross-Validation Using GPUs and SSDs

Accelerating Support Vector Machine Learning With Gpu-Based Mapreduce

Speeding up LP-SVM Using CUDA Platform

Scaling Support Vector Machines on Modern HPC Platforms

Recipe for Fast Large-scale SVM Training: Polishing, Parallelism, and more RAM!

Parallel and Distributed Structured SVM Training

A Parallel and Scalable Digital Architecture for Training Support Vector Machines

MIC-SVM: A Highly Efficient Support Vector Machine For Modern HPC Architectures

Parallelizing Support Vector Machines on Distributed Computers

A GPU-RSVM based intrusion detection classifier

CuMF_SGD: Fast and Scalable Matrix Factorization.

Fast Training Support Vector Machines Using Parallel Sequential Minimal Optimization

Support Vector Machine Implementation on MPI-CUDA and Tensorflow Framework

Improving Dense Linear Equation Solver on Hybrid CPU-GPU System.

A Scalable Hybrid Algorithm for Solving Partial Differential Equations on a Cluster of CPU/GPU

MIC-SVM: Designing a Highly Efficient Support Vector Machine for Advanced Modern Multi-core and Many-Core Architectures

Fast Parallel SVM using Data Augmentation

A Load-Balancing Divide-and-Conquer SVM Solver.

Towards a Heterogeneous Architecture Solver for the Incompressible Navier–Stokes Equations