Abstract: Structured Support Vector Machines (structured SVMs) are a fundamental machine learning method, with a solid theoretical foundation and high effectiveness in applications such as natural language parsing and computer vision. However, training structured SVMs is very time-consuming because of the large number of constraints and slow convergence, especially on large training data sets. This high training cost has hindered their adoption in new applications. In this article, we aim to improve the efficiency of structured SVMs by proposing FastSSVM, a parallel and distributed solution for training structured SVMs built on top of MPI and OpenMP. FastSSVM applies a series of optimizations (e.g., to data storage and synchronization) to make efficient use of both the nodes in a cluster and the cores within each node. Moreover, FastSSVM tackles the large constraint set by batch processing and addresses slow convergence by adapting the stopping conditions based on the improvement made in each iteration. We prove that our solution is guaranteed to converge to a global optimum. A comprehensive experimental study shows that FastSSVM achieves at least a fourfold speedup over existing solutions and, in some cases, a speedup of two to three orders of magnitude.

A Parallel SVM Training Algorithm on Large-Scale Classification Problems

A Parallel and Scalable Digital Architecture for Training Support Vector Machines

An Improved Cascade SVM Training Algorithm with Crossed Feedbacks

An Improved Parallel SVM Algorithm on Distributed System

Research on Cascade-Grouping Parallel SVM Algorithm Based on MapReduce

An On-Line Learning Approach with Support Vector Domain Classifier

SVM Algorithms for Large Scale Classification Problems Based on Data Partition and Ensemble Learning

Research on Parallel SVM Algorithm Based on Cascade SVM

Large-scale Linear Nonparallel SVMs

Research of Parallel SVM Algorithm Based on CUDA

Large-scale parallel SVM implementation

Vote Parallel SVM: an Extension of Parallel Support Vector Machine

Parallel and Distributed Structured SVM Training

Fast Parallel SVM using Data Augmentation

An Improvement SVM Learning Algorithm with Parallel Processing

Parallelizing Support Vector Machines on Distributed Computers

On Parallel Learning Based on Support Vector Machines

An intelligent system for accelerating parallel SVM classification problems on large datasets using GPU

Parallel network traffic classification method based on SVM

A distributed sequential solver for large-scale SVMs

A Hierarchical and Parallel Support Vector Machines Algorithm for Reducing the Training Time