Abstract:<p class="a-plus-plus">An SVM with an RBF kernel is usually among the best classification algorithms for most data sets, but its two hyperparameters, <em class="a-plus-plus">C</em> and <span class="a-plus-plus inline-equation id-i-eq1"><span class="a-plus-plus equation-source format-t-e-x">γ</span></span>, must be tuned to the data at hand. In general, selecting the hyperparameters is a non-convex optimization problem, and many algorithms have been proposed to solve it, among them grid search, random search, Bayesian optimization, simulated annealing, particle swarm optimization, Nelder-Mead, and others. There have also been proposals to decouple the selection of <span class="a-plus-plus inline-equation id-i-eq2"><span class="a-plus-plus equation-source format-t-e-x">γ</span></span> and <em class="a-plus-plus">C</em>. We empirically compare 18 of these proposed search algorithms (with different parameterizations, for a total of 47 combinations) on 115 real-life binary data sets.
We find (among other things) that trees of Parzen estimators and particle swarm optimization select better hyperparameters with only a slight increase in computation time relative to a grid search with the same number of evaluations. We also find that spending too much computational effort on the hyperparameter search is unlikely to yield better performance on future data, and that there are no significant differences among the procedures for selecting the best set of hyperparameters when the search algorithms find more than one.</p>
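To make the comparison at equal evaluation budgets concrete, the following minimal sketch runs a grid search and a random search over log2(C) and log2(γ) with 100 evaluations each. It is an illustration only, not the paper's experimental code: `surrogate_error` is a hypothetical non-convex stand-in for a cross-validated error surface, and the search ranges, budget, and seed are assumptions chosen for the example.

```python
import math
import random


def surrogate_error(log2_c, log2_gamma):
    """Hypothetical stand-in for cross-validated error (not from the paper).

    A shallow bowl plus a ripple, so the surface is non-convex.
    """
    bowl = 0.002 * ((log2_c - 3.0) ** 2 + (log2_gamma + 5.0) ** 2)
    ripple = 0.03 * math.sin(log2_c) * math.cos(log2_gamma)
    return 0.10 + bowl + ripple


def grid_search(objective, c_range, g_range, steps):
    """Evaluate a steps x steps grid, evenly spaced in log2 space."""
    best = None
    for i in range(steps):
        for j in range(steps):
            lc = c_range[0] + i * (c_range[1] - c_range[0]) / (steps - 1)
            lg = g_range[0] + j * (g_range[1] - g_range[0]) / (steps - 1)
            err = objective(lc, lg)
            if best is None or err < best[0]:
                best = (err, lc, lg)
    return best


def random_search(objective, c_range, g_range, n_evals, seed=0):
    """Evaluate n_evals points drawn uniformly in log2 space."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_evals):
        lc = rng.uniform(*c_range)
        lg = rng.uniform(*g_range)
        err = objective(lc, lg)
        if best is None or err < best[0]:
            best = (err, lc, lg)
    return best


# Assumed search box in log2 units (illustrative, not taken from the paper).
C_RANGE = (-5.0, 15.0)
GAMMA_RANGE = (-15.0, 3.0)

# Same budget for both methods: 10 x 10 grid vs. 100 random draws.
grid_best = grid_search(surrogate_error, C_RANGE, GAMMA_RANGE, steps=10)
rand_best = random_search(surrogate_error, C_RANGE, GAMMA_RANGE, n_evals=100)

for name, (err, lc, lg) in (("grid", grid_best), ("random", rand_best)):
    print(f"{name}: error={err:.4f} C=2^{lc:.2f} gamma=2^{lg:.2f}")
```

In a real study the objective would be the cross-validated error of an RBF SVM trained with the candidate C and γ; the surrogate is used here only so the sketch runs with the standard library alone.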

Effects of Random Sampling on SVM Hyper-parameter Tuning

Effectiveness of Random Search in SVM hyper-parameter tuning

How to tune the RBF SVM hyperparameters? An empirical evaluation of 18 search algorithms

A Novel Orthogonal Direction Mesh Adaptive Direct Search Approach for SVM Hyperparameter Tuning

A meta-learning recommender system for hyperparameter tuning: Predicting when tuning improves SVM classifiers

Using sequential statistical tests for efficient hyperparameter tuning

A Comparative Study of Hyperparameter Tuning Methods

Random sampling-based automatic parameter tuning for nonlinear programming solvers

Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms

A New Method for SVM Hyper-parameters Optimization

On the Sampling Strategy for Evaluation of Spectral-Spatial Methods in Hyperspectral Image Classification

Analyzing Search Techniques for Autotuning Image-based GPU Kernels: The Impact of Sample Sizes

Hyperparameter Tuning Algorithm Comparison with Machine Learning Algorithms

Empirical comparison of cross-validation and internal metrics for tuning SVM hyperparameters

A comparison of hyperparameter tuning procedures for clinical prediction models: A simulation study

Discrete Simulation Optimization for Tuning Machine Learning Method Hyperparameters

Search Algorithms for Automated Hyper-Parameter Tuning

Agent-based Collaborative Random Search for Hyper-parameter Tuning and Global Function Optimization

Intelligent sampling for surrogate modeling, hyperparameter optimization, and data analysis

Hyperparameter Search for Machine Learning Algorithms for Optimizing the Computational Complexity