Surrogate Model Based Hyperparameter Tuning for Deep Learning with SPOT

Thomas Bartz-Beielstein, Frederik Rehbach, Amrita Sen, Martin Zaefferer
DOI: https://doi.org/10.48550/arXiv.2105.14625
2021-05-30
Machine Learning
Abstract: A surrogate-model-based hyperparameter tuning approach for deep learning is presented. This article demonstrates how the architecture-level parameters (hyperparameters) of deep learning models implemented in Keras/TensorFlow can be optimized. The implementation of the tuning procedure is fully accessible from R, the software environment for statistical computing. With a few lines of code, existing R packages (tfruns and SPOT) can be combined to perform hyperparameter tuning. An elementary hyperparameter tuning task (a neural network and the MNIST data) is used to exemplify this approach.
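
To make the idea of surrogate-model-based tuning concrete, the following is a minimal Python sketch of the general loop: evaluate a few configurations, fit a cheap model to the results, and let that model propose the next configuration to evaluate. It is an illustration under stated assumptions, not the paper's implementation. It does not use SPOT, tfruns, or Keras; the objective `toy_loss` is a hypothetical stand-in for an expensive training run, the single tuned variable is assumed to be a log10 learning rate, and a plain quadratic least-squares fit is swapped in for whatever surrogate the authors use.

```python
def toy_loss(x):
    """Hypothetical stand-in for an expensive training run (not from the paper).

    x is assumed to be log10(learning rate); the minimum is placed at x = -2.5.
    """
    return (x + 2.5) ** 2 + 0.3


def fit_quadratic(xs, ys):
    """Least-squares fit of y ~ a + b*x + c*x^2 via the normal equations."""
    # Power sums for the 3x3 normal-equation matrix and right-hand side.
    s = [sum(x ** k for x in xs) for k in range(5)]
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    m = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * w for v, w in zip(m[r], m[col])]
    return [m[i][3] / m[i][i] for i in range(3)]


def tune(objective, lower, upper, n_init=4, n_iter=6, n_cand=201):
    """Sequential surrogate-based search over one variable in [lower, upper]."""
    # Initial design: evenly spaced points, each evaluated on the real objective.
    xs = [lower + i * (upper - lower) / (n_init - 1) for i in range(n_init)]
    ys = [objective(x) for x in xs]
    # Candidate grid that is only ever scored on the cheap surrogate.
    cands = [lower + i * (upper - lower) / (n_cand - 1) for i in range(n_cand)]
    for _ in range(n_iter):
        a, b, c = fit_quadratic(xs, ys)
        # Pick the candidate the surrogate predicts to be best ...
        x_new = min(cands, key=lambda x: a + b * x + c * x * x)
        # ... and spend one real (expensive) evaluation on it.
        xs.append(x_new)
        ys.append(objective(x_new))
    best = min(range(len(xs)), key=lambda i: ys[i])
    return xs[best], ys[best]


if __name__ == "__main__":
    best_x, best_y = tune(toy_loss, lower=-5.0, upper=0.0)
    print("best log10(learning rate):", best_x, "loss:", best_y)
```

The point of the design is that the expensive objective is called only `n_init + n_iter` times, while the surrogate is queried on every candidate. A purely greedy rule like this one does no deliberate exploration, which is acceptable for a smooth toy function but would usually be too naive for a real tuning task.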