Efficient Hyperparameter Optimization with Probability-based Resource Allocating on Deep Neural Networks

Wenguo Li,Xudong Yin,Mudan Ye,Pengxu Zhu,Jinghua Li,Yao Yang
DOI: https://doi.org/10.1016/j.neucom.2024.127907
IF: 6
2024-01-01
Neurocomputing
Abstract:Hyperparameter tuning is an essential step to obtain high model performance on the validation set before applying machine learning models. Hand tuning always requires massive labor and professional knowledge of specific machine learning fields. Various automatic hyperparameter optimization algorithms have been developed to address this problem. A well-known algorithm, Population Based Training (PBT), bridges parallel search and sequential optimization by introducing online adaptation of both hyperparameters and network weights, significantly enhancing optimization efficiency despite its static adaptation scheme. Inspired by this framework, we further propose a new efficient heuristic optimization algorithm called Probability-based Resource Allocating (PRA). PRA first applies a novel resource allocation scheme, concentrating resources efficiently and dynamically on well performing hyperparameter configurations and allowing the possibility of the escape from local optima. Furthermore, an innovative exploration strategy inspired by gradient descent methods is proposed to guide the search directions for new hyperparameters. Besides, PRA employs a hybrid approach of learning and evolving network weights. All of these contribute to accelerating the search for optimal hyperparameter configurations. In a series of experiments on deep neural networks, PRA achieved better performance than PBT-series algorithms and other typical Bayesian Optimization methods, demonstrating fast convergence and a low time budget.
What problem does this paper attempt to address?