Tuning the hyper-parameters of CMA-ES with tree-structured Parzen estimators

Meng Zhao,Jinlong Li
DOI: https://doi.org/10.1109/ICACI.2018.8377530
2018-01-01
Abstract:CMA-ES is widely used for non-linear and non-convex function optimization, but tuning the hyper-parameters of CMA-ES is a practical challenge. There are three hyper-parameters c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">c</sub> , c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sub> and c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">μ</sub> of CMA-ES, and it is important for the covariance matrix updates to configure their values. Based on the constraints among c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">c</sub> , ci and c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">μ</sub> , we design a tree-structured graph to describe their relationships. We maximize Expected Improvement (EI) to search the configuration space of c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">c</sub> , c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sub> and c <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">μ</sub> , which is based on the distribution of solution quality and the conditional distribution of configuration given solution quality. The two distributions are modeled by the Tree-structured Parzen Estimators (TPE). We evaluate our approach on the BBOB noiseless problems. The experimental results show that our approach mostly gets a faster convergence towards the optimal solutions when compared with the default CMA-ES and the state-of-the-art algorithm self-CMA-ES.
What problem does this paper attempt to address?