Preference-Optimized Pareto Set Learning for Blackbox Optimization

Zhang Haishan,Diptesh Das,Koji Tsuda
2024-08-19
Abstract:Multi-Objective Optimization (MOO) is an important problem in real-world applications. However, for a non-trivial problem, no single solution exists that can optimize all the objectives simultaneously. In a typical MOO problem, the goal is to find a set of optimum solutions (Pareto set) that trades off the preferences among objectives. Scalarization in MOO is a well-established method for finding a finite set approximation of the whole Pareto set (PS). However, in real-world experimental design scenarios, it's beneficial to obtain the whole PS for flexible exploration of the design space. Recently Pareto set learning (PSL) has been introduced to approximate the whole PS. PSL involves creating a manifold representing the Pareto front of a multi-objective optimization problem. A naive approach includes finding discrete points on the Pareto front through randomly generated preference vectors and connecting them by regression. However, this approach is computationally expensive and leads to a poor PS approximation. We propose to optimize the preference points to be distributed evenly on the Pareto front. Our formulation leads to a bilevel optimization problem that can be solved by e.g. differentiable cross-entropy methods. We demonstrated the efficacy of our method for complex and difficult black-box MOO problems using both synthetic and real-world benchmark data.
Machine Learning,Optimization and Control
What problem does this paper attempt to address?
This paper attempts to address several key challenges in multi - objective optimization (MOO) problems, especially for black - box optimization (BO) scenarios. Specifically: 1. **Non - single - solution problem**: In practical multi - objective optimization problems, there is usually no single solution that can optimize all objectives simultaneously. Therefore, the research objective is to find a set of optimal solutions (Pareto set), which can make trade - offs between different objectives. 2. **Complete approximation of the Pareto set**: Most of the existing methods can only generate a finite set of Pareto solutions and cannot completely approximate the entire Pareto set. This is not very useful from the perspective of decision - makers because it may not contain solutions with the desired preferences. 3. **Computational efficiency and flexibility**: Traditional MOO methods based on scalarization (such as randomly generating preference vectors and connecting discrete points by regression) are computationally expensive and have poor performance. For complex and difficult black - box MOO problems, a more efficient and flexible method is required to explore the design space. To solve the above problems, the authors propose the **Preference - Optimized Pareto Set Learning (PO - PSL)** method. Its main contributions are as follows: - **Bi - level optimization problem**: The Pareto Set Learning (PSL) problem is formulated as a bi - level optimization problem. By optimizing the modeling, the learning process of the Pareto set model is accelerated, and it converges to the true Pareto front more quickly. - **End - to - end learning algorithm**: A preference - optimized PSL algorithm is proposed, which can learn the entire Pareto front and optimize the preference vector in an end - to - end manner. - **Better approximation ability**: Compared with existing methods, PO - PSL can better approximate the true Pareto front, especially when dealing with non - convex, irregular or degenerate Pareto fronts. ### Mathematical Formulas The mathematical formulas involved in this paper include, but are not limited to, the following: - **Definition of multi - objective optimization problem**: \[ x^*=\arg\min_{x\in X}f(x)=(f_1(x), f_2(x),\ldots, f_m(x)) \] where \(x^*\) is the solution in the decision space \(X\in\mathbb{R}^n\), and \(f:X\rightarrow\mathbb{R}^m\) is an \(m\)-dimensional vector - valued objective function. - **Pareto dominance relationship**: \[ x\prec x'\iff\forall i\in [m], f_i(x)\leq f_i(x')\text{ and }\exists j\in [m]: f_j(x)< f_j(x') \] - **Weighted - sum scalarization function**: \[ \Omega_{ws}(x, w)=\sum_{i = 1}^m w_i f_i(x) \] For a given preference vector \(w'\), the corresponding Pareto optimal solution can be found by solving the following optimization problem: \[ x^*(w')=\arg\min_{x\in X}\Omega_{ws}(x, w') \] - **Bi - level optimization problem**: \[ w^*\leftarrow\arg\min_w\Omega(x, w; \theta) \] \[ \theta^*\leftarrow\arg\min_\theta\Omega(x, w; \theta) \] Through these methods, PO - PSL aims to provide a more efficient and accurate Pareto set learning method, especially suitable for complex black - box multi - objective optimization problems.