Abstract: Quality-Diversity (QD) algorithms have exhibited promising results across many domains and applications. However, uncertainty in fitness and behaviour estimations of solutions remains a major challenge when QD is used in complex real-world applications. While several approaches have been proposed to improve the performance in uncertain applications, many fail to address a key challenge: determining how to prioritise solutions that perform consistently under uncertainty, in other words, solutions that are reproducible. Most prior methods improve fitness and reproducibility jointly, ignoring the possibility that they could be contradictory objectives. For example, in robotics, solutions may reliably walk at 90% of the maximum velocity in uncertain environments, while solutions that walk faster are also more prone to falling over. As this is a trade-off, neither one of these two solutions is "better" than the other. Thus, algorithms cannot intrinsically select one solution over the other, but can only enforce given preferences over these two contradictory objectives. In this paper, we formalise this problem as the performance-reproducibility trade-off for uncertain QD. We propose four new a-priori QD algorithms that find optimal solutions for given preferences over the trade-offs. We also propose an a-posteriori QD algorithm for when these preferences cannot be defined in advance. Our results show that our approaches successfully find solutions that satisfy given preferences. Importantly, by simply accounting for this trade-off, our approaches perform better than existing uncertain QD methods. This suggests that considering the performance-reproducibility trade-off unlocks important stepping stones that are usually missed when only performance is optimised.


### What problem does this paper attempt to solve?

This paper explores the trade-off between performance and reproducibility in Quality-Diversity (QD) algorithms applied to uncertain environments. Specifically, the paper makes the following points:

1. **Challenges of QD algorithms in complex real-world applications**:
   - In uncertain environments, QD algorithms have difficulty accurately estimating the quality and behavioural characteristics of solutions.
   - The performance and reproducibility of solutions can conflict: for example, in robot control tasks, solutions that walk fast may be more likely to fail, while slower but more stable solutions are more reliable.
2. **Limitations of existing methods**:
   - Existing methods usually improve performance and reproducibility jointly, ignoring that the two may be conflicting objectives.
   - These methods implicitly assume that solutions with both high performance and high reproducibility exist. In practice such solutions may not exist, so the trade-off between performance and reproducibility needs to be considered.
3. **New problem proposed**:
   - The paper formalises a new problem: how to handle the performance-reproducibility trade-off in uncertain environments. This problem is defined as UQD-Problem 3.
4. **Solutions**:
   - The paper proposes four new a-priori QD algorithms and one a-posteriori QD algorithm to handle this trade-off.
   - These algorithms either find the optimal solutions for user-specified preferences (a-priori), or return a set of different solutions for the user to choose from (a-posteriori).
5. **Experimental verification**:
   - The experimental results show that these new methods not only find solutions that satisfy the given performance-reproducibility preferences, but also outperform existing UQD methods in performance estimation and reproducibility maximization.
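
The difference between enforcing a preference in advance (a-priori) and returning a set of trade-off solutions (a-posteriori) can be illustrated with a small Python sketch. This is not one of the paper's algorithms: the weighted-sum preference, the function names `pick_a_priori` and `pareto_front`, and the candidate values are all assumptions made for illustration, loosely inspired by the walking-robot example in the abstract.

```python
from typing import List, Tuple

# Hypothetical candidate: (name, performance, reproducibility).
# Both scores are assumed to be "higher is better".
Candidate = Tuple[str, float, float]


def pick_a_priori(cands: List[Candidate], w: float) -> Candidate:
    """A-priori selection: the preference weight w is fixed in advance.

    Assumption: the preference is a weighted sum, with w in [0, 1]
    giving the importance of performance relative to reproducibility.
    """
    return max(cands, key=lambda c: w * c[1] + (1.0 - w) * c[2])


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """A-posteriori selection: keep every non-dominated candidate."""

    def dominates(a: Candidate, b: Candidate) -> bool:
        # a dominates b if it is no worse on both scores and better on one.
        return a[1] >= b[1] and a[2] >= b[2] and (a[1] > b[1] or a[2] > b[2])

    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Illustrative values only, not results from the paper.
candidates: List[Candidate] = [
    ("fast_but_fragile", 1.00, 0.60),
    ("steady", 0.90, 0.95),
    ("slow_and_fragile", 0.50, 0.50),
]

performance_first = pick_a_priori(candidates, w=0.9)
balanced = pick_a_priori(candidates, w=0.5)
front = pareto_front(candidates)
```

With these made-up numbers, a strong preference for performance selects the fast solution, a balanced preference selects the steady one, and the Pareto front keeps both while discarding the candidate that is worse on both scores.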
### Summary

The purpose of this paper is to address the trade-off between performance and reproducibility in QD algorithms in uncertain environments, and to propose a new algorithmic framework to meet this challenge. By introducing the concept of the performance-reproducibility trade-off, the paper provides new directions and tools for future QD research.

### Formula representation

The formulas involved in the paper are written in LaTeX as follows:

1. **UQD optimization objective**:

   \[ \max_{A} \left\{ \sum_{i \in A} P[f_i \sim D_f[f_i]] \right\} \]

   where \( P \) represents an estimator of performance (such as the mean or median), and \( cell_i \) is the feature interval of solution \( i \) in the QD archive \( A \).

2. **Optimization objective after introducing the reproducibility constraint**:

   \[ \max_{A} \left\{ \sum_{i \in A} P[f_i \sim D_f[f_i]] \right\} \]

   \[ \text{s.t.} \quad \forall i \in A, \ P[d_i \sim D_d[d_i]] \in cell_i \]

   \[ \text{s.t.} \quad \forall i \in A, \ \max R[d_i \sim D_d[d_i]] \]

   where \( R \) represents an estimator of reproducibility (such as the standard deviation).

3. **Optimization objective considering performance-reproducibility preferences**:

   \[ \max_{A} \left\{ \sum_{i \in A} P[f_i \sim D_f[f_i]] \right\} \]

   \[ \text{s.t.} \quad \forall i \in A, \ P[d_i \sim D_d[d_i]] \in cell_i \]

   \[ \text{s.t.} \quad \forall i \in A, \ \dots \]
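
As a rough illustration of how the estimators \( P \) and \( R \) in the formulas above could be computed from repeated evaluations, the following Python sketch uses the mean as \( P \) and the negated standard deviation of the descriptor samples as \( R \), so that a larger value means more reproducible. The `Solution` class, the one-dimensional descriptor, the sign convention for \( R \), and all sample values are assumptions for illustration and are not taken from the paper.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import Dict, List, Tuple


@dataclass
class Solution:
    """Hypothetical archive entry with resampled evaluations."""
    cell: Tuple[float, float]        # feature interval [low, high) of the cell
    fitness_samples: List[float]     # samples f_i ~ D_f
    descriptor_samples: List[float]  # samples d_i ~ D_d (1-D for simplicity)


def performance(samples: List[float]) -> float:
    """P: here the mean of the samples (the median is another option)."""
    return mean(samples)


def reproducibility(samples: List[float]) -> float:
    """R: assumed here to be the negated population standard deviation."""
    return -pstdev(samples)


def in_cell(sol: Solution) -> bool:
    """Constraint: the estimated descriptor must fall inside the cell."""
    low, high = sol.cell
    return low <= performance(sol.descriptor_samples) < high


def archive_score(archive: Dict[int, Solution]) -> float:
    """Sum of P over the fitness samples of all solutions meeting the constraint."""
    return sum(performance(s.fitness_samples) for s in archive.values() if in_cell(s))


# Illustrative values only.
archive = {
    0: Solution((0.0, 0.5), [1.0, 2.0, 3.0], [0.25, 0.25, 0.25]),
    1: Solution((0.5, 1.0), [4.0, 4.0], [0.75, 0.75]),
}
score = archive_score(archive)
spread = reproducibility([0.25, 0.75])
```

In this sketch a solution whose estimated descriptor falls outside its cell simply contributes nothing to the archive score. A real implementation would need to decide how to relocate or discard such solutions.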

Exploring the Performance-Reproducibility Trade-off in Quality-Diversity

Don't Bet on Luck Alone: Enhancing Behavioral Reproducibility of Quality-Diversity Solutions in Uncertain Domains

Benchmark tasks for Quality-Diversity applied to Uncertain domains

Illuminating the Diversity-Fitness Trade-Off in Black-Box Optimization

Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

Dynamic Quality-Diversity Search

Quality-Diversity with Limited Resources

Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization

Quality and Diversity Optimization: A Unifying Modular Framework

Quality-Diversity Algorithms Can Provably Be Helpful for Optimization

Multi-objective Optimization-based Selection for Quality-Diversity by Non-surrounded-dominated Sorting

Differentiable Quality Diversity

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

Runtime Analysis of Quality Diversity Algorithms

QED: Using Quality-Environment-Diversity to Evolve Resilient Robot Swarms

BOP-Elites, a Bayesian Optimisation Approach to Quality Diversity Search with Black-Box descriptor functions

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Objectives Are All You Need: Solving Deceptive Problems Without Explicit Diversity Maintenance

Procedural Content Generation through Quality Diversity

Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains

BOP-Elites, a Bayesian Optimisation algorithm for Quality-Diversity search