Abstract:This study investigates the experimental design problem for identifying the arm with the highest expected outcome, referred to as best arm identification (BAI). In our experiments, the number of treatment-allocation rounds is fixed. During each round, a decision-maker allocates an arm and observes a corresponding outcome, which follows a Gaussian distribution with variances that can differ among the arms. At the end of the experiment, the decision-maker recommends one of the arms as an estimate of the best arm. To design an experiment, we first discuss lower bounds for the probability of misidentification. Our analysis highlights that the available information on the outcome distribution, such as means (expected outcomes), variances, and the choice of the best arm, significantly influences the lower bounds. Because available information is limited in actual experiments, we develop a lower bound that is valid under the unknown means and the unknown choice of the best arm, which are referred to as the worst-case lower bound. We demonstrate that the worst-case lower bound depends solely on the variances of the outcomes. Then, under the assumption that the variances are known, we propose the Generalized-Neyman-Allocation (GNA)-empirical-best-arm (EBA) strategy, an extension of the Neyman allocation proposed by Neyman (1934). We show that the GNA-EBA strategy is asymptotically optimal in the sense that its probability of misidentification aligns with the lower bounds as the sample size increases infinitely and the differences between the expected outcomes of the best and other suboptimal arms converge to the same values across arms. We refer to such strategies as asymptotically worst-case optimal.

Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds

Fixed Confidence Best Arm Identification in the Bayesian Setting

Minimax Optimal Simple Regret in Two-Armed Best-Arm Identification

UCB Exploration for Fixed-Budget Bayesian Best Arm Identification

Best Arm Identification in Stochastic Bandits: Beyond $β-$optimality

Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances

Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget

Best Arm Identification with Minimal Regret

Simple Bayesian Algorithms for Best Arm Identification

Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms

Generalized Neyman Allocation for Locally Minimax Optimal Best-Arm Identification

Best Arm Identification with Fixed Budget: A Large Deviation Perspective

Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits

Near-Optimal Algorithm for Non-Stationary Kernelized Bandits

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap

Best Arm Identification in Bandits with Limited Precision Sampling

Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

Optimal Multi-Fidelity Best-Arm Identification