Minimax Optimal Simple Regret in Two-Armed Best-Arm Identification

Masahiro Kato
2024-12-24
Abstract: This study investigates an asymptotically minimax optimal algorithm in the two-armed fixed-budget best-arm identification (BAI) problem. Given two treatment arms, the objective is to identify the arm with the highest expected outcome through an adaptive experiment. We focus on the Neyman allocation, where treatment arms are allocated following the ratio of their outcome standard deviations. Our primary contribution is to prove the minimax optimality of the Neyman allocation for the simple regret, defined as the difference between the expected outcomes of the true best arm and the estimated best arm. Specifically, we first derive a minimax lower bound for the expected simple regret, which characterizes the worst-case performance achievable under the location-shift distributions, including Gaussian distributions. We then show that the simple regret of the Neyman allocation asymptotically matches this lower bound, including the constant term, not just the rate in terms of the sample size, under the worst-case distribution. Notably, our optimality result holds without imposing locality restrictions on the distribution, such as the local asymptotic normality. Furthermore, we demonstrate that the Neyman allocation reduces to the uniform allocation, i.e., the standard randomized controlled trial, under Bernoulli distributions.
Machine Learning, Econometrics, Statistics Theory, Applications
What problem does this paper attempt to address?
This paper studies how to identify the treatment arm with the highest expected outcome through an adaptive experiment in the two-armed fixed-budget best-arm identification (BAI) problem. Its focus is the performance of the Neyman allocation, which assigns the two treatment arms in proportion to their outcome standard deviations.

#### Research Background

Fixed-budget best-arm identification has been widely studied in machine learning, operations research, economics, and epidemiology. The goal is to find the best treatment arm through an adaptive experiment with a limited number of samples (the budget). When the outcome variances of the treatment arms are unknown, the choice of an optimal algorithm becomes complicated.

#### Main Problems

The paper addresses the following problems:

1. **Asymptotic Minimax Optimality of the Neyman Allocation**: Prove that the Neyman allocation is asymptotically minimax optimal in terms of simple regret, defined as the difference between the expected outcomes of the true best arm and the estimated best arm.
2. **Minimax Lower Bound**: Derive a minimax lower bound for the expected simple regret under the worst-case distribution.
3. **Matching Upper Bound**: Prove that the simple regret of the Neyman allocation asymptotically matches this lower bound, not only in the rate with respect to the sample size but also in the constant term.

#### Specific Contributions

1. **Derivation of the Minimax Lower Bound**: The author derives a minimax lower bound for the simple regret in the worst case among all distributions with fixed variance.
2. **Proof of Optimality of the Neyman Allocation**: The simple regret of the Neyman allocation is shown to match this lower bound asymptotically, so the upper and lower bounds coincide.
3. **No Locality Restrictions**: The results hold without imposing locality restrictions on the distribution, such as local asymptotic normality.
4. **Special Case under Bernoulli Distributions**: Under Bernoulli distributions, the Neyman allocation reduces to the uniform allocation (i.e., the standard randomized controlled trial), so the two have the same performance in terms of simple regret.

### Summary

By analyzing the Neyman allocation combined with the augmented inverse probability weighting (AIPW) estimator, this paper gives an asymptotically minimax optimal solution to the two-armed fixed-budget best-arm identification problem when the variances are unknown. The result provides a theoretical benchmark that is exact up to the constant term, and it offers practical guidance for experiment design: with Bernoulli outcomes, for example, a standard randomized controlled trial is already optimal in this sense.
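To make the allocation rule and the regret measure concrete, here is a minimal Python sketch. It is not the paper's algorithm: it assumes Gaussian outcomes, estimates the standard deviations from an equal-split pilot phase, and recommends an arm by plain sample means rather than the AIPW estimator. The function names (`neyman_weights`, `sample_sd`, `run_experiment`) and the `pilot` parameter are hypothetical and chosen only for illustration.

```python
import math
import random


def neyman_weights(sd1, sd2):
    """Allocation shares proportional to each arm's outcome standard deviation."""
    total = sd1 + sd2
    if total == 0:
        return 0.5, 0.5  # degenerate case: no variation, split evenly
    return sd1 / total, sd2 / total


def sample_sd(xs):
    """Unbiased-variance sample standard deviation (needs at least two points)."""
    mean = sum(xs) / len(xs)
    return math.sqrt(sum((x - mean) ** 2 for x in xs) / (len(xs) - 1))


def run_experiment(means, sds, budget, pilot=50, rng=None):
    """Simplified two-stage experiment (an assumption, not the paper's procedure).

    Stage 1 draws `pilot` samples per arm; stage 2 spends the remaining budget
    so that the total counts follow the estimated Neyman allocation.
    Returns (chosen_arm, simple_regret, per_arm_sample_counts).
    """
    if budget < 2 * pilot:
        raise ValueError("budget must cover the pilot phase for both arms")
    rng = rng or random.Random(0)

    # Stage 1: equal-split pilot used to estimate the standard deviations.
    draws = [[rng.gauss(means[a], sds[a]) for _ in range(pilot)] for a in (0, 1)]
    w1, _ = neyman_weights(sample_sd(draws[0]), sample_sd(draws[1]))

    # Stage 2: target total counts, clipped so neither arm drops below the pilot size.
    n1 = min(max(round(w1 * budget), pilot), budget - pilot)
    targets = (n1, budget - n1)
    for a in (0, 1):
        draws[a].extend(rng.gauss(means[a], sds[a]) for _ in range(targets[a] - pilot))

    # Recommend the arm with the larger sample mean (the paper uses AIPW instead).
    estimates = [sum(d) / len(d) for d in draws]
    chosen = 0 if estimates[0] >= estimates[1] else 1

    # Simple regret: expected outcome of the true best arm minus that of the chosen arm.
    regret = max(means) - means[chosen]
    return chosen, regret, (len(draws[0]), len(draws[1]))
```

With standard deviations 1 and 3, `neyman_weights(1.0, 3.0)` gives shares 0.25 and 0.75, so the noisier arm receives three quarters of the budget. For two Bernoulli arms with equal means the standard deviations coincide and the shares are 0.5 each, which illustrates how the allocation can collapse to a uniform split. Averaging the `regret` returned by `run_experiment` over many seeds gives a Monte Carlo estimate of the expected simple regret for a chosen pair of distributions.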