A review of Bayesian perspectives on sample size derivation for confirmatory trials

Kevin Kunzmann,Michael J. Grayling,Kim May Lee,David S. Robertson,Kaspar Rufibach,James M. S. Wason
DOI: https://doi.org/10.1080/00031305.2021.1901782
2020-06-29
Abstract:Sample size derivation is a crucial element of the planning phase of any confirmatory trial. A sample size is typically derived based on constraints on the maximal acceptable type I error rate and a minimal desired power. Here, power depends on the unknown true effect size. In practice, power is typically calculated either for the smallest relevant effect size or a likely point alternative. The former might be problematic if the minimal relevant effect is close to the null, thus requiring an excessively large sample size. The latter is dubious since it does not account for the a priori uncertainty about the likely alternative effect size. A Bayesian perspective on the sample size derivation for a frequentist trial naturally emerges as a way of reconciling arguments about the relative a priori plausibility of alternative effect sizes with ideas based on the relevance of effect sizes. Many suggestions as to how such `hybrid' approaches could be implemented in practice have been put forward in the literature. However, key quantities such as assurance, probability of success, or expected power are often defined in subtly different ways in the literature. Starting from the traditional and entirely frequentist approach to sample size derivation, we derive consistent definitions for the most commonly used `hybrid' quantities and highlight connections, before discussing and demonstrating their use in the context of sample size derivation for clinical trials.
Statistics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to more reasonably determine the sample size in confirmatory trials. Specifically, it explores how to incorporate the Bayesian perspective to deal with prior uncertainty regarding the effect size during the sample size determination process. Traditional methods are usually based on the maximum acceptable Type I error rate (i.e., the false - positive rate) and the minimum desired test power (i.e., the probability of detecting a true effect), but these methods have some limitations in practical applications: 1. **The problem of the minimum clinically important difference (MCID)**: If the minimum clinically important difference is close to the null hypothesis value, the required sample size may be extremely large, which is often not feasible in practice. 2. **The problem of the choice of point alternative hypothesis**: The traditional approach is to choose a point alternative hypothesis to calculate the test power, but this does not take into account the prior uncertainty about the possible effect size. If the point alternative hypothesis is not properly chosen, it may lead to under - powered or over - powered trial designs. To solve these problems, the paper proposes a "hybrid" method, which combines the Bayesian view with the frequentist approach. In this way, it is possible to ensure that the trial has sufficient power while considering the relative prior probabilities of different effect sizes. Specifically, the paper discusses the following key concepts and their applications in sample size determination: - **Assurance**: This refers to the probability that the trial can be successful (i.e., reject the null hypothesis and the effect is relevant) given the prior distribution. - **Expected Power**: This refers to the average probability that the trial can reject the null hypothesis when the effect size is greater than the minimum clinically important difference. - **Probability of Success**: This refers to the overall probability that the trial can be successful (i.e., reject the null hypothesis and the effect is relevant). Through these concepts, the paper provides a framework that enables researchers to more reasonably determine the sample size while considering the prior uncertainty of the effect size, thereby increasing the success rate and cost - effectiveness of the trial.