Optimal allocation of sample size for randomization-based inference from 2^K factorial designs

Arun Ravichandran, Nicole E. Pashley, Brian Libgober, Tirthankar Dasgupta
DOI: https://doi.org/10.1515/jci-2023-0046
2024-01-01
Journal of Causal Inference
Abstract: Optimizing the allocation of units into treatment groups can help researchers improve the precision of causal estimators and decrease costs when running factorial experiments. However, existing optimal allocation results typically assume a super-population model and that the outcome data come from a known family of distributions. Instead, we focus on randomization-based causal inference for the finite-population setting, which does not require model specifications for the data or sampling assumptions. We propose exact theoretical solutions for optimal allocation in 2^K factorial experiments under complete randomization with A-, D-, and E-optimality criteria. We then extend this work to factorial designs with block randomization. We also derive results for optimal allocations when using cost-based constraints. To connect our theory to practice, we provide convenient integer-constrained programming solutions using a greedy optimization approach to find integer optimal allocation solutions for both complete and block randomizations. The proposed methods are demonstrated using two real-life factorial experiments conducted by social scientists.
mathematics, interdisciplinary applications; social sciences, mathematical methods
What problem does this paper attempt to address?
This paper addresses how to allocate units across treatment groups in 2^K factorial designs so as to improve the precision of randomization-based inference and reduce costs. It works in the finite-population setting, where randomization-based causal inference requires neither a model specification for the data nor sampling assumptions. The authors derive exact theoretical solutions for the optimal allocation in 2^K factorial experiments under complete randomization, using the A-, D-, and E-optimality criteria. They then extend these results to factorial designs with block randomization and derive optimal allocations under cost-based constraints. To connect theory with practice, they provide integer-constrained programming solutions that use a greedy optimization approach to find integer optimal allocations for both complete and block randomization. The proposed methods are demonstrated on two real-life factorial experiments conducted by social scientists.
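To make the idea of a greedy integer allocation concrete, here is a minimal sketch. It is not the paper's algorithm. It assumes an A-optimality-style objective of the form sum over arms of S_z^2 / n_z, where S_z^2 is the (assumed known or estimated) outcome variance in treatment arm z and n_z is the number of units assigned to it. The function name `greedy_allocation`, its parameters, and the minimum of two units per arm are all hypothetical choices made for illustration.

```python
import heapq


def greedy_allocation(variances, total_units, min_per_arm=2):
    """Greedily assign integer sample sizes to treatment arms.

    Assumed objective (illustrative only): minimize sum_z variances[z] / n_z
    subject to sum_z n_z == total_units and n_z >= min_per_arm.
    """
    num_arms = len(variances)
    if total_units < num_arms * min_per_arm:
        raise ValueError("not enough units to give every arm the minimum")

    # Start every arm at the minimum size.
    alloc = [min_per_arm] * num_arms

    # Adding one unit to arm z lowers the objective by
    # S2/n - S2/(n+1) = S2 / (n * (n + 1)).
    # heapq is a min-heap, so store the negated gain.
    heap = [
        (-s2 / (min_per_arm * (min_per_arm + 1)), z)
        for z, s2 in enumerate(variances)
    ]
    heapq.heapify(heap)

    # Hand out the remaining units one at a time to the arm with the
    # largest marginal reduction in the objective.
    for _ in range(total_units - num_arms * min_per_arm):
        _, z = heapq.heappop(heap)
        alloc[z] += 1
        n = alloc[z]
        heapq.heappush(heap, (-variances[z] / (n * (n + 1)), z))

    return alloc


# A 2^2 design (four arms) with 20 units; arm 0 is twice as variable
# in standard-deviation terms as the others.
print(greedy_allocation([4.0, 1.0, 1.0, 1.0], 20))  # -> [8, 4, 4, 4]
```

In this example the result matches the continuous solution for the assumed objective, in which n_z is proportional to the standard deviation S_z (here 2 : 1 : 1 : 1). Because each term S_z^2 / n_z is convex in n_z, adding units one at a time by largest marginal gain reaches an integer minimizer of this particular objective; the D- and E-optimality criteria and the blocked or cost-constrained settings mentioned in the abstract would need different objectives.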