Valid Randomization‐based P‐values for Partially Post Hoc Subgroup Analyses

Joseph J. Lee,Donald B. Rubin
DOI: https://doi.org/10.1002/sim.6531
2015-01-01
Statistics in Medicine
Abstract:By ‘partially post‐hoc’ subgroup analyses, we mean analyses that compare existing data from a randomized experiment—from which a subgroup specification is derived—to new, subgroup‐only experimental data. We describe a motivating example in which partially post hoc subgroup analyses instigated statistical debate about a medical device's efficacy. We clarify the source of such analyses' invalidity and then propose a randomization‐based approach for generating valid posterior predictive p‐values for such partially post hoc subgroups. Lastly, we investigate the approach's operating characteristics in a simple illustrative setting through a series of simulations, showing that it can have desirable properties under both null and alternative hypotheses. Copyright © 2015 John Wiley & Sons, Ltd.
What problem does this paper attempt to address?