Subgroup analysis methods for time-to-event outcomes in heterogeneous randomized controlled trials

Valentine Perrin,Nathan Noiry,Nicolas Loiseau,Alex Nowak
2024-01-22
Abstract:Non-significant randomized control trials can hide subgroups of good responders to experimental drugs, thus hindering subsequent development. Identifying such heterogeneous treatment effects is key for precision medicine and many post-hoc analysis methods have been developed for that purpose. While several benchmarks have been carried out to identify the strengths and weaknesses of these methods, notably for binary and continuous endpoints, similar systematic empirical evaluation of subgroup analysis for time-to-event endpoints are lacking. This work aims to fill this gap by evaluating several subgroup analysis algorithms in the context of time-to-event outcomes, by means of three different research questions: Is there heterogeneity? What are the biomarkers responsible for such heterogeneity? Who are the good responders to treatment? In this context, we propose a new synthetic and semi-synthetic data generation process that allows one to explore a wide range of heterogeneity scenarios with precise control on the level of heterogeneity. We provide an open source Python package, available on Github, containing our generation process and our comprehensive benchmark framework. We hope this package will be useful to the research community for future investigations of heterogeneity of treatment effects and subgroup analysis methods benchmarking.
Machine Learning,Applications,Methodology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to identify subgroups that respond well to experimental drugs in heterogeneous randomized controlled trials, so as to promote the development of precision medicine. Specifically, the paper focuses on evaluating the effectiveness of several subgroup analysis algorithms in the context of time - to - event outcomes (such as survival time in survival analysis), and explores through three research questions: 1. Is there heterogeneity? 2. Which biomarkers cause this heterogeneity? 3. Who are the individuals that respond well to the treatment? To answer these questions, the authors propose a new synthetic data generation process that can precisely control the level of heterogeneity, and develop an open - source Python package `hte` for generating data and conducting a comprehensive benchmarking framework. This will help future researchers further explore the evaluation of treatment - effect heterogeneity and subgroup analysis methods.