Abstract:Background: Cluster randomized trials (CRTs) are randomized trials where randomization takes place at an administrative level (e.g., hospitals, clinics, or schools) rather than at the individual level. When the number of available clusters is small, researchers may not be able to rely on simple randomization to achieve balance on cluster-level covariates across treatment conditions. If these cluster-level covariates are predictive of the outcome, covariate imbalance may distort treatment effects, threaten internal validity, lead to a loss of power, and increase the variability of treatment effects. Covariate-constrained randomization (CR) is a randomization strategy designed to reduce the risk of imbalance in cluster-level covariates when performing a CRT. Existing methods for CR have been developed and evaluated for two- and multi-arm CRTs but not for factorial CRTs. Methods: Motivated by the BEGIN study-a CRT for weight loss among patients with pre-diabetes-we develop methods for performing CR in 2 × 2 factorial cluster randomized trials with a continuous outcome and continuous cluster-level covariates. We apply our methods to the BEGIN study and use simulation to assess the performance of CR versus simple randomization for estimating treatment effects by varying the number of clusters, the degree to which clusters are associated with the outcome, the distribution of cluster level covariates, the size of the constrained randomization space, and analysis strategies. Results: Compared to simple randomization of clusters, CR in the factorial setting is effective at achieving balance across cluster-level covariates between treatment conditions and provides more precise inferences. When cluster-level covariates are included in the analyses model, CR also results in greater power to detect treatment effects, but power is low compared to unadjusted analyses when the number of clusters is small. Conclusions: CR should be used instead of simple randomization when performing factorial CRTs to avoid highly imbalanced designs and to obtain more precise inferences. Except when there are a small number of clusters, cluster-level covariates should be included in the analysis model to increase power and maintain coverage and type 1 error rates at their nominal levels.

Blurring cluster randomized trials and observational studies using Two-Stage TMLE to address sub-sampling, missingness, and minimal independent units

Blurring cluster randomized trials and observational studies: Two-Stage TMLE for subsampling, missingness, and few independent units.

Two-Stage TMLE to reduce bias and improve efficiency in cluster randomized trials

Handling incomplete outcomes and covariates in cluster-randomized trials: doubly-robust estimation, efficiency considerations, and sensitivity analysis

Leveraging baseline covariates to analyze small cluster-randomized trials with a rare binary outcome

Analysis of cohort stepped wedge cluster-randomized trials with non-ignorable dropout via joint modeling

Leveraging contact network structure in the design of cluster randomized trials

Novel Methods for the Analysis of Stepped Wedge Cluster Randomized Trials

Using Power Analysis to Choose the Unit of Randomization, Outcome, and Approach for Subgroup Analysis for a Multilevel Randomized Controlled Clinical Trial to Reduce Disparities in Cardiovascular Health

Intent-to-treat Analysis of Cluster Randomized Trials when Clusters Report Unidentifiable Outcome Proportions.

Adjusting for Selection Bias Due to Missing Eligibility Criteria in Emulated Target Trials

Adaptive Clinical Trials: Exploiting Sequential Patient Recruitment and Allocation

Comparing cluster-level dynamic treatment regimens using sequential, multiple assignment, randomized trials: Regression estimation and sample size considerations

Using Longitudinal Targeted Maximum Likelihood Estimation in Complex Settings with Dynamic Interventions

Maximin optimal cluster randomized designs for assessing treatment effect heterogeneity

Biomarker-Guided Adaptive Enrichment Design with Threshold Detection for Clinical Trials with Time-to-Event Outcome

Handling missing data when estimating causal effects with Targeted Maximum Likelihood Estimation

Covariate-constrained randomization in cluster randomized 2 × 2 factorial trials: application to a diabetes prevention study

The symbolic two-step method applied to cancer care delivery research: Safeguarding against designing an underpowered cluster randomized trial with a continuous outcome by accounting for the imprecision in the within- and between-center variation

Design optimisation and post-trial analysis in group sequential stepped-wedge cluster randomised trials