Abstract:Abstract Background We suggest an adaptive sample size calculation method for developing clinical prediction models, in which model performance is monitored sequentially as new data comes in. Methods We illustrate the approach using data for the diagnosis of ovarian cancer ( n = 5914, 33% event fraction) and obstructive coronary artery disease (CAD; n = 4888, 44% event fraction). We used logistic regression to develop a prediction model consisting only of a priori selected predictors and assumed linear relations for continuous predictors. We mimicked prospective patient recruitment by developing the model on 100 randomly selected patients, and we used bootstrapping to internally validate the model. We sequentially added 50 random new patients until we reached a sample size of 3000 and re-estimated model performance at each step. We examined the required sample size for satisfying the following stopping rule: obtaining a calibration slope ≥ 0.9 and optimism in the c-statistic (or AUC) < = 0.02 at two consecutive sample sizes. This procedure was repeated 500 times. We also investigated the impact of alternative modeling strategies: modeling nonlinear relations for continuous predictors and correcting for bias on the model estimates (Firth’s correction). Results Better discrimination was achieved in the ovarian cancer data (c-statistic 0.9 with 7 predictors) than in the CAD data (c-statistic 0.7 with 11 predictors). Adequate calibration and limited optimism in discrimination was achieved after a median of 450 patients (interquartile range 450–500) for the ovarian cancer data (22 events per parameter (EPP), 20–24) and 850 patients (750–900) for the CAD data (33 EPP, 30–35). A stricter criterion, requiring AUC optimism < = 0.01, was met with a median of 500 (23 EPP) and 1500 (59 EPP) patients, respectively. These sample sizes were much higher than the well-known 10 EPP rule of thumb and slightly higher than a recently published fixed sample size calculation method by Riley et al. Higher sample sizes were required when nonlinear relationships were modeled, and lower sample sizes when Firth’s correction was used. Conclusions Adaptive sample size determination can be a useful supplement to fixed a priori sample size calculations, because it allows to tailor the sample size to the specific prediction modeling context in a dynamic fashion.

Sample size determination for prediction models via learning‐type curves

Calculating the sample size required for developing a clinical prediction model

Adaptive sample size determination for the development of clinical prediction models

Spectroscopic and kinetic aspects of Elephas maximus hemoglobin.

Minimum sample size calculations for external validation of a clinical prediction model with a time‐to‐event outcome

Sample size for developing a prediction model with a binary outcome: targeting precise individual risk estimates to improve clinical decisions and fairness

Learning Curves for Drug Response Prediction in Cancer Cell Lines

A practical solution to estimate the sample size required for clinical prediction models generated from observational research on data

Decision Curve Analysis: a Technical Note

Sample size and power determination in joint modeling of longitudinal and survival data

Extension of a conditional performance score for sample size recalculation rules to the setting of binary endpoints

Sample size for binary logistic prediction models: Beyond events per variable criteria

An imputation method for estimating the learning curve in classification problems

Minimum sample size for developing a multivariable prediction model using multinomial logistic regression

Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review

Sample size estimation for heterogeneous growth curve models with attrition

Effective sample size: A measure of individual uncertainty in predictions

Power and Sample Size for Dose-Finding Studies with Survival Endpoints under Model Uncertainty.

Sample size planning for classification models

Sample size calculation for two‐arm trials with time‐to‐event endpoint for nonproportional hazards using the concept of Relative Time when inference is built on comparing Weibull distributions

A Practical Simulation Method to Calculate Sample Size of Group Sequential Trials for Time-to-Event Data under Exponential and Weibull Distribution