Illusory generalizability of clinical prediction models

Adam M. Chekroud, Matt Hawrilenko, Hieronimus Loho, Julia Bondar, Ralitza Gueorguieva, Alkomiet Hasan, Joseph Kambeitz, Philip R. Corlett, Nikolaos Koutsouleris, Harlan M. Krumholz, John H. Krystal, Martin Paulus
DOI: https://doi.org/10.1126/science.adg8538
IF: 56.9
2024-01-12
Science
Abstract: It is widely hoped that statistical models can improve decision-making related to medical treatments. Because of the cost and scarcity of medical outcomes data, this hope is typically based on investigators observing a model's success in one or two datasets or clinical contexts. We scrutinized this optimism by examining how well a machine learning model performed across several independent clinical trials of antipsychotic medication for schizophrenia. Models predicted patient outcomes with high accuracy within the trial in which the model was developed but performed no better than chance when applied out-of-sample. Pooling data across trials to predict outcomes in the trial left out did not improve predictions. These results suggest that models predicting treatment outcomes in schizophrenia are highly context-dependent and may have limited generalizability.
multidisciplinary sciences