Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers Across Diseases

Yumeng Yang,Ashley Gilliam,Ethan B Ludmir,Kirk Roberts
2024-03-26
Abstract:Clinical trials are pivotal in medical research, and NLP can enhance their success, with application in recruitment. This study aims to evaluate the generalizability of eligibility classification across a broad spectrum of clinical trials. Starting with phase 3 cancer trials, annotated with seven eligibility exclusions, then to determine how well models can generalize to non-cancer and non-phase 3 trials. To assess this, we have compiled eligibility criteria data for five types of trials: (1) additional phase 3 cancer trials, (2) phase 1 and 2 cancer trials, (3) heart disease trials, (4) type 2 diabetes trials, and (5) observational trials for any disease, comprising 2,490 annotated eligibility criteria across seven exclusion types. Our results show that models trained on the extensive cancer dataset can effectively handle criteria commonly found in non-cancer trials, such as autoimmune diseases. However, they struggle with criteria disproportionately prevalent in cancer trials, like prior malignancy. We also experiment with few-shot learning, demonstrating that a limited number of disease-specific examples can partially overcome this performance gap. We are releasing this new dataset of annotated eligibility statements to promote the development of cross-disease generalization in clinical trial classification.
Computation and Language,Machine Learning,Quantitative Methods
What problem does this paper attempt to address?
This paper discusses the generalization ability of clinical trial eligibility criteria classifiers among different diseases. The study focuses on evaluating whether models trained on a large amount of cancer data can effectively handle common criteria in non-cancer clinical trials, and how to improve classification performance through few-shot learning in the case of a small number of specific disease examples. The authors created a cross-disease clinical trial dataset containing 2490 annotated eligibility criteria and analyzed the performance of the BERT model in this regard. The results showed that while the model performed well in handling criteria for certain non-cancer diseases (such as autoimmune diseases), it faced challenges in handling criteria highly related to cancer (such as prior malignancy). The performance of the model was improved through few-shot learning. This study released a new annotated dataset to promote the development of clinical trial classification methods for cross-disease generalization, aiming to support the recruitment work of clinical trials.