MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching

Ethan Cerami,Pavel Trukhanov,Morgan A. Paul,Michael J. Hassett,Irbaz B. Riaz,James Lindsay,Emily Mallaber,Harry Klein,Gufran Gungor,Matthew Galvin,Stephen C. Van Nostrand,Joyce Yu,Tali Mazor,Kenneth L. Kehl
2024-12-23
Abstract:Clinical trials drive improvements in cancer treatments and outcomes. However, most adults with cancer do not participate in trials, and trials often fail to enroll enough patients to answer their scientific questions. Artificial intelligence could accelerate matching of patients to appropriate clinical trials. Here, we describe the development and evaluation of the MatchMiner-AI pipeline for clinical trial searching and ranking. MatchMiner-AI focuses on matching patients to potential trials based on core criteria describing clinical "spaces," or disease contexts, targeted by a trial. It aims to accelerate the human work of identifying potential matches, not to fully automate trial screening. The pipeline includes modules for extraction of key information from a patient's longitudinal electronic health record; rapid ranking of candidate trial-patient matches based on embeddings in vector space; and classification of whether a candidate match represents a reasonable clinical consideration. Code and synthetic data are available at <a class="link-external link-https" href="https://huggingface.co/ksg-dfci/MatchMiner-AI" rel="external noopener nofollow">this https URL</a> . Model weights based on synthetic data are available at <a class="link-external link-https" href="https://huggingface.co/ksg-dfci/TrialSpace" rel="external noopener nofollow">this https URL</a> and <a class="link-external link-https" href="https://huggingface.co/ksg-dfci/TrialChecker" rel="external noopener nofollow">this https URL</a> . A simple cancer clinical trial search engine to demonstrate pipeline components is available at <a class="link-external link-https" href="https://huggingface.co/spaces/ksg-dfci/trial_search_alpha" rel="external noopener nofollow">this https URL</a> .
Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?