Applying Large Language Models for Causal Structure Learning in Non Small Cell Lung Cancer

Narmada Naik,Ayush Khandelwal,Mohit Joshi,Madhusudan Atre,Hollis Wright,Kavya Kannan,Scott Hill,Giridhar Mamidipudi,Ganapati Srinivasa,Carlo Bifulco,Brian Piening,Kevin Matlock
DOI: https://doi.org/10.48550/arXiv.2311.07191
2023-11-13
Abstract:Causal discovery is becoming a key part in medical AI research. These methods can enhance healthcare by identifying causal links between biomarkers, demographics, treatments and outcomes. They can aid medical professionals in choosing more impactful treatments and strategies. In parallel, Large Language Models (LLMs) have shown great potential in identifying patterns and generating insights from text data. In this paper we investigate applying LLMs to the problem of determining the directionality of edges in causal discovery. Specifically, we test our approach on a deidentified set of Non Small Cell Lung Cancer(NSCLC) patients that have both electronic health record and genomic panel data. Graphs are validated using Bayesian Dirichlet estimators using tabular data. Our result shows that LLMs can accurately predict the directionality of edges in causal graphs, outperforming existing state-of-the-art methods. These findings suggests that LLMs can play a significant role in advancing causal discovery and help us better understand complex systems.
Artificial Intelligence,Machine Learning,Applications
What problem does this paper attempt to address?