FHIR-GPT Enhances Health Interoperability with Large Language Models

Yikuan Li, Hanyin Wang, Halid Z. Yerebakan, Yoshihisa Shinagawa, Yuan Luo
DOI: https://doi.org/10.1101/2023.10.17.23297028
2024-04-01
Abstract: Advancing health interoperability can significantly benefit health research, including phenotyping, clinical trial support, and public health surveillance. Federal agencies, including ONC, CDC, and CMS, have been collaborating to promote interoperability by adopting Fast Healthcare Interoperability Resources (FHIR). However, the heterogeneous structures and formats of health data present challenges when transforming Electronic Health Record (EHR) data into FHIR resources. This challenge becomes more significant when critical health information is embedded in unstructured data rather than well-organized structured formats. Previous studies relied on multiple separate rule-based or deep learning-based NLP tools to complete the FHIR resource transformation, an approach that demands substantial development costs, extensive training data, and meticulous integration of the individual tools. In this study, we assessed the ability of large language models (LLMs) to transform clinical narratives into HL7 FHIR resources. We developed FHIR-GPT specifically for the transformation of clinical texts into FHIR medication statement resources. In our experiments using 3,671 snippets of clinical texts, FHIR-GPT demonstrated an exceptional exact match rate of over 90%, surpassing the performance of existing methods. FHIR-GPT improved the exact match rates of existing NLP pipelines by 3% for routes, 12% for dose quantities, 35% for reasons, 42% for forms, and over 50% for timing schedules. Our findings provide the foundations for leveraging LLMs to enhance health data interoperability. Future studies will aim to build upon these successes by extending the generation to additional FHIR resources.
Health Informatics
What problem does this paper attempt to address?
This paper addresses health data interoperability, specifically how to convert unstructured clinical text in Electronic Health Records (EHR) into resources that comply with the Fast Healthcare Interoperability Resources (FHIR) standard. Two obstacles stand out: health data structures and formats are heterogeneous, and critical medical information is often embedded in unstructured text rather than structured fields. Existing methods chain together multiple rule-based or deep learning-based Natural Language Processing (NLP) tools to perform the conversion, which requires substantial development cost, extensive training data, and careful integration work. The paper instead uses Large Language Models (LLMs) for the conversion and develops a system called FHIR-GPT that converts clinical text into FHIR Medication Statement resources. In experiments on 3,671 snippets of clinical text, FHIR-GPT reached an exact match rate of over 90% and improved on existing NLP pipelines by 3% for routes, 12% for dose quantities, 35% for reasons, 42% for forms, and over 50% for timing schedules.
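To make the headline metric concrete, here is a minimal sketch of how an exact match rate over generated FHIR elements could be computed. It is an illustration under stated assumptions, not the authors' evaluation code: the function names `canonical` and `exact_match_rate`, the choice to compare key-sorted JSON serializations, and the sample dose quantities are all hypothetical. The only FHIR detail relied on is that a Quantity element carries `value` and `unit` fields.

```python
import json


def canonical(element):
    """Serialize a JSON-like element with sorted keys so key order does not matter."""
    return json.dumps(element, sort_keys=True, separators=(",", ":"))


def exact_match_rate(predicted, reference):
    """Return the fraction of positions where predicted and reference elements are identical."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        return 0.0
    matches = sum(canonical(p) == canonical(r) for p, r in zip(predicted, reference))
    return matches / len(reference)


# Hypothetical dose-quantity elements (illustrative data, not from the paper).
reference = [
    {"value": 500, "unit": "mg"},
    {"value": 1, "unit": "tablet"},
    {"value": 10, "unit": "mL"},
    {"value": 2, "unit": "puff"},
]
predicted = [
    {"unit": "mg", "value": 500},    # same content, different key order: counts as a match
    {"value": 1, "unit": "tablet"},
    {"value": 5, "unit": "mL"},      # wrong value: counts as a miss
    {"value": 2, "unit": "puff"},
]

rate = exact_match_rate(predicted, reference)
print(f"exact match rate: {rate:.2f}")
```

Comparing canonical serializations is a deliberately strict choice: any difference in a value, unit, or nested field counts as a miss, which is one plausible reading of "exact match". The paper's actual comparison procedure may differ.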