Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing

Aokun Chen,Daniel Paredes,Zehao Yu,Xiwei Lou,Roberta Brunson,Jamie N. Thomas,Kimberly A. Martinez,Robert J. Lucero,Tanja Magoc,Laurence M. Solberg,Urszula A. Snigurska,Sarah E. Ser,Mattia Prosperi,Jiang Bian,Ragnhildur I. Bjarnadottir,Yonghui Wu
2023-04-01
Abstract:Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical narratives, has shown great potential in studies of delirium outcomes and symptoms. To assist in the diagnosis and phenotyping of delirium, we formed an expert panel to categorize diverse delirium symptoms, composed annotation guidelines, created a delirium corpus with diverse delirium symptoms, and developed NLP methods to extract delirium symptoms from clinical notes. We compared 5 state-of-the-art transformer models including 2 models (BERT and RoBERTa) from the general domain and 3 models (BERT_MIMIC, RoBERTa_MIMIC, and GatorTron) from the clinical domain. GatorTron achieved the best strict and lenient F1 scores of 0.8055 and 0.8759, respectively. We conducted an error analysis to identify challenges in annotating delirium symptoms and developing NLP systems. To the best of our knowledge, this is the first large language model-based delirium symptom extraction system. Our study lays the foundation for the future development of computable phenotypes and diagnosis methods for delirium.
Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to address is the identification of under-coded and unrecognized delirium symptoms in electronic health records (EHRs). Specifically, the authors aim to extract and classify delirium symptoms from clinical narratives using natural language processing (NLP) techniques to assist in the diagnosis and phenotyping of delirium. The main objectives of the paper include: 1. **Identification and classification of delirium symptoms**: Develop annotation guidelines through an expert panel, create a corpus containing various delirium symptoms, and develop NLP methods to extract these symptoms from clinical notes. 2. **Comparison of different NLP models' performance**: Evaluate the performance of five state-of-the-art transformer models (including general-domain BERT and RoBERTa, as well as clinical-domain BERT_MIMIC, RoBERTa_MIMIC, and GatorTron) in extracting delirium symptoms. 3. **Improving the accuracy of delirium diagnosis**: Lay the foundation for the development of future computable phenotypes and diagnostic methods by systematically extracting and classifying delirium symptoms. The paper points out that although delirium is a serious medical condition, it is often overlooked or improperly coded in patients' electronic health records due to its transient and diverse nature. Therefore, developing NLP systems that can accurately identify delirium symptoms is of great significance for improving the diagnosis and intervention of delirium.