Abstract:Background: Stroke is a prevalent disease with a significant global impact. Effective assessment of stroke severity is vital for an accurate diagnosis, appropriate treatment, and optimal clinical outcomes. The National Institutes of Health Stroke Scale (NIHSS) is a widely used scale for quantitatively assessing stroke severity. However, the current manual scoring of NIHSS is labor-intensive, time-consuming, and sometimes unreliable. Applying artificial intelligence (AI) techniques to automate the quantitative assessment of stroke on vast amounts of electronic health records (EHRs) has attracted much interest. Objective: This study aims to develop an automatic, quantitative stroke severity assessment framework through automating the entire NIHSS scoring process on Chinese clinical EHRs. Methods: Our approach consists of two major parts: Chinese clinical named entity recognition (CNER) with a domain-adaptive pre-trained large language model (LLM) and automated NIHSS scoring. To build a high-performing CNER model, we first construct a stroke-specific, densely annotated dataset "Chinese Stroke Clinical Records" (CSCR) from EHRs provided by our partner hospital, based on a stroke ontology that defines semantically related entities for stroke assessment. We then pre-train a Chinese clinical LLM coined "CliRoberta" through domain-adaptive transfer learning and construct a deep learning-based CNER model that can accurately extract entities directly from Chinese EHRs. Finally, an automated, end-to-end NIHSS scoring pipeline is proposed by mapping the extracted entities to relevant NIHSS items and values, to quantitatively assess the stroke severity. Results: Results obtained on a benchmark dataset CCKS2019 and our newly created CSCR dataset demonstrate the superior performance of our domain-adaptive pre-trained LLM and the CNER model, compared with the existing benchmark LLMs and CNER models. The high F1 score of 0.990 ensures the reliability of our model in accurately extracting the entities for the subsequent automatic NIHSS scoring. Subsequently, our automated, end-to-end NIHSS scoring approach achieved excellent inter-rater agreement (0.823) and intraclass consistency (0.986) with the ground truth and significantly reduced the processing time from minutes to a few seconds. Conclusion: Our proposed automatic and quantitative framework for assessing stroke severity demonstrates exceptional performance and reliability through directly scoring the NIHSS from diagnostic notes in Chinese clinical EHRs. Moreover, this study also contributes a new clinical dataset, a pre-trained clinical LLM, and an effective deep learning-based CNER model. The deployment of these advanced algorithms can improve the accuracy and efficiency of clinical assessment, and help improve the quality, affordability and productivity of healthcare services.

PCV74 NATURAL LANGUAGE PROCESSING-ASSISTED RETROSPECTIVE SCORING OF NIH STROKE SCALE IN CHINA

Automatic quantitative stroke severity assessment based on Chinese clinical named entity recognition with domain-adaptive pre-trained large language model

Abstract P558: Identification of Embolic Stroke in Patients with Large Vessel Occlusion

Rationale and Design of Individualized Quality Improvement Based on the Computer Analysing System to Improve Stroke Management Quality Evaluation (CASE): a Multicenter Historically Controlled Study

Automated Extraction of Stroke Severity From Unstructured Electronic Health Records Using Natural Language Processing

Identifying stroke-related quantified evidence from electronic health records in real-world studies

Abstract MP15: Validation of Phenotyping Algorithms for Stroke from Electronic Health Records Using Natural Language Processing

A query interface for clinical research with Chinese electronic health record using Natural Language Processing

Automating Stroke Data Extraction From Free-Text Radiology Reports Using Natural Language Processing: Instrument Validation Study

Reliability and validity of a graphical computerized adaptive test Longshi scale for rapid assessment of activities of daily living in stroke survivors

Natural Language Processing for the Identification of Silent Brain Infarcts From Neuroimaging Reports

Using Natural Language Processing to Extract Clinically Useful Information from Chinese Electronic Medical Records

Mining Clinical Notes for Physical Rehabilitation Exercise Information: Natural Language Processing Algorithm Development and Validation Study

The reliability and validity of a slightly revised Chinese version simplified modified Rankin scale questionnaire

Production and validation of Putonghua- and Cantonese-Chinese language National Institutes of Health Stroke Scale training and certification videos

Understanding the performance and reliability of NLP tools: a comparison of four NLP tools predicting stroke phenotypes in radiology reports

The reliability and validity of a slightly optimized Chinese version simplified modified Rankin scale questionnaire

The Reliability and Validity of a Novel Chinese Version Simplified Modified Rankin Scale Questionnaire(2011)

Study of Relationship of NIHSS and TCM Standardized Sheet of Apoplexy Syndromes Diagnosis base on LVQ Neural Networks

Abstract P259: Using Natural Language Processing and Machine Learning to Identify Incident Stroke from Electronic Health Records

Validation of NINDS-CSN neuropsychological battery for vascular cognitive impairment in Chinese stroke patients