PCV74 NATURAL LANGUAGE PROCESSING-ASSISTED RETROSPECTIVE SCORING OF NIH STROKE SCALE IN CHINA

C. Chen,H. He,L. Zheng,J. Ruan,F. Xiao,M. Liu,B. Bi,Y. Zhu
DOI: https://doi.org/10.1016/j.jval.2020.04.176
IF: 5.156
2020-01-01
Value in Health
Abstract:The NIH Stroke Scale (NIHSS) is a commonly used measure to assess stroke severity. The retrospective assessment on NIHSS has been developed in western country. The purpose of this research was to assess the validity and reliability of a Natural Language Processing (NLP)-assisted algorithm for NIHSS scoring in China. First of all, medical records with NIHSS score were identified from 5 hospitals in China. Based on the algorithm of retrospective scoring on NIHSS items, a list of feature words was identified in written admission/discharge history and physical/neurological examination notes. HLT Sonar system, which was a NLP system developed by HLT data scientist, fully scanned the EMR and automatically recognized and extracted all relevant feature words into structured eCRF. 2 investigators parallelly reviewed the eCRF and developed the score for each NIHSS item. Missing data were scored as normal.Multiple statistic models were used to assess the agreement between retrospective scores developed by NLP system (NLP-assisted score) and the score directly recorded in EMR (EMR score). Linear regression and the Intra-class Correlation Coefficient (ICC) was used to assess the level of agreement between NLP-assisted scores and EMR scores. A total of 275 cases were identified with NIHSS score recorded in EMR and developed the retrospective score with NLP system. Agreement between NLP-assisted scores and EMR scores was high (r2=0.78, P<0.001; single measures ICC= 0.88, P<0.001). The NLP-assisted retrospective scoring algorithm is reliable in the local context of China, while the Artificial Intelligence system could largely improve the efficiency of the process. A large number of cases should be further analyzed on the validity and reliability for the NLP-assisted scoring algorithm on real-world EMR data in China.
What problem does this paper attempt to address?