Automated ICD coding for primary diagnosis via clinically interpretable machine learning

Xiaolin Diao,Yanni Huo,Shuai Zhao,Jing Yuan,Meng Cui,Yuxin Wang,Xiaodan Lian,Wei Zhao
DOI: https://doi.org/10.1016/j.ijmedinf.2021.104543
IF: 4.73
2021-09-01
International Journal of Medical Informatics
Abstract:
Background: Computer-assisted clinical coding (CAC) based on automated coding algorithms has been expected to improve the International Classification of Disease, tenth version (ICD-10) coding quality and productivity, whereas studies oriented to primary diagnosis auto-coding are limited in the Chinese context.
Objective: This study aims at developing a machine learning (ML) model for automated primary diagnosis ICD-10 coding.
Methods: A total of 71709 admissions in Fuwai Hospital were included to carry out this study, corresponding to 168 primary diagnosis ICD-10 codes. Based on clinical implications, two feature engineering methods were used to process discharge diagnosis and procedure texts into sequential features and sequential grouping features respectively, from which two kinds of models were built and compared. One baseline model using one-hot encoding features was also considered. Light Gradient Boosting Machine (LightGBM) was adopted as the classifier, and grid search and cross-validation were used to select the optimal hyperparameters. SHapley Additive exPlanations (SHAP) values were applied to make the models interpretable.
Results: Our best prediction model was developed based on sequential grouping features. It showed good performance in the test phase, with accuracy and macro-averaged F1 (Macro-F1) of 95.2% and 88.3% respectively. The comparison of the models demonstrated the effectiveness of the sequential information and the grouping strategy in boosting model performance (P-value < 0.01). Subgroup analysis of the best model on each individual code showed that 91.1% of the codes achieved an F1 over 70.0%.
Conclusions: Our model has demonstrated its effectiveness for automated primary diagnosis coding in the Chinese context, and its results are interpretable. Hence, it has the potential to assist clinical coders in improving coding efficiency and quality in Chinese inpatient settings.
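The difference between the one-hot baseline and the two order-aware feature sets can be pictured with a small sketch. The abstract does not give the actual term vocabulary, the clinical groups, or the exact encoding, so everything below (the `VOCAB` list, the `GROUPS` mapping, and the choice to encode a term by its 1-based position in the discharge list) is an assumption made for illustration, not the authors' implementation.

```python
from typing import Dict, List

# Hypothetical vocabulary and grouping, for illustration only.
VOCAB = ["coronary heart disease", "hypertension", "diabetes", "heart failure"]
GROUPS = {
    "coronary heart disease": "ischemic",
    "heart failure": "cardiac function",
    "hypertension": "risk factor",
    "diabetes": "risk factor",
}


def one_hot(diagnoses: List[str]) -> Dict[str, int]:
    """Baseline: 1 if the term appears anywhere in the list; order is ignored."""
    present = set(diagnoses)
    return {term: int(term in present) for term in VOCAB}


def sequential(diagnoses: List[str]) -> Dict[str, int]:
    """1-based position of each term in the discharge list, 0 if absent."""
    position = {term: 0 for term in VOCAB}
    for rank, term in enumerate(diagnoses, start=1):
        if term in position and position[term] == 0:
            position[term] = rank
    return position


def sequential_grouping(diagnoses: List[str]) -> Dict[str, int]:
    """Earliest 1-based position at which any term of each group appears."""
    position = {group: 0 for group in sorted(set(GROUPS.values()))}
    for rank, term in enumerate(diagnoses, start=1):
        group = GROUPS.get(term)
        if group is not None and position[group] == 0:
            position[group] = rank
    return position


# Two admissions with the same diagnoses written in a different order.
a = ["coronary heart disease", "hypertension"]
b = ["hypertension", "coronary heart disease"]

print(one_hot(a) == one_hot(b))        # the baseline cannot tell them apart
print(sequential(a) == sequential(b))  # the positional encoding can
print(sequential_grouping(["diabetes", "hypertension", "heart failure"]))
```

Under these assumptions, grouping shrinks the feature space (two risk-factor terms collapse into one column) while keeping the ordering signal that the one-hot baseline discards.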
health care sciences & services,computer science, information systems,medical informatics
What problem does this paper attempt to address?
This paper develops a machine learning model for automated ICD-10 coding of the primary diagnosis, with a particular focus on Chinese inpatient settings. ICD is an international disease classification system used to standardize medical records, and the codes assigned under it are crucial for medical billing, resource allocation, and research. However, manual coding is time-consuming and prone to errors.

The researchers used data from 71,709 admissions at Fuwai Hospital, covering 168 primary diagnosis codes. Guided by clinical considerations, they built two feature engineering methods that transform discharge diagnosis and procedure texts into sequential features and sequential grouping features, and compared the resulting models against a baseline using one-hot encoding. They employed LightGBM (Light Gradient Boosting Machine) as the classifier and selected hyperparameters through grid search and cross-validation. The best model, built on sequential grouping features, reached an accuracy of 95.2% and a macro-averaged F1 score of 88.3% in the test phase. Incorporating sequence information and the grouping strategy significantly improved model performance (P-value < 0.01), and 91.1% of the codes achieved an F1 score above 70%.

The paper also highlights the interpretability of the model, using SHAP (SHapley Additive exPlanations) values to explain its decisions, and demonstrates the model's robustness to codes with different frequencies. Although it may perform suboptimally on complex or rare disease codes, the model codes common diseases well, which can lighten the workload of clinical coders, reduce errors, and help coders improve their skills. Future work may explore external validation on more diverse datasets and methods for handling low-frequency, complex codes.
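The reported metrics (accuracy, macro-averaged F1, and the share of codes whose F1 exceeds 70%) can be reproduced on any set of predictions with a few lines of code. The following is a minimal sketch using only the standard library; the function names and the six-admission toy data are assumptions for illustration and have no connection to the study's data or results.

```python
from collections import Counter
from typing import Dict, List


def per_code_f1(y_true: List[str], y_pred: List[str]) -> Dict[str, float]:
    """F1 for every code that occurs in the true or the predicted labels."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted code was wrong
            fn[truth] += 1  # true code was missed
    # F1 = 2*TP / (2*TP + FP + FN); the denominator is non-zero because
    # each code in the union occurs at least once as truth or prediction.
    return {
        code: 2 * tp[code] / (2 * tp[code] + fp[code] + fn[code])
        for code in sorted(set(y_true) | set(y_pred))
    }


def accuracy(y_true: List[str], y_pred: List[str]) -> float:
    """Fraction of admissions whose predicted code equals the true code."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(scores: Dict[str, float]) -> float:
    """Unweighted mean of per-code F1, so rare codes count as much as common ones."""
    return sum(scores.values()) / len(scores)


def share_above(scores: Dict[str, float], threshold: float) -> float:
    """Fraction of codes whose F1 is strictly above the threshold."""
    return sum(f1 > threshold for f1 in scores.values()) / len(scores)


# Toy labels, illustrative only.
y_true = ["I25.1", "I25.1", "I25.1", "I21.0", "I21.0", "I50.9"]
y_pred = ["I25.1", "I25.1", "I21.0", "I21.0", "I21.0", "I25.1"]

scores = per_code_f1(y_true, y_pred)
print(round(accuracy(y_true, y_pred), 3))  # 0.667
print(round(macro_f1(scores), 3))          # 0.489
print(round(share_above(scores, 0.7), 3))  # 0.333
```

In this toy case the single rare code is never predicted correctly, which pulls the macro-averaged F1 well below accuracy. The same mechanism is one plausible reading of why a model can report a Macro-F1 lower than its accuracy when low-frequency codes are harder to predict.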