Abstract:Background: Diagnosis codes are assigned to medical records in healthcare facilities by trained coders by reviewing all physician authored documents associated with a patient's visit. This is a necessary and complex task involving coders adhering to coding guidelines and coding all assignable codes. With the popularity of electronic medical records (EMRs), computational approaches to code assignment have been proposed in the recent years. However, most efforts have focused on single and often short clinical narratives, while realistic scenarios warrant full EMR level analysis for code assignment. Objective: We evaluate supervised learning approaches to automatically assign international classification of diseases (ninth revision) - clinical modification (ICD-9-CM) codes to EMRs by experimenting with a large realistic EMR dataset. The overall goal is to identify methods that offer superior performance in this task when considering such datasets. Methods: We use a dataset of 71,463 EMRs corresponding to in-patient visits with discharge date falling in a two year period (2011-2012) from the University of Kentucky (UKY) Medical Center. We curate a smaller subset of this dataset and also use a third gold standard dataset of radiology reports. We conduct experiments using different problem transformation approaches with feature and data selection components and employing suitable label calibration and ranking methods with novel features involving code co-occurrence frequencies and latent code associations. Results: Over all codes with at least 50 training examples we obtain a micro F-score of 0.48. On the set of codes that occur at least in 1% of the two year dataset, we achieve a micro F-score of 0.54. For the smaller radiology report dataset, the classifier chaining approach yields best results. For the smaller subset of the UKY dataset, feature selection, data selection, and label calibration offer best performance. Conclusions: We show that datasets at different scale (size of the EMRs, number of distinct codes) and with different characteristics warrant different learning approaches. For shorter narratives pertaining to a particular medical subdomain (e.g., radiology, pathology), classifier chaining is ideal given the codes are highly related with each other. For realistic in-patient full EMRs, feature and data selection methods offer high performance for smaller datasets. However, for large EMR datasets, we observe that the binary relevance approach with learning-to-rank based code reranking offers the best performance. Regardless of the training dataset size, for general EMRs, label calibration to select the optimal number of labels is an indispensable final step.

Active learning for medical code assignment

Active Learning on Medical Image

Clustered Automated Machine Learning (CAML) model for clinical coding multi-label classification

An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records

Automatic Medical Code Assignment via Deep Learning Approach for Intelligent Healthcare

Batch Mode Active Learning and Its Application to Medical Image Classification

Adversarial active learning for the identification of medical concepts and annotation inconsistency

A Scalable Workflow to Build Machine Learning Classifiers with Clinician-in-the-Loop to Identify Patients in Specific Diseases

A Comparative Study on Automatic Coding of Medical Letters with Explainability

Deep-ADCA: Development and Validation of Deep Learning Model for Automated Diagnosis Code Assignment Using Clinical Notes in Electronic Medical Records

Deep active learning for classifying cancer pathology reports

Learning from Undercoded Clinical Records for Automated International Classification of Diseases (ICD) Coding

Multi-Label Active Learning-Based Machine Learning Model for Heart Disease Prediction

A comprehensive survey on deep active learning in medical image analysis

On combining active and transfer learning for medical data classification

Focused Active Learning for Histopathological Image Classification

PCDAL: A Perturbation Consistency-Driven Active Learning Approach for Medical Image Segmentation and Classification

Active Learning with Noise Modeling for Medical Image Annotation

An Adaptive Query Approach for Extracting Medical Images for Disease Detection Applications

Using natural language processing for automated classification of disease and to identify misclassified ICD codes in cardiac disease

Multi-label natural language processing to identify diagnosis and procedure codes from MIMIC-III inpatient notes