Abstract:Abstract Background Clinical notes are unstructured text documents generated by clinicians during patient encounters, generally are annotated with International Classification of Diseases (ICD) codes, which give formatted information about the diagnosis and treatment. ICD code has shown its potentials in many fields, but manual coding is labor-intensive and error-prone, lead to researches of automatic coding. Two specific challenges of this task are (1) given an annotated clinical notes, the reasons behind specific diagnoses and treatments are implicit; (2) explainability is important for practical automatic coding method, the method should not only explain its prediction output but also have explainable internal mechanics. This study aims to develop an explainable CNN approach to address these two challenges. Method Our key idea is that for the automatic ICD coding task, the presence of informative snippets in the clinical text that correlated with each code plays an important role in the prediction of codes, and an informative snippet can be considered as a local and low-level feature. We infer that there exists a correspondence between a convolution filter and a local and low-level feature. Base on the inference, we come up with the Shallow and Wide Attention convolutional Mechanism (SWAM) to improve the CNN-based models’ ability to learn local and low-level features for each label. Results We evaluate our approach on MIMIC-III, an open-access dataset of ICU medical records. Our approach substantially outperforms previous results on top-50 medical code prediction on MIMIC-III dataset, the precision of the worst-performing 10% labels in previous works is increased from 0% to 53% on average. We attribute this improvement to SWAM, by which the wide architecture with attention mechanism gives the model ability to more extensively learn the unique features of different codes, and we prove it by an ablation experiment. Besides, we perform manual analysis of the performance imbalance between different codes, and preliminary conclude the characteristics that determine the difficulty of learning specific codes. Conclusions Our main contributions can be summarized into the following three: (1) We present local and low-level features, a.k.a. informative snippets play an important role in the automatic ICD coding task, and the informative snippets extracted from the clinical text provide explanations for each code. (2) We propose that there exists a correspondence between a convolution filter and a local and low-level feature. A combination of wide and shallow convolutional layer and attention layer can help the CNN-based models better learn local and low-level features. (3) We improved the precision of the worst-performing 10% labels from 0 to 53% on average.

Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III

Medical Code Prediction from Discharge Summary: Document to Sequence BERT using Sequence Attention

Medical code prediction via capsule networks and ICD knowledge

Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification

Multi-label natural language processing to identify diagnosis and procedure codes from MIMIC-III inpatient notes

TransICD: Transformer Based Code-wise Attention Model for Explainable ICD Coding

Improving ICD coding using Chapter based Named Entities and Attentional Models

Natural language processing of MIMIC-III clinical notes for identifying diagnosis and procedures with neural networks

Combining transformer-based model and GCN to predict ICD codes from clinical records

Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings

Automatic Medical Code Assignment via Deep Learning Approach for Intelligent Healthcare

Predicting Multiple ICD-10 Codes from Brazilian-Portuguese Clinical Notes

ICDXML: enhancing ICD coding with probabilistic label trees and dynamic semantic representations

Automatic ICD Coding Based on Segmented ClinicalBERT with Hierarchical Tree Structure Learning.

An explainable CNN approach for medical codes prediction from clinical text

Ensemble neural models for ICD code prediction using unstructured and structured healthcare data

A two-stream deep model for automated ICD-9 code prediction in an intensive care unit

Towards Automated ICD Coding Using Deep Learning

Read, Attend, and Code: Pushing the Limits of Medical Codes Prediction from Clinical Notes by Machines

Deep-ADCA: Development and Validation of Deep Learning Model for Automated Diagnosis Code Assignment Using Clinical Notes in Electronic Medical Records

Towards BERT-based Automatic ICD Coding: Limitations and Opportunities