Automatic ICD Coding Based on Multi-granularity Feature Fusion

Yu Ying, Duan Junwen, Jiang Han, Wang Jianxin
DOI: https://doi.org/10.1007/978-3-031-23198-8_3
2023-01-01
Abstract: International Classification of Diseases (ICD) coding assigns standard codes, which describe the state of a patient, to a clinical note. The task is challenging given the complexity and the large number of codes. The ICD taxonomy is organized hierarchically, with codes at several levels (chapter, category, subcategory, and its subdivisions). However, most existing studies focus on predicting the fine-grained subcategory codes and neglect the hierarchical relations among ICD codes. As a result, those models pay little attention to the features that sibling subcategories have in common. Such common features could help with predicting rare codes, and they could be captured by the task of coarse-grained code prediction. In this paper, we propose a multi-task learning model that explicitly trains multiple classifiers for different code levels. Simultaneously, we capture the relations between finer-grained and coarser-grained labels through a reinforcement mechanism. Extensive experiments on an English dataset and a Chinese dataset show that our approach achieves competitive performance compared with baseline models, especially on Macro-F1.
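The multi-granularity idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the loss function, the task weighting, or the reinforcement mechanism, so the sketch assumes ICD-9-style codes with a dot separator, a binary cross-entropy loss per level, and a hypothetical `coarse_weight` parameter, and it leaves the reinforcement mechanism out entirely. All function names are illustrative.

```python
import math

def coarsen(code: str) -> str:
    """Map a subcategory code such as '428.0' to its category '428'.
    Assumes ICD-9-style codes where the category precedes the dot."""
    return code.split(".")[0]

def multi_hot(codes, vocab):
    """Encode a set of codes as a 0/1 vector over a fixed vocabulary."""
    present = set(codes)
    return [1.0 if c in present else 0.0 for c in vocab]

def bce(probs, targets, eps=1e-12):
    """Mean binary cross-entropy over one label vector."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
    return total / len(targets)

def joint_loss(fine_probs, fine_targets, coarse_probs, coarse_targets,
               coarse_weight=0.5):
    """Fine-grained loss plus a weighted coarse-grained loss.
    coarse_weight is a hypothetical hyperparameter, not from the paper."""
    return (bce(fine_probs, fine_targets)
            + coarse_weight * bce(coarse_probs, coarse_targets))

# Toy vocabulary: two sibling subcategories under 428, one under 250.
fine_vocab = ["250.00", "428.0", "428.1"]
coarse_vocab = sorted({coarsen(c) for c in fine_vocab})

# A note labeled with one subcategory yields targets at both levels.
note_codes = ["428.0"]
fine_targets = multi_hot(note_codes, fine_vocab)
coarse_targets = multi_hot([coarsen(c) for c in note_codes], coarse_vocab)

# Placeholder predictions standing in for the outputs of two classifiers.
loss = joint_loss([0.5, 0.5, 0.5], fine_targets, [0.5, 0.5], coarse_targets)
```

In this toy setup, the sibling codes 428.0 and 428.1 share the coarse label 428, so a note labeled with either one contributes to the same coarse-grained target. That is the sense in which a coarse-level classifier could pick up features common to sibling subcategories.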