ClinicalRadioBERT: Knowledge-Infused Few Shot Learning for Clinical Notes Named Entity Recognition.

Saed Rezayi, Haixing Dai, Zhengliang Liu, Zihao Wu, Akarsh Hebbar, Andrew H. Burns, Lin Zhao, Dajiang Zhu, Quanzheng Li, Wei Liu, Sheng Li, Tianming Liu, Xiang Li
DOI: https://doi.org/10.1007/978-3-031-21014-3_28
2022-01-01
Abstract: Transformer-based language models such as BERT have been widely applied to many domains through model pretraining and fine-tuning. However, in low-resource scenarios such as clinical cases, customizing a BERT-based language model is still a challenging task. In this paper, we focus on the radiotherapy domain and train a ClinicalRadioBERT model for analyzing clinical notes through a two-step procedure. First, we fine-tune a BioBERT model on the full texts of radiotherapy literature and name the resulting model RadioBERT. Second, we propose a knowledge-infused few-shot learning (KI-FSL) approach that leverages domain knowledge and trains the ClinicalRadioBERT model for understanding radiotherapy clinical notes. We evaluate ClinicalRadioBERT on a newly collected clinical notes dataset and demonstrate its superiority over baselines on few-shot named entity recognition. We will apply ClinicalRadioBERT to link BERT and medical imaging for radiotherapy.