Med-BERT: A Pre-Training Framework for Medical Records Named Entity Recognition

Ning Liu,Qian Hu,Huayun Xu,Xing Xu,Mengxin Chen
DOI: https://doi.org/10.1109/tii.2021.3131180
IF: 12.3
2022-01-01
IEEE Transactions on Industrial Informatics
Abstract:A large amount of data is generated every day with the development of Internet medical care, which is of great significance for the clinical decision support system and medical real-world research. Medical records named entity recognition (NER) is important on the aforementioned research topics under the premise of protecting patients' private information. In this article, we propose a medical dictionary enhanced bidirectional encoder representations from transformers (BERT), dubbed Med-BERT, to achieve better representations of long medical entities. On Med-BERT, we propose a span flat-lattice transformer (Span-FLAT) method on medical records NER, and the entity types include private information such as names and addresses, as well as medical information such as patient symptoms, signs, and diseases. Experimental results on two benchmark medical datasets show the effectiveness of Med-BERT, and the proposed Med-BERT-based Span-FLAT method remarkably outperforms the state-of-the-art methods on medical NER task.
What problem does this paper attempt to address?