Exploration of medical record text error correction based on clinical medical record pre- training language model

NAI Cun-jian,YANG Liang,CHEN Wen-chang,LI Lin-feng,REN Yu-fei,WANG Huo-ming,ZHANG Xiao-xiang
DOI: https://doi.org/10.3969/j.issn.1671-3982.2022.10.005
2022-01-01
Abstract:The presence of misspelled words in electronic medical records(EMR) is not only inconsistent with the national electronic medical record management norms, but also reduces the effectiveness of natural language processing techniques, which in turn affects the value mining and application of EMR. A method of automatic typo correction based on a pre-trained language model trained on a large corpus of real-world medical records was elaborated in this paper. Experiments showed that the method performed well in error detection and correction on both simulated and real-world datasets. The method operates with high efficiency and can support error correction of EMR during and after the event, which can effectively improve the quality and thus promote the application of EMR.
What problem does this paper attempt to address?