Named Entity Recognition Method of Legal Instruments Based on Improved Few-Shot Learning

Qiang Liu,Jianbin Wang,Jinbo Fu,Jiong Liu,Bo Chen
DOI: https://doi.org/10.1109/access.2024.3484765
IF: 3.9
2024-11-01
IEEE Access
Abstract:Automatically extracting critical information from voluminous legal instruments can significantly enhance judges' efficiency in handling cases and foster the development of intelligent courts. However, the scarcity of adequately annotated data in legal instruments presents a challenge to training a reliable model for named entity recognition. To address this issue, a novel named entity recognition approach for legal instruments based on improved few-shot learning is introduced in this paper. This approach enables named entity recognition of a large amount of unknown legal data with a small amount of annotated data on legal instruments. Firstly, data augmentation is employed to expand the training sample, and a BERT pre-training model is used to obtain word vectors of the text data. Subsequently, the NNshot model is applied to learn text data features and perform classification. Then, a Viterbi decoder based on label-pair rules is used to capture the dependencies between categories, correct the NNshot prediction results, and select the optimal classification sequence. Finally, the experimental results demonstrate that the proposed named entity recognition method is reasonably reliable and can overcome the challenge that entities cannot be accurately recognized due to insufficient annotation data.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?