FinBERT–MRC: Financial Named Entity Recognition Using BERT Under the Machine Reading Comprehension Paradigm

DOI: https://doi.org/10.1007/s11063-023-11266-5
IF: 2.565
2023-04-08
Neural Processing Letters
Abstract:Financial named entity recognition (FinNER) is a challenging task in the field of financial text information extraction, which aims to extract a large amount of financial knowledge from unstructured texts. It is widely accepted to use the sequence tagging framework to implement the FinNER tasks. However, such sequence tagging models cannot fully take advantage of the semantic information in the texts. Instead, we formulate the FinNER task as a machine reading comprehension (MRC) problem and propose a new model termed FinBERT–MRC. This formulation introduces significant prior information by utilizing well-designed queries, and extracts the start index and end index of the target entities without decoding modules such as conditional random fields (CRFs). We conduct experiments on a publicly available Chinese financial dataset ChFinAnn and a real-world business dataset AdminPunish. FinBERT–MRC achieves average scores of 92.78% and 96.80% on two datasets, respectively, with average gains + 3.75% and + 0.68% over some sequence tagging models including BiLSTM–CRF, BiGRU–CRF, BiLSTM–CNN–CRF, FinBERT–Tagger, and FinBERT–CRF. The source code is available at https://github.com/zyz0000/FinBERT-MRC.
computer science, artificial intelligence
What problem does this paper attempt to address?