Abstract:Military information is gradually overloaded due to the diversity of sources and the exponential growth in quantity, which greatly affects the accuracy of intelligence personnel in extracting and analyzing military information. The modern warfare approach has also evolved from the traditional physical domain to the cognitive domain, and competing for advantages in the cognitive domain has become a key objective of combat. Therefore, constructing domain knowledge graphs and mining the relationships between data play an important role in cognitive domain analysis. In this paper, we propose a convolutional network recognition method based on improved two-layer bi-directional BiLSTM networks named the BERT-α BiLSTMs-RECNN-CRF (BDBRC) model. For the difficulties of military entities generally having long names and low extraction accuracy, as well as the existence of a large number of composite entities that are difficult to recognize, an improved two-layer BiLSTM model is devised first. In view of the fact that the BiLSTMs model always extracts features equally in long-distance text sequences without actually considering the different influences of different sentence contexts, contribution factor a is added to extract the contribution of the above and below to the target entity in different sentences respectively. Then, aiming at the strong problem of the domain of military news texts and the high level of inter-entity ambiguity, we propose a method that utilizes a modified convolutional network (RECNN) for partial feature extraction and jointly with a modified two-layer BiLSTM network for entity recognition. The experiment on the self-constructed dataset shows that the F1 value of the model proposed in this paper reaches 93.18%, and the F1 value, P, and H of our model are all better than the baseline model, which verifies the performance of the model. At the same time, we use public data sets MSRA and CLUB2020, and the experimental results show that the model proposed in this paper also has a good performance in the public data set, verifying the universality of the model. It can provide methodological support for the construction of the military knowledge graph.

XLM-RoBERTa Model for Key Information Extraction on Military Document

Towards information extraction from ISR reports for decision support using a two-stage learning-based approach

REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Named entity recognition of military equipment based on BERT-BILSTM-CRF model

A Character-Level Document Key Information Extraction Method with Contrastive Learning.

BDBRC: A Chinese Military Entity Recognition Model Combining Context Contribution and Residual Dilatation Convolutional Networks

ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents

GenKIE: Robust Generative Multimodal Document Key Information Extraction

Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology

IPerFEX-2023: Indonesian personal financial entity extraction using indoBERT-BiGRU-CRF model

LMDX: Language Model-based Document Information Extraction and Localization

Research on Military Equipment Entity Recognition and Knowledge Graph Construction Method Based on ALBERT-Bi-LSTM-CRF

CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents

milIE: Modular & Iterative Multilingual Open Information Extraction

WebKE: Knowledge Extraction from Semi-structured Web with Pre-trained Markup Language Model

UMLS-KGI-BERT: Data-Centric Knowledge Integration in Transformers for Biomedical Entity Recognition

Towards Lingua Franca Named Entity Recognition with BERT

The Entity Relationship Extraction Method Using Improved RoBERTa and Multi-Task Learning

Research on Domain-Specific Knowledge Graph Based on the RoBERTa-wwm-ext Pretraining Model

xFinder: Robust and Pinpoint Answer Extraction for Large Language Models