Global Semantic Information Extraction Model for Chinese long text classification based on fine-tune BERT

Zongping Yang, Huihui Song, Jianping Li, Xiao Du
DOI: https://doi.org/10.1109/ITAIC54216.2022.9836921
2022-06-17
Abstract: Since Bidirectional Encoder Representations from Transformers (BERT) was proposed, it has achieved new state-of-the-art results on 11 Natural Language Processing (NLP) tasks and is among the most advanced embedding models available. However, the pre-trained BERT model can process input sequences of at most 512 tokens. Longer texts are usually truncated to fit a preset length, but truncation discards global information and can lead to classification errors. To address this problem, we place a Long Short-Term Memory (LSTM) model on top of BERT for a second stage of feature extraction and use an attention mechanism to optimize the global features. We call the resulting model BERTLSTMATT. Experimental results on the THUCNews dataset show that our model achieves better classification performance than other models.
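The abstract does not say how the long text is segmented or which form of attention is used, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It contrasts truncation with covering the whole text in consecutive chunks, and shows a simple dot-product attention pooling over a sequence of feature vectors. The function names, the chunking strategy, and the `query` vector are hypothetical; the BERT encoder and the LSTM layer are omitted.

```python
import math

MAX_LEN = 512  # BERT's maximum sequence length, as stated in the abstract.
# Simplification: a real BERT input also reserves positions for special tokens.


def truncate(tokens, max_len=MAX_LEN):
    """Baseline: keep the first max_len tokens and discard the rest."""
    return list(tokens[:max_len])


def split_into_chunks(tokens, max_len=MAX_LEN):
    """Assumed alternative: cover the whole text with consecutive chunks."""
    return [list(tokens[i:i + max_len]) for i in range(0, len(tokens), max_len)]


def attention_pool(vectors, query):
    """Softmax-weighted sum of vectors, scored by dot product with query.

    `vectors` stands in for per-step features (for example LSTM outputs);
    `query` is a hypothetical vector that would be learned during training.
    """
    scores = [sum(v_i * q_i for v_i, q_i in zip(v, query)) for v in vectors]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


if __name__ == "__main__":
    tokens = list(range(1200))  # placeholder token ids for a long document
    print(len(truncate(tokens)))                        # tokens kept by truncation
    print([len(c) for c in split_into_chunks(tokens)])  # chunk lengths
    pooled, weights = attention_pool([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
    print(pooled, weights)
```

With truncation, everything past position 512 is lost. With chunking, every token contributes to some feature vector, and the attention weights decide how much each vector counts toward the single pooled representation passed to a classifier.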
Computer Science