Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

Yiming Chen,Haobin Chen,Simin Liu,Yunyun Liu,Fanhao Zhou,Bing Wei

2024-06-26

Abstract:With the continuous advancement of artificial intelligence, natural language processing technology has become widely utilized in various fields. At the same time, there are many challenges in creating Chinese news summaries. First of all, the semantics of Chinese news is complex, and the amount of information is enormous. Extracting critical information from Chinese news presents a significant challenge. Second, the news summary should be concise and clear, focusing on the main content and avoiding redundancy. In addition, the particularity of the Chinese language, such as polysemy, word segmentation, etc., makes it challenging to generate Chinese news summaries. Based on the above, this paper studies the information extraction method of the LCSTS dataset based on an improved BERTSum-LSTM model. We improve the BERTSum-LSTM model to make it perform better in generating Chinese news summaries. The experimental results show that the proposed method has a good effect on creating news summaries, which is of great importance to the construction of news summaries.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The main focus of this paper is on how to improve the quality and efficiency of Chinese news summarization. Specifically, the researchers discussed the following key issues: 1. **Semantic Complexity of Chinese News**: Chinese news has complex semantic structures and a large amount of information, making it a significant challenge to extract key information. 2. **Conciseness and Clarity of Summaries**: News summaries need to be concise, highlight key points, and avoid redundancy. 3. **Specificity of the Chinese Language**: Issues such as polysemy and word segmentation make it difficult to generate high-quality Chinese news summaries. To address the above issues, the researchers proposed a method based on an improved BERTSum-LSTM model to handle the information extraction task in the LCSTS dataset and generate better Chinese news summaries. The model enhances performance through the following ways: - **Utilizing the BERT Model**: Used to encode the text to obtain deep semantic representations of each sentence. - **Combining the LSTM Model**: Captures the sequential dependencies in the text. - **Evaluating Sentence Importance**: Calculates the importance score of each sentence through a linear layer and selects the most important sentences based on the scores to generate the summary. Experimental results show that the proposed BERTSum-LSTM model outperforms the standalone LSTM and BERTSum models on the LCSTS dataset, especially achieving significant improvements in ROUGE-1, ROUGE-2, and ROUGE-L metrics. This indicates that the improved model can better handle the task of generating Chinese news summaries, helping to improve the quality and efficiency of the summaries.

Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

Global Semantic Information Extraction Model for Chinese long text classification based on fine-tune BERT

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method

T-BERTSum: Topic-Aware Text Summarization Based on BERT

Improved BIO-based Chinese Automatic Abstract-generation Model

Fine-tune BERT for Extractive Summarization

LCSTS: A Large Scale Chinese Short Text Summarization Dataset

News Text Summarization Method based on BART-TextRank Model

End-to-End Segmentation-based News Summarization

Approach to Predicting News -- A Precise Multi-LSTM Network With BERT

An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features

Role of Natural Language Processing in Document Understanding and Semantic Analysis: A Chinese Perspective

Extraction of temporal information from social media messages using the BERT model

Template Based Chinese News Event Summarization

Abstractive Summarization Improved by WordNet-based Extractive Sentences

Text Summarization Using Large Language Models: A Comparative Study of MPT-7b-instruct, Falcon-7b-instruct, and OpenAI Chat-GPT Models

Long short-term memory (LSTM)-based news classification model

TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale

Cross-language Document Summarization Via Extraction and Ranking of Multiple Summaries

A Convolution BiLSTM Neural Network Model for Chinese Event Extraction