Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

Yiming Chen, Haobin Chen, Simin Liu, Yunyun Liu, Fanhao Zhou, Bing Wei
2024-06-26
Abstract: With the continuous advancement of artificial intelligence, natural language processing technology has become widely used in many fields. Generating Chinese news summaries nevertheless remains challenging. First, the semantics of Chinese news are complex and the amount of information is enormous, so extracting the critical information is difficult. Second, a news summary should be concise and clear, focusing on the main content and avoiding redundancy. Third, particular features of the Chinese language, such as polysemy and the need for word segmentation, make summary generation harder. To address these challenges, this paper studies an information extraction method for the LCSTS dataset based on an improved BERTSum-LSTM model. We improve the BERTSum-LSTM model so that it performs better at generating Chinese news summaries. The experimental results show that the proposed method is effective at generating news summaries, which is of practical value for news summarization.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The main focus of this paper is how to improve the quality and efficiency of Chinese news summarization. Specifically, the researchers discuss the following key issues:

1. **Semantic Complexity of Chinese News**: Chinese news has complex semantic structures and a large amount of information, which makes extracting key information a significant challenge.
2. **Conciseness and Clarity of Summaries**: News summaries need to be concise, highlight key points, and avoid redundancy.
3. **Specificity of the Chinese Language**: Issues such as polysemy and word segmentation make it difficult to generate high-quality Chinese news summaries.

To address these issues, the researchers propose a method based on an improved BERTSum-LSTM model to handle the information extraction task on the LCSTS dataset and generate better Chinese news summaries. The model improves performance in the following ways:

- **Utilizing the BERT Model**: Encodes the text to obtain a deep semantic representation of each sentence.
- **Combining the LSTM Model**: Captures the sequential dependencies in the text.
- **Evaluating Sentence Importance**: Computes an importance score for each sentence through a linear layer and selects the highest-scoring sentences to form the summary.

Experimental results show that the proposed BERTSum-LSTM model outperforms the standalone LSTM and BERTSum models on the LCSTS dataset, with significant improvements in the ROUGE-1, ROUGE-2, and ROUGE-L metrics. This indicates that the improved model handles Chinese news summary generation better, which helps improve both the quality and the efficiency of summarization.
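
The sentence-scoring and selection steps described above, along with a ROUGE-N score of the kind used for evaluation, can be illustrated with a minimal plain-Python sketch. This is not the authors' implementation. The function names, the sigmoid on top of the linear layer, the toy vectors standing in for BERT and LSTM outputs, and the character-level tokenization for ROUGE are all assumptions made here for illustration.

```python
import math
from collections import Counter


def score_sentences(sentence_vectors, weights, bias):
    """Linear layer followed by a sigmoid (the sigmoid is an assumption):
    returns one importance score in (0, 1) per sentence vector."""
    scores = []
    for vec in sentence_vectors:
        logit = sum(w * x for w, x in zip(weights, vec)) + bias
        scores.append(1.0 / (1.0 + math.exp(-logit)))
    return scores


def select_summary(sentences, scores, k=1):
    """Keep the k highest-scoring sentences, restored to document order."""
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    keep = sorted(ranked[:k])
    return [sentences[i] for i in keep]


def rouge_n_f1(candidate, reference, n=1):
    """ROUGE-N F1 with single characters as tokens (an assumed tokenization)."""
    def ngrams(text):
        return Counter(text[i:i + n] for i in range(len(text) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    overlap = sum((cand & ref).values())  # clipped n-gram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Toy inputs: in the real model these vectors would come from BERT + LSTM.
sentences = ["sentence A", "sentence B", "sentence C"]
vectors = [[2.0, 0.5], [-1.0, 0.0], [1.0, 1.0]]
weights, bias = [1.0, 0.5], 0.0  # hypothetical learned parameters

scores = score_sentences(vectors, weights, bias)
summary = select_summary(sentences, scores, k=2)
print(summary)
```

With these toy values the first and third sentences receive the highest scores, so they are kept in their original order. In a trained model the weights would be learned jointly with the encoder rather than fixed by hand.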