Abstract:With its unique information-filtering function, text summarization technology has become a significant aspect of search engines and question-and-answer systems. However, existing models that include the copy mechanism often lack the ability to extract important fragments, resulting in generated content that suffers from thematic deviation and insufficient generalization. Specifically, Chinese automatic summarization using traditional generation methods often loses semantics because of its reliance on word lists. To address these issues, we proposed the novel BioCopy mechanism for the summarization task. By training the tags of predictive words and reducing the probability distribution range on the glossary, we enhanced the ability to generate continuous segments, which effectively solves the above problems. Additionally, we applied reinforced canonicality to the inputs to obtain better model results, making the model share the sub-network weight parameters and sparsing the model output to reduce the search space for model prediction. To further improve the model’s performance, we calculated the bilingual evaluation understudy (BLEU) score on the English dataset CNN/DailyMail to filter the thresholds and reduce the difficulty of word separation and the dependence of the output on the word list. We fully fine-tuned the model using the LCSTS dataset for the Chinese summarization task and conducted small-sample experiments using the CSL dataset. We also conducted ablation experiments on the Chinese dataset. The experimental results demonstrate that the optimized model can learn the semantic representation of the original text better than other models and performs well with small sample sizes.

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

Improving Semantic Relevance For Sequence-To-Sequence Learning Of Chinese Social Media Text Summarization

Abstractive Summarization Improved by WordNet-based Extractive Sentences

Abstract Summarization Model Based on Semantic Graphs and Entity Pointers

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss

Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization

Improved BIO-based Chinese Automatic Abstract-generation Model

Noised Consistency Training for Text Summarization

Topic-Guided Abstractive Text Summarization: a Joint Learning Approach

Abstractive Document Summarization via Neural Model with Joint Attention

Improving Social Media Text Summarization by Learning Sentence Weight Distribution

Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning

Text Summarization Generation Based on Semantic Similarity

A Syntax-Augmented and Headline-Aware Neural Text Summarization Method

Incorporating word attention with convolutional neural networks for abstractive summarization

HITS-based attentional neural model for abstractive summarization

Joint Parsing and Generation for Abstractive Summarization

Topic-Aware Abstractive Text Summarization

Balancing Lexical and Semantic Quality in Abstractive Summarization

Generative Adversarial Network for Abstractive Text Summarization.