Abstract:With its unique information-filtering function, text summarization technology has become a significant aspect of search engines and question-and-answer systems. However, existing models that include the copy mechanism often lack the ability to extract important fragments, resulting in generated content that suffers from thematic deviation and insufficient generalization. Specifically, Chinese automatic summarization using traditional generation methods often loses semantics because of its reliance on word lists. To address these issues, we proposed the novel BioCopy mechanism for the summarization task. By training the tags of predictive words and reducing the probability distribution range on the glossary, we enhanced the ability to generate continuous segments, which effectively solves the above problems. Additionally, we applied reinforced canonicality to the inputs to obtain better model results, making the model share the sub-network weight parameters and sparsing the model output to reduce the search space for model prediction. To further improve the model’s performance, we calculated the bilingual evaluation understudy (BLEU) score on the English dataset CNN/DailyMail to filter the thresholds and reduce the difficulty of word separation and the dependence of the output on the word list. We fully fine-tuned the model using the LCSTS dataset for the Chinese summarization task and conducted small-sample experiments using the CSL dataset. We also conducted ablation experiments on the Chinese dataset. The experimental results demonstrate that the optimized model can learn the semantic representation of the original text better than other models and performs well with small sample sizes.

Self-Attention Guided Copy Mechanism for Abstractive Summarization.

Learn to Copy from the Copying History - Correlational Copy Network for Abstractive Summarization.

Selective and Coverage Multi-head Attention for Abstractive Summarization

Structure-Infused Copy Mechanisms for Abstractive Summarization

Controlling Decoding for More Abstractive Summaries with Copy-Based Networks

Improved BIO-based Chinese Automatic Abstract-generation Model

HITS-based attentional neural model for abstractive summarization

Abstractive Summarization Improved by WordNet-based Extractive Sentences

An Abstractive Summarizer Based on Improved Pointer-Generator Network

Incorporating word attention with convolutional neural networks for abstractive summarization

Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems

Text summarization based on multi-head self-attention mechanism and pointer network

Topic-Guided Abstractive Text Summarization: a Joint Learning Approach

Contrastive Attention Mechanism for Abstractive Sentence Summarization

Topic-Aware Abstractive Text Summarization

Abstractive Document Summarization with a Graph-Based Attentional Neural Model.

Extract-and-Abstract: Unifying Extractive and Abstractive Summarization within Single Encoder-Decoder Framework

Abstractive Summarization Using Attentive Neural Techniques

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

A Neural Attention Model for Abstractive Sentence Summarization