Abstract: In the era of social networks, the rapid growth of data mining in information retrieval and natural language processing makes automatic text summarization necessary. Currently, pretrained word embeddings and sequence-to-sequence models can be effectively adapted to social network summarization, where their strong encoding capability helps extract significant information. However, how to handle long-range text dependencies and exploit the latent topic mapping has become an increasingly crucial challenge for these models. In this article, we propose a topic-aware extractive and abstractive summarization model named T-BERTSum, based on Bidirectional Encoder Representations from Transformers (BERT). Unlike previous models, the proposed approach can simultaneously infer topics and generate summaries from social texts. First, the latent topic representation encoded by the neural topic model (NTM) is matched with the embedded representation of BERT to guide generation with the topic. Second, long-term dependencies are learned through the transformer network to jointly explore topic inference and text summarization in an end-to-end manner. Third, long short-term memory (LSTM) network layers are stacked on the extractive model to capture sequence timing information, and the effective information is further filtered on the abstractive model through a gated network. In addition, a two-stage extractive–abstractive model is constructed to share the information. Compared with previous work, T-BERTSum focuses on pretrained external knowledge and topic mining to capture more accurate contextual representations. Experimental results on the CNN/Daily Mail and XSum datasets demonstrate that the proposed model achieves new state-of-the-art results while generating topics that are consistent with those of the most advanced method.
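The abstract does not give the equations for how the topic representation is combined with the BERT embeddings or how the gated network filters information. As a rough illustration only, the sketch below assumes one common form of gated fusion: a sigmoid gate computed from the concatenation of a token vector and a shared topic vector, used to mix the two element-wise. The function name `gated_topic_fusion`, the weight layout, and the random toy inputs are all hypothetical and are not taken from T-BERTSum.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gated_topic_fusion(token_vecs, topic_vec, weights, bias):
    """Mix each token vector h with a shared topic vector t (assumed form).

    gate = sigmoid(W . [h; t] + b)      one gate value per dimension
    out  = gate * h + (1 - gate) * t    element-wise convex combination
    """
    dim = len(topic_vec)
    fused = []
    for h in token_vecs:
        concat = h + topic_vec  # list concatenation, length 2 * dim
        out = []
        for i in range(dim):
            z = sum(w * x for w, x in zip(weights[i], concat)) + bias[i]
            g = sigmoid(z)
            out.append(g * h[i] + (1.0 - g) * topic_vec[i])
        fused.append(out)
    return fused


# Toy inputs standing in for BERT token embeddings and an NTM topic vector.
random.seed(0)
dim = 4
tokens = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(3)]
topic = [random.uniform(-1, 1) for _ in range(dim)]
W = [[random.uniform(-0.5, 0.5) for _ in range(2 * dim)] for _ in range(dim)]
b = [0.0] * dim

fused = gated_topic_fusion(tokens, topic, W, b)
```

Because the gate lies strictly between 0 and 1, every fused value falls between the corresponding token and topic values, so the gate decides per dimension how much topic signal passes through. A real model would learn `W` and `b` jointly with the rest of the network rather than sampling them at random.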

Text summarization based on multi-head self-attention mechanism and pointer network

An Abstractive Summarizer Based on Improved Pointer-Generator Network

Abstract Summarization Model Based on Semantic Graphs and Entity Pointers

Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning

A Syntax-Augmented and Headline-Aware Neural Text Summarization Method

HITS-based attentional neural model for abstractive summarization

Abstractive Summarization Improved by WordNet-based Extractive Sentences

Improved BIO-based Chinese Automatic Abstract-generation Model

Selective and Coverage Multi-head Attention for Abstractive Summarization

Deep learning-based extractive text summarization with word-level attention mechanism

Salience Estimation with Multi-Attention Learning for Abstractive Text Summarization

Improving the readability and saliency of abstractive text summarization using combination of deep neural networks equipped with auxiliary attention mechanism

Incorporating word attention with convolutional neural networks for abstractive summarization

Text Summarization Method Based on Gated Attention Graph Neural Network

Research on Text Summarization Model with Coverage Mechanism

Neural Abstractive Summarization with Structural Attention

Abstractive method-based Text Summarization using Bidirectional Long Short-Term Memory and Pointer Generator Mode

Abstractive Document Summarization via Neural Model with Joint Attention

T-BERTSum: Topic-Aware Text Summarization Based on BERT

Self-Attention Guided Copy Mechanism for Abstractive Summarization

Automatic summarization model based on clustering algorithm