Abstract:In the era of social networks, the rapid growth of data mining in information retrieval and natural language processing makes automatic text summarization necessary. Currently, pretrained word embedding and sequence to sequence models can be effectively adapted in social network summarization to extract significant information with strong encoding capability. However, how to tackle the long text dependence and utilize the latent topic mapping has become an increasingly crucial challenge for these models. In this article, we propose a topic-aware extractive and abstractive summarization model named T-BERTSum, based on Bidirectional Encoder Representations from Transformers (BERTs). This is an improvement over previous models, in which the proposed approach can simultaneously infer topics and generate summarization from social texts. First, the encoded latent topic representation, through the neural topic model (NTM), is matched with the embedded representation of BERT, to guide the generation with the topic. Second, the long-term dependencies are learned through the transformer network to jointly explore topic inference and text summarization in an end-to-end manner. Third, the long short-term memory (LSTM) network layers are stacked on the extractive model to capture sequence timing information, and the effective information is further filtered on the abstractive model through a gated network. In addition, a two-stage extractive–abstractive model is constructed to share the information. Compared with the previous work, the proposed model T-BERTSum focuses on pretrained external knowledge and topic mining to capture more accurate contextual representations. Experimental results on the CNN/Daily mail and XSum datasets demonstrate that our proposed model achieves new state-of-the-art results while generating consistent topics compared with the most advanced method.

MuchSUM: Multi-channel Graph Neural Network for Extractive Summarization

A GAN Based Video Summarization Method with Representation Loss

Multi-granularity heterogeneous graph attention networks for extractive document summarization

A Multi-Granularity Heterogeneous Graph for Extractive Text Summarization

Multi-View Metrics Enhanced Heterogeneous Graph Neural Network for Extractive Summarization

Multiplex Graph Neural Network for Extractive Text Summarization

Heterogeneous Graph Neural Networks for Extractive Document Summarization

Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders

Leveraging Graph to Improve Abstractive Multi-Document Summarization.

SEASum: Syntax-Enriched Abstractive Summarization

StarSum: A Star Architecture Based Model for Extractive Summarization

Multi-Document Abstractive Summarization Using Chunk-graph and Recurrent Neural Network

SgSum: Transforming Multi-document Summarization into Sub-graph Selection

Learning Summary Prior Representation for Extractive Summarization.

Memory-based Extractive Summarization

Graph-based Neural Multi-Document Summarization

Multi-Roles Graph Based Extractive Summarization

Video Summarization Generation Network Based on Dynamic Graph Contrastive Learning and Feature Fusion

An Integrated Graph Model For Document Summarization

A Comprehensive Survey on Graph Summarization with Graph Neural Networks

T-BERTSum: Topic-Aware Text Summarization Based on BERT