Abstract:Automatic text summarization (TS) plays a pivotal role in condensing large volumes of information into concise, coherent summaries, facilitating efficient information retrieval and comprehension. This paper presents a novel framework for abstractive TS of single documents, which integrates three dominant aspects: structural, semantic, and neural-based approaches. The proposed framework merges machine learning and knowledge-based techniques to achieve a unified methodology. The framework consists of three main phases: pre-processing, machine learning, and post-processing. In the pre-processing phase, a knowledge-based Word Sense Disambiguation (WSD) technique is employed to generalize ambiguous words, enhancing content generalization. Semantic content generalization is then performed to address out-of-vocabulary (OOV) or rare words, ensuring comprehensive coverage of the input document. Subsequently, the generalized text is transformed into a continuous vector space using neural language processing techniques. A deep sequence-to-sequence (seq2seq) model with an attention mechanism is employed to predict a generalized summary based on the vector representation. In the post-processing phase, heuristic algorithms and text similarity metrics are utilized to refine the generated summary further. Concepts from the generalized summary are matched with specific entities, enhancing coherence and readability. Experimental evaluations conducted on prominent datasets, including Gigaword, Duc 2004, and CNN/DailyMail, demonstrate the effectiveness of the proposed framework. Results indicate significant improvements in handling rare and OOV words, outperforming existing state-of-the-art deep learning techniques. The proposed framework presents a comprehensive and unified approach towards abstractive TS, combining the strengths of structure, semantics, and neural-based methodologies.

Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems

SAC: Accelerating and Structuring Self-Attention Via Sparse Adaptive Connection.

Neural Abstractive Summarization with Structural Attention

A Neural Attention Model for Abstractive Sentence Summarization

Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization

Abstractive Summarization Using Attentive Neural Techniques

Selective Encoding for Abstractive Sentence Summarization

Curriculum-Guided Abstractive Summarization

Long Document Summarization with Top-down and Bottom-up Inference

Attention With Sparsity Regularization for Neural Machine Translation and Summarization

Leveraging Salience Analysis and Sparse Attention for Long Document Summarization

Selective and Coverage Multi-head Attention for Abstractive Summarization

Extra Global Attention Designation Using Keyword Detection in Sparse Transformer Architectures

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

Global Encoding for Abstractive Summarization

Self-Attention Guided Copy Mechanism for Abstractive Summarization.

SparseCoder: Identifier-Aware Sparse Transformer for File-Level Code Summarization

Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers

Extract-and-Abstract: Unifying Extractive and Abstractive Summarization within Single Encoder-Decoder Framework

Neural Sequence-to-Sequence Modeling with Attention by Leveraging Deep Learning Architectures for Enhanced Contextual Understanding in Abstractive Text Summarization