Abstract:Extractive document summarization is a fundamental task in natural language processing (NLP). Recently, several Graph Neural Networks (GNNs) are proposed for this task. However, most existing GNN-based models can neither effectively encode semantic nodes of multiple granularity level apart from sentences nor substantially capture different cross-sentence meta-paths. To address these issues, we propose MHgatSum, a novel Multi-granularity Heterogeneous Graph ATtention networks for extractive document SUMmarization. Specifically, we first build a multi-granularity heterogeneous graph (HetG) for each document, which is better to represent the semantic meaning of the document. The HetG contains not only sentence nodes but also multiple other granularity effective semantic units with different semantic levels, including keyphrases and topics. These additional nodes act as the intermediary between sentences to build the meta-paths involved in sentence node (i.e., Sentence-Keyphrase-Sentence and Sentence-Topic-Sentence). Then, we propose a heterogeneous graph attention networks to embed the constructed HetG for extractive summarization, which enjoys multi-granularity semantic representations. The model is based on a hierarchical attention mechanism, including node-level and semantic-level attentions. The node-level attention can learn the importance between a node and its meta-path based neighbors, while the semantic-level attention is able to learn the importance of different meta-paths. Moreover, to better integrate sentence global knowledge, we further incorporate sentence node global importance in local node-level attention. We conduct empirical experiments on two benchmark datasets, which demonstrates the superiority of MHgatSum over previous SOTA models on the task of extractive summarization.

GRAFT: Graph Retrieval Augmented Fine Tuning for Multi-Hop Query Summarization

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

Exploring simultaneous keyword and key sentence extraction: improve graph-based ranking using wikipedia.

Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs

GRAG: Graph Retrieval-Augmented Generation

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Graph Retrieval-Augmented Generation: A Survey

Nonfactoid Question Answering as Query-Focused Summarization With Graph-Enhanced Multihop Inference

Multi-granularity heterogeneous graph attention networks for extractive document summarization

Leveraging Graph to Improve Abstractive Multi-Document Summarization.

Meta Knowledge for Retrieval Augmented Large Language Models

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs

GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

A Multi-Granularity Heterogeneous Graph for Extractive Text Summarization

An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought

RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Towards Multi-Source Retrieval-Augmented Generation via Synergizing Reasoning and Preference-Driven Retrieval

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting