Abstract:We introduce an extractive summarization system for meetings that leverages discourse structure to better identify salient information from complex multi-party discussions. Using discourse graphs to represent semantic relations between the contents of utterances in a meeting, we train a GNN-based node classification model to select the most important utterances, which are then combined to create an extractive summary. Experimental results on AMI and ICSI demonstrate that our approach surpasses existing text-based and graph-based extractive summarization systems, as measured by both classification and summarization metrics. Additionally, we conduct ablation studies on discourse structure and relation type to provide insights for future NLP applications leveraging discourse analysis theory.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to extract key information in multi - party meetings to generate concise and coherent summaries. Specifically, the author proposes a method of using discourse structure to improve extractive summaries of meetings. Traditional methods have difficulties in dealing with long conversations or multi - party conversations, especially in identifying important information. The paper constructs a discourse graph, where each node represents an Elementary Discourse Unit (EDU) and each edge represents the semantic relationship between two EDUs, and uses Graph Neural Networks (GNN) to classify these nodes, thereby selecting the most important EDUs to generate extractive summaries. ### Main Contributions 1. **Utilizing Discourse Structure**: By introducing the discourse graph to represent the semantic relationships in the meeting content, the quality of the extractive summary is improved. 2. **Application of Graph Neural Networks (GNN)**: Using a GNN - based node classification model to select the most important EDUs. This method outperforms existing text - based and graph - based extractive summary systems on the AMIS and ICSI datasets. 3. **Ablation Study**: Analyzing the influence of different discourse relationship types and graph structures on the summary quality through ablation experiments, providing insights for future natural language processing applications. ### Specific Problems Solved - **Complexity of Multi - Party Meetings**: There are problems such as spontaneous interactions, incomplete sentences, and overlapping speeches in multi - party meetings, all of which affect the quality of automatic summaries. - **Information Sparsity**: The information density of the meeting content is low, and repetition and digression are likely to occur. - **Data Scarcity**: Many meeting contents are private, resulting in a limited amount of publicly available training data. - **Limitations of Existing Methods**: Existing extractive summary methods are not effective in dealing with long conversations, while generative summary methods are prone to hallucination. ### Method Innovations - **Construction of Discourse Graph**: By constructing a discourse graph, representing each Elementary Discourse Unit (EDU) as a node in the graph and the semantic relationship as an edge, the structural information of the meeting content can be better captured. - **GNN Classifier**: Using Graph Neural Networks (GNN) to classify nodes to determine which EDUs are important, and then generate extractive summaries. - **Multi - task Learning**: By combining different graph generators (such as Deep Sequential, Knowledge Enhanced, BERT - Line, etc.), evaluate the influence of different parsers on the summary quality. ### Experimental Results - **Classification Performance**: On the AMI and ICSI datasets, both RGCN and MixHop GCN outperform other baseline methods in classification performance. - **Summary Quality**: Evaluated by metrics such as ROUGE and BERTScore, the summaries generated by RGCN and MixHop GCN perform best in terms of information content and readability. - **Ablation Study**: Different types of discourse relationships have different impacts on the summary quality. For example, in the AMI dataset, Correction, Acknowledgement, and Explanation relationships have a greater impact on classification; while in the ICSI dataset, Result, Contrast, Narration, and Explanation relationships are more important. In conclusion, this paper effectively solves the key problems in multi - party meeting extractive summaries by introducing discourse structure and Graph Neural Networks, providing new ideas and methods for the development of automatic meeting summary technology.

Leveraging Discourse Structure for Extractive Meeting Summarization

Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

Dialogue Discourse-Aware Graph Convolutional Networks for Abstractive Meeting Summarization

A Knowledge Graph-Based Abstractive Model Integrating Semantic and Structural Information for Summarizing Chinese Meetings

Generating Abstractive Summaries from Meeting Transcripts

Leveraging Graph to Improve Abstractive Multi-Document Summarization.

GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse state

Dialogue acts enhanced extract–abstract framework for meeting summarization

Abstractive Meeting Summarization by Hierarchical Adaptive Segmental Network Learning with Multiple Revising Steps

Action-Item-Driven Summarization of Long Meeting Transcripts

Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting Summarization

ESSumm: Extractive Speech Summarization from Untranscribed Meeting

Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system

Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs

A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining

Restructuring Conversations using Discourse Relations for Zero-shot Abstractive Dialogue Summarization

Discourse-Aware Unsupervised Summarization of Long Scientific Documents

SgSum: Transforming Multi-document Summarization into Sub-graph Selection

Investigating Consistency in Query-Based Meeting Summarization: A Comparative Study of Different Embedding Methods

A Two-Stage Long Text Summarization Method Based on Discourse Structure

An Integrated Graph Model For Document Summarization