Leveraging Discourse Structure for Extractive Meeting Summarization

Virgile Rennard,Guokan Shang,Michalis Vazirgiannis,Julie Hunter
2024-09-23
Abstract:We introduce an extractive summarization system for meetings that leverages discourse structure to better identify salient information from complex multi-party discussions. Using discourse graphs to represent semantic relations between the contents of utterances in a meeting, we train a GNN-based node classification model to select the most important utterances, which are then combined to create an extractive summary. Experimental results on AMI and ICSI demonstrate that our approach surpasses existing text-based and graph-based extractive summarization systems, as measured by both classification and summarization metrics. Additionally, we conduct ablation studies on discourse structure and relation type to provide insights for future NLP applications leveraging discourse analysis theory.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to extract key information in multi - party meetings to generate concise and coherent summaries. Specifically, the author proposes a method of using discourse structure to improve extractive summaries of meetings. Traditional methods have difficulties in dealing with long conversations or multi - party conversations, especially in identifying important information. The paper constructs a discourse graph, where each node represents an Elementary Discourse Unit (EDU) and each edge represents the semantic relationship between two EDUs, and uses Graph Neural Networks (GNN) to classify these nodes, thereby selecting the most important EDUs to generate extractive summaries. ### Main Contributions 1. **Utilizing Discourse Structure**: By introducing the discourse graph to represent the semantic relationships in the meeting content, the quality of the extractive summary is improved. 2. **Application of Graph Neural Networks (GNN)**: Using a GNN - based node classification model to select the most important EDUs. This method outperforms existing text - based and graph - based extractive summary systems on the AMIS and ICSI datasets. 3. **Ablation Study**: Analyzing the influence of different discourse relationship types and graph structures on the summary quality through ablation experiments, providing insights for future natural language processing applications. ### Specific Problems Solved - **Complexity of Multi - Party Meetings**: There are problems such as spontaneous interactions, incomplete sentences, and overlapping speeches in multi - party meetings, all of which affect the quality of automatic summaries. - **Information Sparsity**: The information density of the meeting content is low, and repetition and digression are likely to occur. - **Data Scarcity**: Many meeting contents are private, resulting in a limited amount of publicly available training data. - **Limitations of Existing Methods**: Existing extractive summary methods are not effective in dealing with long conversations, while generative summary methods are prone to hallucination. ### Method Innovations - **Construction of Discourse Graph**: By constructing a discourse graph, representing each Elementary Discourse Unit (EDU) as a node in the graph and the semantic relationship as an edge, the structural information of the meeting content can be better captured. - **GNN Classifier**: Using Graph Neural Networks (GNN) to classify nodes to determine which EDUs are important, and then generate extractive summaries. - **Multi - task Learning**: By combining different graph generators (such as Deep Sequential, Knowledge Enhanced, BERT - Line, etc.), evaluate the influence of different parsers on the summary quality. ### Experimental Results - **Classification Performance**: On the AMI and ICSI datasets, both RGCN and MixHop GCN outperform other baseline methods in classification performance. - **Summary Quality**: Evaluated by metrics such as ROUGE and BERTScore, the summaries generated by RGCN and MixHop GCN perform best in terms of information content and readability. - **Ablation Study**: Different types of discourse relationships have different impacts on the summary quality. For example, in the AMI dataset, Correction, Acknowledgement, and Explanation relationships have a greater impact on classification; while in the ICSI dataset, Result, Contrast, Narration, and Explanation relationships are more important. In conclusion, this paper effectively solves the key problems in multi - party meeting extractive summaries by introducing discourse structure and Graph Neural Networks, providing new ideas and methods for the development of automatic meeting summary technology.