Explainable News Summarization -- Analysis and mitigation of Disagreement Problem

Seema Aswani, Sujala D. Shetty
2024-10-24
Abstract: Explainable AI (XAI) techniques for text summarization provide valuable understanding of how the summaries are generated. Recent studies have highlighted a major challenge in this area, known as the disagreement problem. This problem occurs when different XAI methods offer contradictory explanations for the summary generated from the same input article. This inconsistency across XAI methods has been evaluated using predefined metrics designed to quantify agreement levels between them, revealing significant disagreement. This impedes the reliability and interpretability of XAI in this area. To address this challenge, we propose a novel approach that utilizes sentence transformers and the k-means clustering algorithm to first segment the input article and then generate the explanation of the summary generated for each segment. By producing regional or segmented explanations rather than comprehensive ones, a decrease in the observed disagreement between XAI methods is hypothesized. This segmentation-based approach was used on two news summarization datasets, namely Extreme Summarization (XSum) and CNN-DailyMail, and the experiment was conducted using multiple disagreement metrics. Our experiments validate the hypothesis by showing a significant reduction in disagreement among different XAI methods. Additionally, an easy-to-use JavaScript visualization tool is developed that allows users to interactively explore a color-coded visualization of the input article and the machine-generated summary based on the attribution score of each sentence.
Artificial Intelligence
What problem does this paper attempt to address?
The core problem this paper addresses is the "disagreement problem" of Explainable AI (XAI) techniques in text summarization. When different XAI methods give contradictory explanations for the summary generated from the same input article, the inconsistency undermines the reliability and interpretability of those methods.

### Problem Background

Deep-learning models such as BART and PEGASUS are widely used for text summarization and can generate high-quality summaries, but their black-box nature makes the model's decision-making process hard to understand. XAI techniques aim to address this by providing transparent, interpretable explanations that help users understand why certain words or sentences are included in or excluded from the summary. However, recent research shows that different XAI methods produce significantly different explanations for the same input. This reduces the reliability of the methods and makes it hard for users to trust the explanations, so reducing these differences is the problem the paper takes on.

### Paper Objectives

The paper sets out four objectives:

1. **Quantify the degree of disagreement between XAI methods**: Use predefined metrics (such as feature consistency based on top-k features, ranking consistency, and the Spearman rank correlation coefficient) to evaluate the level of disagreement between different XAI methods.
2. **Explore whether segmented explanations can reduce disagreement**: Test the hypothesis that dividing the input article into semantically coherent segments and generating explanations for each segment reduces the disagreement between different XAI methods.
3. **Verify the effectiveness of the segmentation method**: Verify experimentally whether the segmentation method based on k-means clustering improves the consistency and reliability of XAI explanations.
4. **Develop an interactive visualization tool**: Provide an easy-to-use JavaScript tool that lets users intuitively explore and understand the results of XAI explanations.

### Method Overview

The paper follows a two-stage methodology:

- **Stage A: Disagreement Analysis**: Apply multiple XAI methods (such as LIME, Gradient SHAP, Attention, and DeepLIFT) to generate explanations, and use predefined disagreement metrics to evaluate the consistency between these explanations.
- **Stage B: Mitigation through segmented explanations**: Introduce a segmentation method based on sentence embeddings and k-means clustering, divide the input article into multiple segments, and then generate explanations for each segment to reduce the disagreement between different XAI methods.

Through these two stages, the paper verifies the effectiveness of the segmented explanation method and develops an interactive visualization tool, enhancing the interpretability of XAI-driven text summarization.
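The disagreement metrics named in Stage A can be illustrated with a small sketch. The definitions below follow a common formulation of top-k feature agreement, rank agreement, and Spearman rank correlation; they are assumptions for illustration and may differ from the exact definitions used in the paper. The function names and the two score lists are hypothetical, with each list holding one attribution score per input sentence.

```python
from typing import List, Sequence


def top_k(scores: Sequence[float], k: int) -> List[int]:
    """Indices of the k scores with the largest magnitude, most important first."""
    order = sorted(range(len(scores)), key=lambda i: (-abs(scores[i]), i))
    return order[:k]


def feature_agreement(a: Sequence[float], b: Sequence[float], k: int) -> float:
    """Fraction of top-k sentences shared by both explanations (order ignored)."""
    return len(set(top_k(a, k)) & set(top_k(b, k))) / k


def rank_agreement(a: Sequence[float], b: Sequence[float], k: int) -> float:
    """Fraction of top-k positions where both explanations pick the same sentence."""
    return sum(x == y for x, y in zip(top_k(a, k), top_k(b, k))) / k


def ranks(scores: Sequence[float]) -> List[float]:
    """Rank 1 is the largest score; tied scores share their average rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    result = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for m in range(i, j + 1):
            result[order[m]] = average
        i = j + 1
    return result


def spearman(a: Sequence[float], b: Sequence[float]) -> float:
    """Spearman correlation, computed as the Pearson correlation of the ranks."""
    ra, rb = ranks(a), ranks(b)
    n = len(ra)
    mean_a, mean_b = sum(ra) / n, sum(rb) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(ra, rb))
    var_a = sum((x - mean_a) ** 2 for x in ra)
    var_b = sum((y - mean_b) ** 2 for y in rb)
    if var_a == 0 or var_b == 0:
        raise ValueError("Spearman correlation is undefined for constant scores")
    return cov / (var_a * var_b) ** 0.5


# Hypothetical sentence-level attributions from two XAI methods (made-up values).
method_a = [0.40, 0.10, 0.30, 0.05, 0.15]
method_b = [0.35, 0.20, 0.05, 0.10, 0.30]

print(round(feature_agreement(method_a, method_b, k=3), 3))  # 0.667
print(round(rank_agreement(method_a, method_b, k=3), 3))     # 0.333
print(round(spearman(method_a, method_b), 3))                # 0.4
```

In this toy case the two methods share two of their three most important sentences but agree on the position of only one, which shows why the paper reports several metrics rather than a single score. Under the segmentation approach of Stage B, the same functions would presumably be applied to the scores within each segment instead of the whole article.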