Fairness of Extractive Text Summarization

Anurag Shandilya,Kripabandhu Ghosh,Saptarshi Ghosh
DOI: https://doi.org/10.1145/3184558.3186947
2018-01-01
Abstract:We propose to evaluate extractive summarization algorithms from a completely new perspective. Considering that an extractive summarization algorithm selects a subset of the textual units in the input data for inclusion in the summary, we investigate whether this selection is fair. We use several summarization algorithms over datasets that have a sensitive attribute (e.g., gender, political leaning) associated with the textual units, and find that the generated summaries often have very different distributions of the said attribute. Specifically, some classes of the textual units are under-represented in the summaries according to the fairness notion of adverse impact. To our knowledge, this is the first work on fairness of summarization, and is likely to open up interesting research problems.
What problem does this paper attempt to address?