Explainable Authorship Identification in Cultural Heritage Applications

Mattia Setzu, Silvia Corbara, Anna Monreale, Alejandro Moreo, Fabrizio Sebastiani
DOI: https://doi.org/10.1145/3654675
2024-04-21
Journal on Computing and Cultural Heritage
Abstract: While a substantial amount of work has recently been devoted to improving the accuracy of computational Authorship Identification (AId) systems for textual data, little to no attention has been paid to endowing AId systems with the ability to explain the reasons behind their predictions. This substantially hinders the practical application of AId methods, since the predictions returned by such systems are hardly useful unless they are supported by suitable explanations. In this paper, we explore the applicability of existing general-purpose eXplainable Artificial Intelligence (XAI) techniques to AId, with a focus on explanations addressed to scholars working in cultural heritage. In particular, we assess the relative merits of three different types of XAI techniques (feature ranking, probing, factual and counterfactual selection) on three different AId tasks (authorship attribution, authorship verification, same-authorship verification) by running experiments on real AId textual data. Our analysis shows that, while these techniques make important first steps towards explainable Authorship Identification, more work remains to be done in order to provide tools that can be profitably integrated in the workflows of scholars.
computer science, interdisciplinary applications