On Sample Based Explanation Methods for Sequence-to-Sequence Applications

Yun Qin, Fan Zhang
DOI: https://doi.org/10.1109/ICCIA55271.2022.9828448
2022-01-01
Abstract: As deep learning models grow more complex and natural language processing (NLP) datasets grow larger, sample-based explanation methods are challenged in terms of interpretability, faithfulness, and related properties. In this work, we select representative sequence-to-sequence applications that require high interpretability, based on the needs of people who want to understand model behavior, and propose a matching influence function, TracInS. We then design an enhancement of TracInS that uses arbitrary spans as fine-grained explanation units to achieve interpretability for sequence-to-sequence applications on valid datasets. Finally, we design targeted evaluations of the influence function, namely semantic-based and retraining-based evaluation methods, to verify that the influence function is effective. The improved experimental results also show that the enhancement is more interpretable.