Attribution Rollout: a New Way to Interpret Visual Transformer

Li Xu,Xin Yan,Weiyue Ding,Zechao Liu
DOI: https://doi.org/10.1007/s12652-022-04354-2
2022-01-01
Abstract:Transformer-based models are dominating the field of natural language processing and are becoming increasingly popular in the field of computer vision. However, the black box characteristics of transformers seriously hamper their application in certain fields. Prior work relies on the raw attention scores or employs heuristic propagation along with the attention graph. In this work, we propose a new way to visualize model. The method computes attention scores based on attribution and then propagates these attention scores through the layers. This propagation involves attention layers and multi-head attention mechanism. Our method extracts salient dependencies in each layer to visualize prediction results. We benchmark our method on recent visual transformer networks and demonstrate its many advantages over the existing interpretability methods. Our code is available at: https://github.com/yxheartipp/attr-rollout .
What problem does this paper attempt to address?