Beyond Saliency: Understanding Convolutional Neural Networks from Saliency Prediction on Layer-Wise Relevance Propagation
Heyi Li,Yunke Tian,Klaus Mueller,Xin Chen
DOI: https://doi.org/10.1016/j.imavis.2019.02.005
IF: 3.86
2019-01-01
Image and Vision Computing
Abstract:Despite the tremendous achievements of deep convolutional neural networks(CNNs) in many computer vision tasks, understanding how they actually workremains a significant challenge. In this paper, we propose a novel two-stepunderstanding method, namely Salient Relevance (SR) map, which aims to shedlight on how deep CNNs recognize images and learn features from areas, referredto as attention areas, therein. Our proposed method starts out with alayer-wise relevance propagation (LRP) step which estimates a pixel-wiserelevance map over the input image. Following, we construct a context-awaresaliency map, SR map, from the LRP-generated map which predicts areas close tothe foci of attention instead of isolated pixels that LRP reveals. In humanvisual system, information of regions is more important than of pixels inrecognition. Consequently, our proposed approach closely simulates humanrecognition. Experimental results using the ILSVRC2012 validation dataset inconjunction with two well-established deep CNN models, AlexNet and VGG-16,clearly demonstrate that our proposed approach concisely identifies not onlykey pixels but also attention areas that contribute to the underlying neuralnetwork's comprehension of the given images. As such, our proposed SR mapconstitutes a convenient visual interface which unveils the visual attention ofthe network and reveals which type of objects the model has learned torecognize after training. The source code is available athttps://github.com/Hey1Li/Salient-Relevance-Propagation.