Abstract:We propose a technique for producing 'visual explanations' for decisions from a large class of Convolutional Neural Network (CNN)-based models, making them more transparent and explainable. Our approach—Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept (say 'dog' in a classification network or a sequence of words in captioning network) flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept. Unlike previous approaches, Grad-CAM is applicable to a wide variety of CNN model-families: (1) CNNs with fully-connected layers (e.g.VGG), (2) CNNs used for structured outputs (e.g.captioning), (3) CNNs used in tasks with multi-modal inputs (e.g.visual question answering) or reinforcement learning, all without architectural changes or re-training. We combine Grad-CAM with existing fine-grained visualizations to create a high-resolution class-discriminative visualization, Guided Grad-CAM, and apply it to image classification, image captioning, and visual question answering (VQA) models, including ResNet-based architectures. In the context of image classification models, our visualizations (a) lend insights into failure modes of these models (showing that seemingly unreasonable predictions have reasonable explanations), (b) outperform previous methods on the ILSVRC-15 weakly-supervised localization task, (c) are robust to adversarial perturbations, (d) are more faithful to the underlying model, and (e) help achieve model generalization by identifying dataset bias. For image captioning and VQA, our visualizations show that even non-attention based models learn to localize discriminative regions of input image. We devise a way to identify important neurons through Grad-CAM and combine it with neuron names (Bau et al. in Computer vision and pattern recognition, 2017) to provide textual explanations for model decisions. Finally, we design and conduct human studies to measure if Grad-CAM explanations help users establish appropriate trust in predictions from deep networks and show that Grad-CAM helps untrained users successfully discern a 'stronger' deep network from a 'weaker' one even when both make identical predictions. Our code is available at <a href="https://github.com/ramprs/grad-cam/">https://github.com/ramprs/grad-cam/</a>, along with a demo on CloudCV (Agrawal et al., in: Mobile cloud visual media computing, pp 265–290. Springer, 2015) (<a href="http://gradcam.cloudcv.org">http://gradcam.cloudcv.org</a>) and a video at <a href="http://youtu.be/COjUB9Izk6E">http://youtu.be/COjUB9Izk6E</a>.

Exclusive Feature Constrained Class Activation Mapping for Better Visual Explanation.

Statistic-CAM: A Gradient-Free Visual Explanations for Deep Convolutional Network

Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification

FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNs

CR-CAM: Generating explanations for deep neural networks by contrasting and ranking features

Grad-CAM: Why did you say that?

UnionCAM: enhancing CNN interpretability through denoising, weighted fusion, and selective high-quality class activation mapping

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

LFI-CAM: Learning Feature Importance for Better Visual Explanation

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value.

Overview of Class Activation Maps for Visualization Explainability

Grad++ScoreCAM: Enhancing Visual Explanations of Deep Convolutional Networks Using Incremented Gradient and Score- Weighted Methods

Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs

CAManim: Animating end-to-end network activation maps

Integrated Grad-CAM: Sensitivity-Aware Visual Explanation of Deep Convolutional Networks via Integrated Gradient-Based Scoring

KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA

Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

TAME: Attention Mechanism Based Feature Fusion for Generating Explanation Maps of Convolutional Neural Networks

Cluster-CAM: Cluster-weighted visual interpretation of CNNs' decision in image classification

Visual explanations with detailed spatial information for remote sensing image classification via channel saliency