Decom-CAM: Tell Me What You See, in Details! Feature-Level Interpretation Via Decomposition Class Activation Map

Yuguang Yang, Runtang Guo, Sheng Wu, Yimi Wang, Juan Zhang, Xuan Gong, Baochang Zhang
2023-01-01
Abstract: Interpretation of deep learning remains a very challenging problem. Although the Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location, it fails to provide insight into the salient features used by the model to make decisions. Furthermore, existing evaluation protocols often overlook the correlation between interpretability performance and the model's decision quality, which presents a more fundamental issue. This paper proposes a new two-stage interpretability method called the Decomposition Class Activation Map (Decom-CAM), which offers a feature-level interpretation of the model's prediction. Decom-CAM decomposes intermediate activation maps into orthogonal features using singular value decomposition and generates saliency maps by integrating them. The orthogonality of features enables CAM to capture local features and can be used to pinpoint semantic components such as eyes, noses, and faces in the input image, making it more beneficial for deep model interpretation. To ensure a comprehensive comparison, we introduce a new evaluation protocol by dividing the dataset into subsets based on classification accuracy results and evaluating the interpretability performance on each subset separately. Our experiments demonstrate that the proposed Decom-CAM outperforms current state-of-the-art methods significantly by generating more precise saliency maps across all levels of classification accuracy. Combined with our feature-level interpretability approach, this paper could pave the way for a new direction for understanding the decision-making process of deep neural networks.
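
The decomposition step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `decompose_activations` and `integrate`, the number of components `k`, the sign convention, and the use of singular values as integration weights are all assumptions made here for illustration, since the abstract does not specify how the orthogonal features are weighted.

```python
import numpy as np

def decompose_activations(acts, k=3):
    """Split (C, H, W) activation maps into k orthogonal spatial components.

    Hypothetical helper: the abstract only states that SVD is used.
    """
    C, H, W = acts.shape
    A = acts.reshape(C, H * W)  # each row is one flattened channel
    # Thin SVD: A = U @ diag(S) @ Vt; rows of Vt are orthonormal spatial patterns.
    U, S, Vt = np.linalg.svd(A, full_matrices=False)
    comps = Vt[:k].copy()
    # SVD signs are arbitrary. Assumed convention: make the largest-magnitude
    # entry of every component positive.
    for i in range(k):
        j = np.argmax(np.abs(comps[i]))
        if comps[i, j] < 0:
            comps[i] = -comps[i]
    return comps.reshape(k, H, W), S[:k]

def integrate(comps, weights):
    """Weighted sum of component maps, followed by ReLU and max-normalization.

    The weights are an assumption; the paper's integration stage may differ.
    """
    sal = np.tensordot(weights, comps, axes=1)  # (k,) x (k, H, W) -> (H, W)
    sal = np.maximum(sal, 0.0)
    m = sal.max()
    return sal / m if m > 0 else sal

# Toy usage with random non-negative "activations" (8 channels, 7x7 grid).
rng = np.random.default_rng(0)
acts = rng.random((8, 7, 7))
comps, svals = decompose_activations(acts, k=3)
saliency = integrate(comps, svals)
```

Each component map can be inspected on its own as a candidate local feature, and the flattened components are mutually orthonormal by construction of the SVD.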