Transparent Projection Networks for Interpretable Image Recognition

Zhao Yang, Chao Zhang, Chunlin Chen, Huaxiong Li
DOI: https://doi.org/10.1109/ijcnn60899.2024.10650509
2024-01-01
Abstract: Deep convolutional neural networks (CNNs) are black boxes, which makes it difficult to explain their decision mechanism in terms of human-understandable semantics. In this paper, we present a novel network architecture called Transparent Projection Networks (ProNets) for interpretable image recognition, which constructs a meaningful latent space through transparent projection learning. Specifically, we exploit the expressiveness of neural networks to learn a group of input-dependent pixel-level weights that project input images into the latent space, yielding a transparent weighted connection between raw pixel-level features and high-level latent representations. We organize our layers following ResNets with suitable modifications: we develop new block designs to learn the projection weights and use shortcuts to enable flexible linear projection every few layers. Our model inherits the properties of linear models, decomposing the output into a linear combination of contributions from each input feature, and can therefore deliver transparent interpretations of the reasoning process with high visual quality. In experiments on several benchmark datasets, ProNets achieve accuracy competitive with classic CNNs such as VGG and ResNets while demonstrating a remarkably high level of interpretability.
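The decomposition described in the abstract can be illustrated with a toy example. The sketch below is not the ProNets architecture: the single tanh layer that generates the weights, the matrix names `A` and `V`, and all sizes are assumptions chosen for illustration only. It shows only the general idea that when input-dependent weights project raw pixels linearly into a latent space, each class score splits exactly into one contribution per input pixel.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions, not taken from the paper).
N_PIXELS, N_LATENT, N_CLASSES = 16, 4, 3

# Hypothetical parameters: A drives a tiny weight-generating layer,
# V is a linear classification head on the latent representation.
A = rng.normal(scale=0.1, size=(N_LATENT * N_PIXELS, N_PIXELS))
V = rng.normal(size=(N_CLASSES, N_LATENT))


def projection_weights(x):
    """Return input-dependent weights W(x) with shape (N_LATENT, N_PIXELS)."""
    return np.tanh(A @ x).reshape(N_LATENT, N_PIXELS)


def forward(x):
    """Return class scores and the per-pixel contributions to each score."""
    W = projection_weights(x)       # weights are a function of the input
    z = W @ x                       # latent = weighted sum of raw pixels
    logits = V @ z                  # linear head on the latent vector
    contributions = (V @ W) * x     # shape (N_CLASSES, N_PIXELS)
    return logits, contributions


x = rng.random(N_PIXELS)            # a flattened toy "image"
logits, contributions = forward(x)

# Summing the contributions over pixels recovers each class score.
print(np.allclose(contributions.sum(axis=1), logits))
```

Reshaping one row of `contributions` back to the image grid would give a per-pixel attribution map for that class, which is the kind of interpretation the abstract refers to.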