Abstract: Are deep convolutional neural networks (CNNs) for image classification explainable by utility maximization with information acquisition costs? We demonstrate that deep CNNs behave equivalently (in terms of necessary and sufficient conditions) to rationally inattentive utility maximizers, a generative model used extensively in economics for human decision making. Our claim is based on extensive experiments on 200 deep CNNs from 5 popular architectures. The parameters of our interpretable model are computed efficiently via convex feasibility algorithms. As an application, we show that our economics-based interpretable model can predict the classification performance of deep CNNs trained with arbitrary parameters with accuracy exceeding 94%. This eliminates the need to re-train the deep CNNs for image classification. The theoretical foundation of our approach lies in Bayesian revealed preference studied in micro-economics. All our results are on GitHub and completely reproducible.

What problem does this paper attempt to address?

### What problem does this paper attempt to solve?

This paper explores whether the decisions of deep convolutional neural networks (CNNs) in image classification can be explained by a utility-maximization model with information acquisition costs. Specifically, the authors test whether the behavior of deep CNNs is equivalent to the decision-making behavior of rationally inattentive utility maximizers, a model widely used in economics to describe human decision-making.

#### Main problems

1. **Interpretability of deep CNNs**:
   - The core question of the paper is: can the decisions of deep CNNs in image classification be explained by rationally inattentive Bayesian utility maximizers?
   - Through extensive experiments, the authors show that the behavior of deep CNNs satisfies the necessary and sufficient conditions for rationally inattentive utility maximization, and that these conditions are met to a high degree, which supports the validity of this explanation.
2. **Prediction performance**:
   - The second question is: how can the economics-based interpretable model be used to predict the classification performance of deep CNNs?
   - The authors show that their interpretable model can predict the classification performance of deep CNNs trained with arbitrary parameters with an accuracy of more than 94%, which avoids the need to retrain the deep CNNs.

#### Methods and results

- **Theoretical basis**: The paper builds on Bayesian revealed preference theory, an approach from microeconomics for studying human decision-making.
- **Experimental verification**: Through extensive experiments on 200 deep CNNs from 5 popular architectures, the authors verify the validity of the proposed theoretical model.
- **Application demonstration**: The practical value of the model is demonstrated by predicting classification performance under different noise levels.
#### Formula representation

The key formulas involved in the paper include:

1. **Posterior probability calculation**:
   \[ p(x|y)=\frac{\pi_0(x)\alpha_k(y|x)}{\sum_{x'\in X}\pi_0(x')\alpha_k(y|x')} \]
   where \(\pi_0(x)\) is the prior probability distribution and \(\alpha_k(y|x)\) is the attention strategy.
2. **Expected utility maximization**:
   \[ a\in\arg\max_{a'\in A}\mathbb{E}_x\{u_k(x,a')|y\},\qquad \mathbb{E}_x\{u_k(x,a')|y\}=\sum_{x\in X}p(x|y)u_k(x,a') \]
3. **Attention strategy optimization**:
   \[ \alpha_k\in\arg\max_{\alpha'}\left[\mathbb{E}_y\left\{\max_{a\in A}\mathbb{E}_x\{u_k(x,a)|y\}\right\}-C(\alpha',\pi_0)\right] \]
   where \(C(\alpha',\pi_0)\) is the information acquisition cost of the attention strategy \(\alpha'\), and the expectation over \(y\) is taken under \(\alpha'\).

Through these formulas, the authors establish the equivalence between deep CNNs and rationally inattentive utility maximizers and support it through experiments.

### Summary

The main contribution of this paper is an economics-based interpretable model that explains the decision-making behavior of deep CNNs in image classification and has practical value for predicting CNN performance.
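
The three formulas listed in the summary above can be illustrated with a small numerical sketch. This is not the authors' code: the function names, the list-of-lists layout (`attention[x][y]`, `utility[x][a]`), and the toy numbers are assumptions made for illustration, and the information cost \(C(\alpha',\pi_0)\) is passed in as a plain number because the summary does not specify its form.

```python
from typing import List, Tuple


def posterior(prior: List[float], attention: List[List[float]], y: int) -> List[float]:
    """Bayes' rule: p(x|y) = pi_0(x) * alpha(y|x) / sum_x' pi_0(x') * alpha(y|x').

    attention[x][y] holds alpha(y|x) (hypothetical layout).
    """
    joint = [prior[x] * attention[x][y] for x in range(len(prior))]
    total = sum(joint)
    if total == 0:
        raise ValueError("observation y has zero probability under the prior")
    return [j / total for j in joint]


def best_action(post: List[float], utility: List[List[float]]) -> Tuple[int, List[float]]:
    """Return argmax_a sum_x p(x|y) * u(x, a) and the expected utility of each action.

    utility[x][a] holds u(x, a) (hypothetical layout).
    """
    n_actions = len(utility[0])
    expected = [
        sum(post[x] * utility[x][a] for x in range(len(post)))
        for a in range(n_actions)
    ]
    best = max(range(n_actions), key=lambda a: expected[a])
    return best, expected


def attention_objective(
    prior: List[float],
    attention: List[List[float]],
    utility: List[List[float]],
    cost: float,
) -> float:
    """Evaluate E_y{ max_a E_x{u(x, a) | y} } - cost for one candidate attention strategy.

    `cost` stands in for C(alpha', pi_0); its functional form is an assumption left to the caller.
    """
    value = 0.0
    for y in range(len(attention[0])):
        # Marginal probability of observing y under the prior and this strategy.
        p_y = sum(prior[x] * attention[x][y] for x in range(len(prior)))
        if p_y == 0:
            continue
        _, expected = best_action(posterior(prior, attention, y), utility)
        value += p_y * max(expected)
    return value - cost


# Toy example: two states, two observations, reward 1 for a correct label.
prior = [0.5, 0.5]
attention = [[0.8, 0.2], [0.2, 0.8]]
utility = [[1.0, 0.0], [0.0, 1.0]]

post = posterior(prior, attention, y=0)
action, _ = best_action(post, utility)
print(post, action, attention_objective(prior, attention, utility, cost=0.1))
```

Comparing `attention_objective` across several candidate attention strategies, each with its own cost, mimics the outer maximization in formula 3; the paper itself recovers the model parameters through convex feasibility algorithms, which this sketch does not attempt.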

Rationally Inattentive Utility Maximization for Interpretable Deep Image Classification

A Pixel-Level Explainable Approach of Convolutional Neural Networks and Its Application

Deeper Interpretability of Deep Networks

Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

E Pluribus Unum Interpretable Convolutional Neural Networks

Interpreting Deep Neural Networks Through Variable Importance

A Survey of the Interpretability Aspect of Deep Learning Models

A Test Statistic Estimation-based Approach for Establishing Self-interpretable CNN-based Binary Classifiers

Multicriteria interpretability driven deep learning

Feature CAM: Interpretable AI in Image Classification

Interpretable Deep Learning Models: Enhancing Transparency and Trustworthiness in Explainable AI

PICNN: A Pathway towards Interpretable Convolutional Neural Networks

Feature-Based Interpretation of Image Classification With the Use of Convolutional Neural Networks

Interpretability of deep learning models: A survey of results

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Explaining Deep Convolutional Neural Networks for Image Classification by Evolving Local Interpretable Model-agnostic Explanations

Interpretable Deep Convolutional Neural Networks via Meta-learning

Solving the enigma: Deriving optimal explanations of deep networks

Learning Bottleneck Concepts in Image Classification

Minimalistic Explanations: Capturing the Essence of Decisions