Rationally Inattentive Utility Maximization for Interpretable Deep Image Classification

Kunal Pattanayak, Vikram Krishnamurthy
DOI: https://doi.org/10.48550/arXiv.2102.04594
2021-07-31
Abstract: Are deep convolutional neural networks (CNNs) for image classification explainable by utility maximization with information acquisition costs? We demonstrate that deep CNNs behave equivalently (in terms of necessary and sufficient conditions) to rationally inattentive utility maximizers, a generative model used extensively in economics for human decision making. Our claim is supported by extensive experiments on 200 deep CNNs from 5 popular architectures. The parameters of our interpretable model are computed efficiently via convex feasibility algorithms. As an application, we show that our economics-based interpretable model can predict the classification performance of deep CNNs trained with arbitrary parameters with accuracy exceeding 94%. This eliminates the need to re-train the deep CNNs for image classification. The theoretical foundation of our approach lies in Bayesian revealed preference studied in micro-economics. All our results are on GitHub and completely reproducible.
Machine Learning, Theoretical Economics
### What problem does this paper attempt to solve?

This paper asks whether the decisions of deep convolutional neural networks (CNNs) in image classification can be explained by a utility-maximization model with information acquisition costs. Specifically, the authors test whether the behavior of deep CNNs is equivalent to the decision-making behavior of rationally inattentive utility maximizers, a model widely used in economics to describe human decision-making.

#### Main problems

1. **Interpretability of deep CNNs**:
   - The core question of the paper is: can the decisions of deep CNNs in image classification be explained by rationally inattentive Bayesian utility maximizers?
   - Through extensive experiments, the authors show that the behavior of deep CNNs satisfies the necessary and sufficient conditions for rationally inattentive utility maximization, and that these conditions are satisfied to a high degree, which supports this explanation.
2. **Prediction performance**:
   - The second question is: how can the economics-based interpretable model be used to predict the classification performance of deep CNNs?
   - The authors show that their interpretable model can predict the classification performance of deep CNNs trained with arbitrary parameters with an accuracy of more than 94%, which avoids the need to retrain the deep CNNs.

#### Methods and results

- **Theoretical basis**: The paper builds on Bayesian revealed preference theory, a classical method in microeconomics for studying human decision-making.
- **Experimental verification**: Extensive experiments on 200 deep CNNs from 5 popular architectures verify the validity of the proposed theoretical model.
- **Application demonstration**: Predicting the classification performance under different noise levels demonstrates the practical value of the model.
#### Formula representation

The key formulas involved in the paper include:

1. **Posterior probability calculation**:
   \[ p(x|y)=\frac{\pi_0(x)\alpha_k(y|x)}{\sum_{x'\in X}\pi_0(x')\alpha_k(y|x')} \]
   where \(\pi_0(x)\) is the prior probability distribution and \(\alpha_k(y|x)\) is the attention strategy.
2. **Expected utility maximization**:
   \[ a\in\arg\max_{a'\in A}\mathbb{E}_x\{u_k(x,a')|y\}, \qquad \mathbb{E}_x\{u_k(x,a')|y\}=\sum_{x\in X}p(x|y)u_k(x,a') \]
   where \(u_k(x,a)\) is the utility of taking action \(a\) when the true state is \(x\).
3. **Attention strategy optimization**:
   \[ \alpha_k\in\arg\max_{\alpha'}\mathbb{E}_y\left\{\max_{a\in A}\mathbb{E}_x\{u_k(x,a)|y\}\right\}-C(\alpha',\pi_0) \]
   where the expectations are taken under the candidate attention strategy \(\alpha'\) and \(C(\alpha',\pi_0)\) is the cost of acquiring information with that strategy.

Through these formulas, the authors establish the equivalence between deep CNNs and rationally inattentive utility maximizers and support it experimentally.

### Summary

The main contribution of this paper is an economics-based interpretable model that explains the decision-making behavior of deep CNNs in image classification and shows practical value in predicting CNN performance.
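
To make the three formulas in the formula representation section concrete, here is a minimal numerical sketch in Python. It is an illustration under stated assumptions, not the authors' implementation: the function names, the two-state toy problem, and the `cost_weight` parameter are hypothetical, and the cost \(C(\alpha',\pi_0)\) is assumed to be a multiple of the mutual information between state and observation, a common choice in the rational inattention literature that the summary above does not specify.

```python
import numpy as np


def posterior(prior, alpha):
    """Bayes rule: p(x|y) from prior pi_0(x) and attention strategy alpha(y|x).

    prior: shape (X,); alpha: shape (X, Y) with rows summing to 1.
    Assumes every observation y has positive probability.
    Returns (post, p_y) where post[x, y] = p(x|y) and p_y[y] = p(y).
    """
    joint = prior[:, None] * alpha      # pi_0(x) * alpha(y|x)
    p_y = joint.sum(axis=0)             # denominator of the posterior formula
    return joint / p_y, p_y


def best_actions(post, utility):
    """For each observation y, maximize sum_x p(x|y) u(x, a) over actions a.

    utility: shape (X, A). Returns (argmax action per y, max value per y).
    """
    expected = post.T @ utility         # shape (Y, A)
    return expected.argmax(axis=1), expected.max(axis=1)


def mutual_information(prior, alpha):
    """Mutual information I(x; y) in nats (ASSUMED form of the cost C)."""
    joint = prior[:, None] * alpha
    p_y = joint.sum(axis=0)
    indep = prior[:, None] * p_y[None, :]
    mask = joint > 0                    # skip zero-probability cells
    return float(np.sum(joint[mask] * np.log(joint[mask] / indep[mask])))


def net_utility(prior, alpha, utility, cost_weight=1.0):
    """Objective of the attention problem: expected best utility minus cost."""
    post, p_y = posterior(prior, alpha)
    _, values = best_actions(post, utility)
    return float(p_y @ values) - cost_weight * mutual_information(prior, alpha)


# Hypothetical toy problem: 2 states, 2 observations, 2 actions.
prior = np.array([0.5, 0.5])
alpha = np.array([[0.8, 0.2],
                  [0.2, 0.8]])          # alpha[x, y] = alpha(y|x)
utility = np.eye(2)                     # utility 1 when the action matches the state

post, p_y = posterior(prior, alpha)
actions, values = best_actions(post, utility)
gross = float(p_y @ values)             # expected utility before the cost
net = net_utility(prior, alpha, utility)
```

In this sketch, comparing `net_utility` across candidate matrices `alpha` mirrors the outer maximization in formula 3: a more informative strategy raises the expected utility but also raises the assumed information cost.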