P-Shapley: Shapley Values on Probabilistic Classifiers

Haocheng Xia,Xiang Li,Junyuan Pang,Jinfei Liu,Kui Ren,Li Xiong
DOI: https://doi.org/10.14778/3654621.3654638
2024-01-01
Abstract:The Shapley value provides a unique approach to equitably gauge each player's contribution within a coalition and has extensive applications with various utility functions. In data valuation for machine learning, particularly for classification tasks, using classification accuracy as the utility function has become a de facto standard. However, accuracy can be an imprecise metric, potentially missing finer details crucial for valuation. In this paper, we propose the probability-based Shapley (P-Shapley) value, which leverages predicted probabilities to heighten utility differentiation. Several convex calibration functions are further incorporated for probability calibration. We prove that the P-Shapley value outperforms Shapley values based on accuracy or other coarse metrics in approximation stability and the discrimination of marginal utility change can be further improved by convex calibration functions. Extensive experiments on four real-world datasets demonstrate the effectiveness of our approaches.
What problem does this paper attempt to address?