Common-Sense Bias Discovery and Mitigation for Classification Tasks

Miao Zhang,Zee fryer,Ben Colman,Ali Shahriyari,Gaurav Bharaj
2024-02-08
Abstract:Machine learning model bias can arise from dataset composition: sensitive features correlated to the learning target disturb the model decision rule and lead to performance differences along the features. Existing de-biasing work captures prominent and delicate image features which are traceable in model latent space, like colors of digits or background of animals. However, using the latent space is not sufficient to understand all dataset feature correlations. In this work, we propose a framework to extract feature clusters in a dataset based on image descriptions, allowing us to capture both subtle and coarse features of the images. The feature co-occurrence pattern is formulated and correlation is measured, utilizing a human-in-the-loop for examination. The analyzed features and correlations are human-interpretable, so we name the method Common-Sense Bias Discovery (CSBD). Having exposed sensitive correlations in a dataset, we demonstrate that downstream model bias can be mitigated by adjusting image sampling weights, without requiring a sensitive group label supervision. Experiments show that our method discovers novel biases on multiple classification tasks for two benchmark image datasets, and the intervention outperforms state-of-the-art unsupervised bias mitigation methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issue that the composition of datasets in machine learning models may lead to model bias. Specifically, when sensitive features are related to the learning objective, these features can interfere with the model's decision rules, resulting in performance differences between different features. Existing debiasing work mainly focuses on capturing salient features in images, such as the color of digits or the background of animals, but relying solely on the model's latent space is insufficient to understand the correlations of all dataset features. To tackle this challenge, the paper proposes a method based on extracting feature clusters from image descriptions to capture both fine and coarse features of images. By analyzing feature co-occurrence patterns and measuring correlations, human involvement is used for inspection, making the analyzed features and correlations interpretable. This method is called Common-Sense Bias Discovery (CSBD). The paper demonstrates that by adjusting image sampling weights, it is possible to mitigate bias in downstream models without the need for sensitive group label supervision. Experimental results show that this method discovers new biases in multiple classification tasks and that the intervention effect is superior to existing unsupervised bias mitigation methods.