Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Donggeun Ko,Dongjun Lee,Namjun Park,Wonkyeong Shim,Jaekwang Kim

2024-11-25

Abstract:Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilizes Generative Adversarial Networks (GANs) to mitigate biases. We introduce DiffuBias, a novel pipeline for text-to-image generation that enhances classifier robustness by generating bias-conflict samples, without requiring training during the generation phase. Utilizing pretrained diffusion and image captioning models, DiffuBias generates images that challenge the biases of classifiers, using the top-$K$ losses from a biased classifier ($f_B$) to create more representative data samples. This method not only debiases effectively but also boosts classifier generalization capabilities. To the best of our knowledge, DiffuBias is the first approach leveraging a stable diffusion model to generate bias-conflict samples in debiasing tasks. Our comprehensive experimental evaluations demonstrate that DiffuBias achieves state-of-the-art performance on benchmark datasets. We also conduct a comparative analysis of various generative models in terms of carbon emissions and energy consumption to highlight the significance of computational efficiency.

Computer Vision and Pattern Recognition,Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in image classification tasks, when neural networks learn the biases in the data set, these biases will lead to a decline in the generalization ability and performance of the model. Specifically, the paper focuses on how to reduce the over - dependence of the model on specific attributes or patterns in the data set (such as object shape, color, texture, etc.) by generating bias - conflict samples, thereby improving the robustness and generalization ability of the model. The paper proposes a new method named DiffuBias. This method uses pre - trained diffusion models and image captioning models to generate bias - conflict samples without additional training during the generation stage. This method can not only effectively reduce biases but also enhance the generalization ability of the classifier. By using pre - trained models, DiffuBias can generate high - quality bias - conflict samples without increasing additional learning costs, thereby achieving effective de - biasing of the classifier. In summary, the main contributions of this paper are as follows: 1. For the first time, use pre - trained latent diffusion models and image captioning models to generate synthetic bias - conflict samples to reduce the bias of the classifier. 2. This method does not require training the generation model, thus eliminating the learning cost. 3. Although there is no additional training cost, this model can still effectively amplify the bias samples, thereby processing the biased data set and reducing the bias of the classifier.

Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation

DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions

Mitigating Social Biases in Text-to-Image Diffusion Models Via Linguistic-Aligned Attention Guidance

InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models

Balancing Act: Distribution-Guided Debiasing in Diffusion Models

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

DeNetDM: Debiasing by Network Depth Modulation

VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary

MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models

Unlocking Intrinsic Fairness in Stable Diffusion

Unmasking Bias in Diffusion Model Training

Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness

Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement

Model Debiasing by Learnable Data Augmentation

GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models

Debiasify: Self-Distillation for Unsupervised Bias Mitigation