DDFA: a displacement and diffusion-based feature augmentation method for imbalanced image recognition

Huangyuan Wu, Bin Li, Lianfang Tian, Chao Dong
DOI: https://doi.org/10.1007/s00371-024-03673-z
IF: 2.835
2024-10-25
The Visual Computer
Abstract: Data-driven computer vision methods have achieved great success in multiple fields, but learning a balanced classifier from an imbalanced distribution remains a major challenge for such methods. The key issue behind long-tailed distributions is the insufficient information available for tail classes. Recent works have adopted information augmentation (IA) to generate new tail-class samples and mitigate this issue. However, existing IA methods usually ignore feature drift, which harms the decision boundary. Additionally, these methods leverage only limited information to generate samples, so the quality of the generated samples cannot be guaranteed. To address these issues, we propose a displacement and diffusion-based feature augmentation (DDFA) method for learning a balanced model on an imbalanced training distribution. First, the feature reverse displacement module performs feature displacement on the original tail features, which mitigates the feature drift between head and tail classes. Subsequently, a long-tailed diffusion model is proposed to generate high-quality tail-class samples with both diversity and fidelity, which mitigates the information insufficiency of tail classes. Finally, the original and generated samples are combined in the feature space to promote balanced classifier learning. Experimental results on four challenging datasets demonstrate the effectiveness of the proposed DDFA method. The code is available at: https://github.com/wzh-why/DDFA.
computer science, software engineering
What problem does this paper attempt to address?