Exploring Fusion Techniques in Multimodal AI-Based Recruitment: Insights from FairCVdb

Swati Swati,Arjun Roy,Eirini Ntoutsi
2024-06-17
Abstract:Despite the large body of work on fairness-aware learning for individual modalities like tabular data, images, and text, less work has been done on multimodal data, which fuses various modalities for a comprehensive analysis. In this work, we investigate the fairness and bias implications of multimodal fusion techniques in the context of multimodal AI-based recruitment systems using the FairCVdb dataset. Our results show that early-fusion closely matches the ground truth for both demographics, achieving the lowest MAEs by integrating each modality's unique characteristics. In contrast, late-fusion leads to highly generalized mean scores and higher MAEs. Our findings emphasise the significant potential of early-fusion for accurate and fair applications, even in the presence of demographic biases, compared to late-fusion. Future research could explore alternative fusion strategies and incorporate modality-related fairness constraints to improve fairness. For code and additional insights, visit: <a class="link-external link-https" href="https://github.com/Swati17293/Multimodal-AI-Based-Recruitment-FairCVdb" rel="external noopener nofollow">this https URL</a>
Computers and Society,Computation and Language,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
This paper aims to explore the fairness and bias impact of multimodal fusion techniques in AI-based recruitment systems. Specifically, the researchers used the FairCVdb dataset to evaluate the performance of two multimodal fusion techniques, Early Fusion and Late Fusion, under different bias settings. The study found that, in terms of gender and race, Early Fusion techniques can more accurately simulate real-world scenarios, achieving lower Mean Absolute Error (MAE), while Late Fusion leads to highly generalized scores and higher MAE. Therefore, the paper highlights the significant potential of Early Fusion in improving application accuracy and fairness and suggests that future research could explore Mid-Fusion strategies to further enhance the fairness and accuracy of decisions. Additionally, the study points out that although using multimodal data can enhance performance and reduce bias to some extent, blindly fusing data from all modalities does not necessarily yield the best results.