FACMIC: Federated Adaptative CLIP Model for Medical Image Classification

Yihang Wu,Christian Desrosiers,Ahmad Chaddad
2024-10-08
Abstract:Federated learning (FL) has emerged as a promising approach to medical image analysis that allows deep model training using decentralized data while ensuring data privacy. However, in the field of FL, communication cost plays a critical role in evaluating the performance of the model. Thus, transferring vision foundation models can be particularly challenging due to the significant resource costs involved. In this paper, we introduce a federated adaptive Contrastive Language Image Pretraining CLIP model designed for classification tasks. We employ a light-weight and efficient feature attention module for CLIP that selects suitable features for each client's data. Additionally, we propose a domain adaptation technique to reduce differences in data distribution between clients. Experimental results on four publicly available datasets demonstrate the superior performance of FACMIC in dealing with real-world and multisource medical imaging data. Our codes are available at <a class="link-external link-https" href="https://github.com/AIPMLab/FACMIC" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Artificial Intelligence,Image and Video Processing
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to address two main issues encountered in medical image classification within Federated Learning (FL): 1. **Communication Cost**: In Federated Learning, the amount of data transmitted is usually proportional to the number of model parameters. For vision-language models with a large number of parameters (such as CLIP), this communication cost is very high, hindering their application in Federated Learning. 2. **Data Distribution Discrepancies**: There may be significant differences in data distribution between different clients, which can affect the accuracy of the model's predictions. This is especially true in the field of medical imaging, where images collected from different devices may introduce inconsistencies. To address these issues, the authors propose a new Federated Adaptive CLIP Model (FACMIC). This model improves performance through the following methods: - **Lightweight Feature Attention Module**: This module helps the model focus on useful features in each client's data, thereby reducing communication costs. - **Domain Adaptation Techniques**: These techniques are used to reduce data distribution discrepancies between different clients, enhancing the model's generalization ability. Experimental results show that FACMIC performs excellently in handling real-world and multi-source medical imaging data, outperforming existing Federated Learning methods.