MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection

Ximiao Zhang, Min Xu, Dehui Qiu, Ruixin Yan, Ning Lang, Xiuzhuang Zhou
2024-05-18
Abstract: In the field of medical decision-making, precise anomaly detection in medical imaging plays a pivotal role in aiding clinicians. However, previous work is reliant on large-scale datasets for training anomaly detection models, which increases the development cost. This paper first focuses on the task of medical image anomaly detection in the few-shot setting, which is critically significant for the medical field where data collection and annotation are both very expensive. We propose an innovative approach, MediCLIP, which adapts the CLIP model to few-shot medical image anomaly detection through self-supervised fine-tuning. Although CLIP, as a vision-language model, demonstrates outstanding zero-/few-shot performance on various downstream tasks, it still falls short in the anomaly detection of medical images. To address this, we design a series of medical image anomaly synthesis tasks to simulate common disease patterns in medical imaging, transferring the powerful generalization capabilities of CLIP to the task of medical image anomaly detection. When only few-shot normal medical images are provided, MediCLIP achieves state-of-the-art performance in anomaly detection and localization compared to other methods. Extensive experiments on three distinct medical anomaly detection tasks have demonstrated the superiority of our approach. The code is available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the problem of detecting and localizing anomalies in medical images accurately when only a few samples are available. Traditional methods usually require a large amount of annotated data to train models, which is both expensive and time-consuming in the medical field. MediCLIP detects and localizes anomalies using only a small number of normal images: it fine-tunes the CLIP model in a self-supervised way, using synthetic anomaly images that simulate common disease patterns. The main contributions of the paper are:

1. **Proposing the MediCLIP model**: An approach that adapts the CLIP model to the few-shot medical image anomaly detection task, improving the model's generalization ability through self-supervised fine-tuning.
2. **Designing a multi-task anomaly synthesis strategy**: Simulated anomaly images are generated through several synthesis tasks, including CutPaste, GaussIntensityChange, and Source, to imitate common anomaly patterns in medical images.
3. **Introducing learnable prompts and adapters**: Learnable prompts remove the need for complex manual prompt design, and adapters strengthen the model's multi-scale lesion localization capability on medical images.
4. **Demonstrating effectiveness through experiments**: Extensive experiments on three medical image datasets (CheXpert, BrainMRI, and BUSI) show that MediCLIP significantly outperforms existing methods in the few-shot setting, in both anomaly detection and localization.

In summary, the paper aims to reduce the development cost of medical image anomaly detection while improving model performance, thereby supporting clinical decision-making.
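To make the idea of anomaly synthesis more concrete, the sketch below shows one plausible way to produce a synthetic anomaly and its pixel-level mask from a single normal grayscale image. It is an illustration only, not the authors' implementation: the function names `cutpaste` and `gauss_intensity_change`, the parameters `patch_frac`, `sigma_frac`, and `max_delta`, and the assumption that images are 2D float arrays in [0, 1] are all hypothetical choices made here. The Source task is not sketched because this summary does not describe it.

```python
import numpy as np


def cutpaste(image, rng, patch_frac=0.25):
    """Copy a random rectangular patch to another random location.

    Hypothetical CutPaste-style synthesis: returns the altered image and a
    binary mask marking the pasted region.
    """
    h, w = image.shape
    ph, pw = max(1, int(h * patch_frac)), max(1, int(w * patch_frac))
    # Top-left corners of the source and destination patches.
    sy, sx = rng.integers(0, h - ph + 1), rng.integers(0, w - pw + 1)
    dy, dx = rng.integers(0, h - ph + 1), rng.integers(0, w - pw + 1)
    out = image.copy()
    out[dy:dy + ph, dx:dx + pw] = image[sy:sy + ph, sx:sx + pw]
    # The mask marks the destination region even if the pasted pixels
    # happen to match the original ones.
    mask = np.zeros((h, w), dtype=np.float32)
    mask[dy:dy + ph, dx:dx + pw] = 1.0
    return out, mask


def gauss_intensity_change(image, rng, sigma_frac=0.1, max_delta=0.5):
    """Brighten or darken a soft Gaussian-shaped region.

    Hypothetical intensity-change synthesis: returns the altered image and
    a binary mask covering the core of the Gaussian blob.
    """
    h, w = image.shape
    cy, cx = rng.integers(0, h), rng.integers(0, w)
    yy, xx = np.mgrid[0:h, 0:w]
    sigma = max(1.0, sigma_frac * min(h, w))
    blob = np.exp(-((yy - cy) ** 2 + (xx - cx) ** 2) / (2.0 * sigma ** 2))
    delta = rng.uniform(-max_delta, max_delta)
    out = np.clip(image + delta * blob, 0.0, 1.0).astype(image.dtype)
    mask = (blob > 0.5).astype(np.float32)
    return out, mask


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    normal = rng.random((64, 64)).astype(np.float32)  # stand-in for a normal scan
    anomalous, mask = cutpaste(normal, rng)
    print(anomalous.shape, int(mask.sum()))
```

In a self-supervised setup of this kind, each (synthetic image, mask) pair can serve as a training example for both image-level detection and pixel-level localization, which is why the functions return the mask alongside the image.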