DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

Yijun Yang,Huazhu Fu,Angelica I. Aviles-Rivero,Carola-Bibiane Schönlieb,Lei Zhu
2023-07-11
Abstract:Diffusion Probabilistic Models have recently shown remarkable performance in generative image modeling, attracting significant attention in the computer vision community. However, while a substantial amount of diffusion-based research has focused on generative tasks, few studies have applied diffusion models to general medical image classification. In this paper, we propose the first diffusion-based model (named DiffMIC) to address general medical image classification by eliminating unexpected noise and perturbations in medical images and robustly capturing semantic representation. To achieve this goal, we devise a dual conditional guidance strategy that conditions each diffusion step with multiple granularities to improve step-wise regional attention. Furthermore, we propose learning the mutual information in each granularity by enforcing Maximum-Mean Discrepancy regularization during the diffusion forward process. We evaluate the effectiveness of our DiffMIC on three medical classification tasks with different image modalities, including placental maturity grading on ultrasound images, skin lesion classification using dermatoscopic images, and diabetic retinopathy grading using fundus images. Our experimental results demonstrate that DiffMIC outperforms state-of-the-art methods by a significant margin, indicating the universality and effectiveness of the proposed model. Our code will be publicly available at <a class="link-external link-https" href="https://github.com/scott-yjyang/DiffMIC" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address several key issues in medical image classification. Specifically: 1. **Eliminating Noise and Disturbances**: Existing medical image classification methods often suffer from various blurred lesions and fine-grained tissues when dealing with different modalities (such as ultrasound, dermoscopy, and fundus images). Additionally, medical images generated under hardware limitations may carry noise and blur effects, reducing image quality. Therefore, this paper proposes a diffusion model-based method (named DiffMIC) to eliminate undesirable noise in medical images and robustly capture semantic representations. 2. **Introducing Dual-Granularity Conditional Guidance Strategy**: To improve regional attention at each step, the authors designed a dual-conditional guidance strategy (DCG) that utilizes multi-granularity information for conditional constraints during the diffusion process. 3. **Maximum Mean Discrepancy Regularization**: During the forward diffusion process, maximum mean discrepancy (MMD) regularization is applied to learn mutual information at each granularity, enabling the network to model robust feature representations shared between the entire image and its patches. 4. **Validating Effectiveness**: The researchers evaluated the effectiveness of DiffMIC on three different medical image classification tasks, including placental maturity grading, skin lesion classification, and diabetic retinopathy grading. Experimental results show that DiffMIC significantly outperforms existing state-of-the-art methods in all three tasks. In summary, this paper proposes a novel diffusion model-based framework to address issues such as noise elimination, fine-grained information capture, and feature representation modeling in medical image classification, demonstrating its superior performance across various medical images.