Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data

Chengzhou Yu, Huihui Fang, Hongqiu Wang, Ting Deng, Qing Du, Yanwu Xu, Weihua Yang
2024-12-17
Abstract: Fundus imaging is a critical tool in ophthalmology, with different imaging modalities offering unique advantages. For instance, fundus fluorescein angiography (FFA) can accurately identify eye diseases. However, traditional invasive FFA involves the injection of sodium fluorescein, which can cause discomfort and risks. Generating corresponding FFA images from non-invasive fundus images holds significant practical value but also presents challenges. First, limited datasets constrain the performance and effectiveness of models. Second, previous studies have primarily focused on generating FFA for single diseases or single modalities, often resulting in poor performance for patients with various ophthalmic conditions. To address these issues, we propose a novel latent diffusion model-based framework, Diffusion, which introduces a fine-tuning protocol to overcome the challenge of limited medical data and unleash the generative capabilities of diffusion models. Furthermore, we designed a new approach to tackle the challenges of generating across different modalities and disease types. On limited datasets, our framework achieves state-of-the-art results compared to existing methods, offering significant potential to enhance ophthalmic diagnostics and patient care. Our code will be released soon to support further research in this field.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
This paper attempts to solve the following problems:

1. **Limited data**: In ophthalmic image generation, and especially when generating fundus fluorescein angiography (FFA) images from non-invasive fundus images, small datasets limit model performance. Traditional generative models therefore struggle to produce good results.
2. **Cross-modality and multi-disease generation**: Previous studies have mainly focused on generating FFA images for a single disease or a single modality, and these methods often perform poorly for patients with multiple ophthalmic conditions. Generating images effectively across different modalities and disease types remains an open problem.
3. **Stability and quality of generative models**: Although traditional generative adversarial networks (GANs) and their variants perform well in image generation tasks, they suffer from unstable training and mode collapse. In high-resolution retinal image generation in particular, the generated images often contain artifacts, which limits their use in clinical diagnosis.

To solve these problems, the authors propose a new framework based on a latent diffusion model (LDM), with the following components:

- **A fine-tuning protocol**: Fine-tuning a pre-trained LDM overcomes the limitations of small medical datasets and improves the model's generative ability.
- **A cross-modality generation method**: A new method is designed to handle generation across different modalities and disease types, so that the generated images remain high quality across these settings.
- **An improved noise strategy**: An offset noise strategy explicitly injects the perceptual features of the target modality into the generation process, bringing the generated images closer to the statistical characteristics and style of the target modality.

With these improvements, the framework achieves better results than existing methods on limited datasets, showing its potential for improving ophthalmic diagnosis and patient care.
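
The summary does not give the paper's exact offset noise formulation, so the following is only a minimal sketch of the commonly used variant, in which a per-sample, per-channel constant is added to standard Gaussian noise before the usual forward diffusion step. The function names, the `offset_strength` parameter, and its default value are assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def offset_noise(shape, offset_strength=0.1, rng=None):
    """Sample Gaussian noise plus a constant offset per sample and channel.

    shape is (batch, channels, height, width). The offset is drawn once per
    (sample, channel) pair and broadcast over the spatial dimensions, so it
    shifts the mean of each latent channel. offset_strength is a hypothetical
    hyperparameter, not a value from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    b, c, h, w = shape
    base = rng.standard_normal((b, c, h, w))    # ordinary i.i.d. noise
    offset = rng.standard_normal((b, c, 1, 1))  # one value per channel
    return base + offset_strength * offset


def add_noise(latents, noise, alpha_bar_t):
    """Standard DDPM forward step: x_t = sqrt(a) * x_0 + sqrt(1 - a) * noise."""
    return np.sqrt(alpha_bar_t) * latents + np.sqrt(1.0 - alpha_bar_t) * noise


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    latents = rng.standard_normal((2, 4, 8, 8))  # stand-in for encoded images
    noise = offset_noise(latents.shape, offset_strength=0.1, rng=rng)
    noisy = add_noise(latents, noise, alpha_bar_t=0.5)
    print(noisy.shape)
```

In a fine-tuning loop, `offset_noise` would replace the plain Gaussian sample used as the denoising target. The intent of the offset term is to let the model learn shifts in overall channel statistics, which is one plausible reading of how the strategy moves outputs toward the target modality's style.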