Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation

Qilai Zhang,Jiawen Li,Peiran Liao,Jiali Hu,Tian Guan,Anjia Han,Yonghong He
2024-06-26
Abstract:The two primary types of Hematoxylin and Eosin (H&E) slides in histopathology are Formalin-Fixed Paraffin-Embedded (FFPE) and Fresh Frozen (FF). FFPE slides offer high quality histopathological images but require a labor-intensive acquisition process. In contrast, FF slides can be prepared quickly, but the image quality is relatively poor. Our task is to translate FF images into FFPE style, thereby improving the image quality for diagnostic purposes. In this paper, we propose Diffusion-FFPE, a method for FF-to-FFPE histopathological image translation using a pre-trained diffusion model. Specifically, we employ a one-step diffusion model as the generator and fine-tune it with LoRA adapters using adversarial learning objectives. To ensure that the model effectively captures both global structural information and local details, we propose a multi-scale feature fusion (MFF) module. This module utilizes two VAE encoders to extract features of varying image sizes and performs feature fusion before feeding them into the UNet. Furthermore, we utilize a pre-trained vision-language model for histopathology as the backbone for the discriminator to further improve performance We conducted FF-to-FFPE translation experiments on the TCGA-NSCLC datasets, and our method achieved better performance compared to other methods. The code and models are released at <a class="link-external link-https" href="https://github.com/QilaiZhang/Diffusion-FFPE" rel="external noopener nofollow">this https URL</a>.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? The main goal of this paper is to improve image quality in pathological diagnosis by converting fresh frozen (FF) tissue slice images to formalin-fixed paraffin-embedded (FFPE) tissue slice images using a pre-trained model. Specifically, the authors propose a method called **Diffusion-FFPE**, which uses a pre-trained diffusion model as a generator and fine-tunes it with an adversarial learning objective. To ensure the model captures both global structural information and local details, they also introduce a multi-scale feature fusion (MFF) module. #### Main Contributions: 1. **Application of Pre-trained Models**: The method uses pre-trained models as the generator and discriminator, achieving good results in the FF to FFPE image conversion task. 2. **Multi-Scale Feature Fusion Module (MFF)**: This module captures information from pathological images at different scales, helping to generate finer details. 3. **Experimental Results**: Experiments on the TCGA-NSCLC dataset show that this method outperforms other open-source methods. #### Related Work: - Existing methods mainly use Generative Adversarial Network (GAN)-based approaches to achieve FF to FFPE image conversion. - Recently, some studies have started using diffusion models for unpaired image translation, but these methods usually require multi-step sampling, are slower, and perform poorly in maintaining image structure. #### Method Overview: - **Generator**: Uses a pre-trained one-step diffusion model SD-turbo as the generator, with a trainable LoRA adapter added to adapt to the data distribution of pathological images. - **Multi-Scale Feature Fusion (MFF)**: An MFF module is designed to combine global and local features to enhance the model's focus on small regions. - **Discriminator**: Uses a pre-trained visual model CONCH as the backbone network of the discriminator to improve training efficiency. - **Adversarial Learning Objective**: Adopts CycleGAN's loss functions, including adversarial loss, cycle consistency loss, and identity regularization loss. #### Experimental Results: - On the TCGA-NSCLC dataset, this method achieved an FID score of 15.78 and a KID score of 8.17×10^(-3), outperforming other methods. - Visualization results show that this method effectively corrects freezing artifacts in tissues, making the distinction between tissue and blank areas clearer. #### Conclusion: The Diffusion-FFPE method proposed in this paper leverages pre-trained models and multi-scale feature fusion techniques, showing excellent performance in the FF to FFPE image conversion task and demonstrating high potential for clinical applications.