Abstract:The two primary types of Hematoxylin and Eosin (H&E) slides in histopathology are Formalin-Fixed Paraffin-Embedded (FFPE) and Fresh Frozen (FF). FFPE slides offer high quality histopathological images but require a labor-intensive acquisition process. In contrast, FF slides can be prepared quickly, but the image quality is relatively poor. Our task is to translate FF images into FFPE style, thereby improving the image quality for diagnostic purposes. In this paper, we propose Diffusion-FFPE, a method for FF-to-FFPE histopathological image translation using a pre-trained diffusion model. Specifically, we employ a one-step diffusion model as the generator and fine-tune it with LoRA adapters using adversarial learning objectives. To ensure that the model effectively captures both global structural information and local details, we propose a multi-scale feature fusion (MFF) module. This module utilizes two VAE encoders to extract features of varying image sizes and performs feature fusion before feeding them into the UNet. Furthermore, we utilize a pre-trained vision-language model for histopathology as the backbone for the discriminator to further improve performance We conducted FF-to-FFPE translation experiments on the TCGA-NSCLC datasets, and our method achieved better performance compared to other methods. The code and models are released at <a class="link-external link-https" href="https://github.com/QilaiZhang/Diffusion-FFPE" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? The main goal of this paper is to improve image quality in pathological diagnosis by converting fresh frozen (FF) tissue slice images to formalin-fixed paraffin-embedded (FFPE) tissue slice images using a pre-trained model. Specifically, the authors propose a method called **Diffusion-FFPE**, which uses a pre-trained diffusion model as a generator and fine-tunes it with an adversarial learning objective. To ensure the model captures both global structural information and local details, they also introduce a multi-scale feature fusion (MFF) module. #### Main Contributions: 1. **Application of Pre-trained Models**: The method uses pre-trained models as the generator and discriminator, achieving good results in the FF to FFPE image conversion task. 2. **Multi-Scale Feature Fusion Module (MFF)**: This module captures information from pathological images at different scales, helping to generate finer details. 3. **Experimental Results**: Experiments on the TCGA-NSCLC dataset show that this method outperforms other open-source methods. #### Related Work: - Existing methods mainly use Generative Adversarial Network (GAN)-based approaches to achieve FF to FFPE image conversion. - Recently, some studies have started using diffusion models for unpaired image translation, but these methods usually require multi-step sampling, are slower, and perform poorly in maintaining image structure. #### Method Overview: - **Generator**: Uses a pre-trained one-step diffusion model SD-turbo as the generator, with a trainable LoRA adapter added to adapt to the data distribution of pathological images. - **Multi-Scale Feature Fusion (MFF)**: An MFF module is designed to combine global and local features to enhance the model's focus on small regions. - **Discriminator**: Uses a pre-trained visual model CONCH as the backbone network of the discriminator to improve training efficiency. - **Adversarial Learning Objective**: Adopts CycleGAN's loss functions, including adversarial loss, cycle consistency loss, and identity regularization loss. #### Experimental Results: - On the TCGA-NSCLC dataset, this method achieved an FID score of 15.78 and a KID score of 8.17×10^(-3), outperforming other methods. - Visualization results show that this method effectively corrects freezing artifacts in tissues, making the distinction between tissue and blank areas clearer. #### Conclusion: The Diffusion-FFPE method proposed in this paper leverages pre-trained models and multi-scale feature fusion techniques, showing excellent performance in the FF to FFPE image conversion task and demonstrating high potential for clinical applications.

Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation

F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation

FFPE++: Improving the quality of formalin-fixed paraffin-embedded tissue imaging via contrastive unpaired image-to-image translation

ST-MKSC: the FF-FFPE Stain Transfer Based on Multiple Key Structure Constraint.

PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains

Advancing H&E-to-IHC Stain Translation in Breast Cancer: A Multi-Magnification and Attention-Based Approach

Synthetic Augmentation with Large-scale Unconditional Pre-training

A Multi-Perspective Self-Supervised Generative Adversarial Network for FS to FFPE Stain Transfer

Data Augmentation Based on DiscrimDiff for Histopathology Image Classification

Pix2Pix-based Stain-to-Stain Translation: A Solution for Robust Stain Normalization in Histopathology Images Analysis

Data-Efficient Histopathology Image Analysis with Deformation Representation Learning

Adaptive Supervised PatchNCE Loss for Learning H&E-to-IHC Stain Translation with Inconsistent Groundtruth Image Pairs

FDDM: Unsupervised Medical Image Translation with a Frequency-Decoupled Diffusion Model

HistoTransfer: Understanding Transfer Learning for Histopathology

A Federated Learning System for Histopathology Image Analysis With an Orchestral Stain-Normalization GAN

AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation

Versatile Stain Transfer in Histopathology Using a Unified Diffusion Framework

Improving Feature Extraction from Histopathological Images Through A Fine-tuning ImageNet Model

PMFN-SSL: Self-supervised learning-based progressive multimodal fusion network for cancer diagnosis and prognosis

LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models

Deep-learning-based cross-modality translation from Stokes image to bright-field contrast