Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation
Song Wang,Zhong Zhang,Huan Yan,Ming Xu,Guanghui Wang
2024-08-31
Abstract:H&E-to-IHC stain translation techniques offer a promising solution for precise cancer diagnosis, especially in low-resource regions where there is a shortage of health professionals and limited access to expensive equipment. Considering the pixel-level misalignment of H&E-IHC image pairs, current research explores the pathological consistency between patches from the same positions of the image pair. However, most of them overemphasize the correspondence between domains or patches, overlooking the side information provided by the non-corresponding objects. In this paper, we propose a Mix-Domain Contrastive Learning (MDCL) method to leverage the supervision information in unpaired H&E-to-IHC stain translation. Specifically, the proposed MDCL method aggregates the inter-domain and intra-domain pathology information by estimating the correlation between the anchor patch and all the patches from the matching images, encouraging the network to learn additional contrastive knowledge from mixed domains. With the mix-domain pathology information aggregation, MDCL enhances the pathological consistency between the corresponding patches and the component discrepancy of the patches from the different positions of the generated IHC image. Extensive experiments on two H&E-to-IHC stain translation datasets, namely MIST and BCI, demonstrate that the proposed method achieves state-of-the-art performance across multiple metrics.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in the absence of paired H&E (hematoxylin and eosin) - stained images and IHC (immunohistochemistry) - stained images, how to achieve the conversion from H&E - stained images to IHC - stained images. Specifically, the paper focuses on how to improve the accuracy and detail discrimination of the generated IHC images in terms of pathological consistency through the Mix - Domain Contrastive Learning (MDCL) method.
### Problem Background
1. **Differences between H&E Staining and IHC Staining**
- H&E staining is the most commonly used histochemical staining technique, which can visualize different tissue components and is widely used in cancer diagnosis.
- IHC staining specifically shows specific antigens or proteins through the interaction between labeled antibodies and intracellular antigens, which helps to more accurately identify cancer cell subtypes.
2. **Challenges**
- In low - resource areas, the lack of professional medical personnel and expensive equipment limits the application of IHC staining.
- It is physically impossible to obtain pixel - level aligned H&E - IHC image pairs because the same tissue section can only be stained once.
- Most of the existing research focuses on the pathological consistency between paired images, but ignores the side information provided by non - corresponding objects.
### Solutions Proposed in the Paper
To overcome the above challenges, the paper proposes a Mix - Domain Contrastive Learning (MDCL) method. The main features of this method are as follows:
1. **Mix - Domain Contrastive Loss**
- MDCL not only considers cross - domain information but also introduces intra - domain information.
- By estimating the correlation between the anchor patch and all patches in the matching image, the network is encouraged to learn additional contrastive knowledge from the mixed domain.
2. **Enhanced Pathological Consistency**
- MDCL enhances the pathological consistency between the generated IHC image and the input H&E image and improves the component differences between patches at different positions.
3. **Experimental Verification**
- Extensive experiments were carried out on two publicly available H&E - to - IHC staining conversion datasets (MIST and BCI), and the results show that this method has achieved state - of - the - art performance on multiple evaluation metrics.
### Formula Representation
The key formulas involved in the paper include:
- **Probability Calculation**
\[
P_{he}^i=\frac{\exp(z_i\cdot z_{he}^i / \tau)}{\exp(z_i\cdot z_{he}^i / \tau)+\sum_{j = 1, j\neq i}^{M}\exp(z_i\cdot z_{he}^j / \tau)}
\]
where \(z_i\) is the embedding of the anchor patch \(a_i\), \(z_{he}^i\) is the embedding of the corresponding H&E patch, and \(\tau\) is the temperature parameter.
- **Mix - Domain Contrastive Loss**
\[
L_{he}^i =-\log\left(\frac{\exp(z_i\cdot z_{he}^i / \tau)}{\sum_{j = 1}^{M}\exp(z_i\cdot z_{he}^j / \tau)+\sum_{j = 1, j\neq i}^{M}\exp(z_i\cdot z_j / \tau)}\right)
\]
where the first term represents cross - domain similarity and the second term represents intra - domain correlation.
Through these improvements, the MDCL method can achieve better results in the unpaired H&E - IHC staining conversion task, especially in terms of pathological consistency and detail generation.