Finn Behrendt,Debayan Bhattacharya,Robin Mieling,Lennart Maack,Julia Krüger,Roland Opfer,Alexander Schlaefer
Abstract:Unsupervised anomaly detection in Brain MRIs aims to identify abnormalities as outliers from a healthy training distribution. Reconstruction-based approaches that use generative models to learn to reconstruct healthy brain anatomy are commonly used for this task. Diffusion models are an emerging class of deep generative models that show great potential regarding reconstruction fidelity. However, they face challenges in preserving intensity characteristics in the reconstructed images, limiting their performance in anomaly detection. To address this challenge, we propose to condition the denoising mechanism of diffusion models with additional information about the image to reconstruct coming from a latent representation of the noise-free input image. This conditioning enables high-fidelity reconstruction of healthy brain structures while aligning local intensity characteristics of input-reconstruction pairs. We evaluate our method's reconstruction quality, domain adaptation features and finally segmentation performance on publicly available data sets with various pathologies. Using our proposed conditioning mechanism we can reduce the false-positive predictions and enable a more precise delineation of anomalies which significantly enhances the anomaly detection performance compared to established state-of-the-art approaches to unsupervised anomaly detection in brain MRI. Furthermore, our approach shows promise in domain adaptation across different MRI acquisitions and simulated contrasts, a crucial property of general anomaly detection methods.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
This paper aims to address the challenges of unsupervised anomaly detection (UAD) in brain MRI. Specifically, the authors focus on how to reconstruct healthy brain anatomical structures through generative models and utilize this reconstruction ability to identify and localize abnormal regions. Although existing diffusion - model - based methods perform well in reconstruction quality, they have difficulties in maintaining the intensity features of the reconstructed images, which limits their performance in anomaly detection. To overcome this challenge, the authors propose a conditional diffusion model (cDDPM). By introducing the latent representation of the noise - free input image to guide the denoising process, high - fidelity reconstruction of healthy brain structures is achieved while aligning the local intensity features between the input and the reconstructed images. This method not only reduces false - positive predictions but also improves the accuracy of anomaly detection, significantly enhancing the performance of unsupervised anomaly detection.
### Problems Solved by the Paper:
1. **Maintaining the Intensity Features of Reconstructed Images**: Existing diffusion models have difficulty maintaining the intensity features of the original images during the reconstruction process, resulting in more false - positive results in anomaly detection. The paper solves this problem by introducing a conditional denoising mechanism and using the latent representation of the input image to guide the reconstruction process.
2. **Improving the Accuracy of Anomaly Detection**: By reducing false - positive predictions and more precisely delineating abnormal regions, the method proposed in the paper significantly improves the accuracy of anomaly detection.
3. **Enhancing Cross - Domain Adaptation Ability**: The method proposed in the paper shows good domain adaptation ability under different MRI acquisition methods and simulated contrasts, which is particularly important for general anomaly detection methods.
### Method Overview:
- **Conditional Diffusion Model (cDDPM)**: By introducing the latent representation of the input image during the denoising process, the model is guided to generate reconstruction results closer to the original image.
- **Extraction of Latent Representation**: A CNN - based image encoder is used to extract the latent representation from the input image.
- **Linear Transformation of Feature Maps**: In each residual block, the feature maps of Unet are scaled and translated according to the projected encoded representation to incorporate the information of the input image.
### Experimental Setup:
- **Dataset**: The training set uses healthy brain MRI scans from the IXI dataset, and the test set contains different pathological types from multiple public datasets, such as BraTS21, ATLAS, MSLUB, and WMH.
- **Pre - processing**: Pre - processing steps such as resampling, registration, skull removal, and bias - field correction are performed on the images.
- **Post - processing**: Binary segmentation maps are extracted from the residual maps through steps such as median filtering, brain mask erosion, threshold search, and connected - component filtering.
### Results and Discussion:
- **Reconstruction Quality**: The reconstruction quality is evaluated by metrics such as SSIM, PSNR, and LPIPS. The results show that cDDPM is superior to existing methods in maintaining local intensity features.
- **Domain Adaptation Ability**: Through experiments at different contrast levels, the domain adaptation ability of cDDPM on cross - domain data is verified.
- **Anomaly Detection Performance**: Experimental results on multiple datasets show that cDDPM performs well in the anomaly detection task and can effectively reduce false - positive predictions and improve the precision of abnormal regions.
In conclusion, this paper addresses the shortcomings of existing methods in maintaining the intensity features of reconstructed images by proposing a conditional diffusion model, significantly improving the performance of unsupervised anomaly detection, especially the domain adaptation ability under different MRI acquisition methods.