QID$^2$: An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data

Zijian Chen,Jueqi Wang,Archana Venkataraman
2024-09-04
Abstract:We propose an image-conditioned diffusion model to estimate high angular resolution diffusion weighted imaging (DWI) from a low angular resolution acquisition. Our model, which we call QID$^2$, takes as input a set of low angular resolution DWI data and uses this information to estimate the DWI data associated with a target gradient direction. We leverage a U-Net architecture with cross-attention to preserve the positional information of the reference images, further guiding the target image generation. We train and evaluate QID$^2$ on single-shell DWI samples curated from the Human Connectome Project (HCP) dataset. Specifically, we sub-sample the HCP gradient directions to produce low angular resolution DWI data and train QID$^2$ to reconstruct the missing high angular resolution samples. We compare QID$^2$ with two state-of-the-art GAN models. Our results demonstrate that QID$^2$ not only achieves higher-quality generated images, but it consistently outperforms the GAN models in downstream tensor estimation across multiple metrics. Taken together, this study highlights the potential of diffusion models, and QID$^2$ in particular, for q-space up-sampling, thus offering a promising toolkit for clinical and research applications.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to generate high - angular - resolution diffusion - weighted imaging (DWI) data from low - angular - resolution DWI data. Specifically, the authors proposed an image - conditioned diffusion model QID2 to estimate high - angular - resolution DWI data, thereby overcoming many problems in traditional methods in clinical applications due to overly long acquisition times. ### Problem Background Diffusion - weighted imaging (DWI) is a non - invasive technique that detects the microstructure of brain tissue by measuring the anisotropic diffusion of water molecules. Usually, DWI acquisition requires the application of multiple magnetic gradients. The directions of these gradients are defined by b - vectors, and their intensities are controlled by b - values. Mathematically, these gradients can be represented as a set of coordinates on a spherical surface, and their domain is called q - space. Higher angular resolution (i.e., more intensive direction sampling) can improve the quality of DWI, such as improving tensor estimation and fiber tractography. However, increasing the angular resolution will prolong the acquisition time, which is impractical in a clinical environment because a longer acquisition time is not only expensive but also unbearable for patients and is likely to cause motion artifacts. ### Research Objectives To meet this challenge, this paper proposes an image - conditioned diffusion model named QID2, aiming to generate high - angular - resolution DWI data from low - angular - resolution DWI data. QID2 utilizes the U - Net architecture and cross - attention mechanism to guide the generation of the target image by referring to the low - resolution image. This model can effectively generate high - quality, high - angular - resolution DWI images while maintaining position information. ### Main Contributions 1. **High - Angular - Resolution DWI Generation**: QID2 can generate high - quality, high - angular - resolution DWI data from a limited number of initial scan directions. 2. **Superior Performance**: Experimental results show that QID2 is not only superior to existing GAN models in image quality but also performs better in downstream tasks such as tensor estimation. 3. **Potential for Clinical Applications**: This method provides a promising tool for clinical and research applications, especially in situations where fast, high - quality DWI data are required. ### Method Overview The core idea of QID2 is to utilize the denoising characteristics of the diffusion model and combine low - resolution DWI data as prior information to generate high - resolution DWI images in any target gradient direction. Specific steps include: - **Forward Noise Process**: Gradually add Gaussian noise to the generated image. - **Reverse Denoising Process**: Train the model to learn how to reverse this noise process to restore the original image. - **Cross - Attention Mechanism**: Utilize the cross - attention mechanism in the U - Net architecture to ensure that position information is fully considered during the generation process. In this way, QID2 can generate high - quality, high - angular - resolution DWI images while maintaining computational efficiency.