Integrating Deep Unfolding with Direct Diffusion Bridges for Computed Tomography Reconstruction

Herman Verinaz-Jadan, Su Yan
2024-09-15
Abstract: Computed Tomography (CT) is widely used in healthcare for detailed imaging. However, low-dose CT, despite reducing radiation exposure, often results in images with compromised quality due to increased noise. Traditional methods, including preprocessing, post-processing, and model-based approaches that leverage physical principles, are employed to improve the quality of image reconstructions from noisy projections or sinograms. Recently, deep learning has significantly advanced the field, with diffusion models outperforming both traditional methods and other deep learning approaches. These models effectively merge deep learning with physics, serving as robust priors for the inverse problem in CT. However, they typically require prolonged computation times during sampling. This paper introduces the first approach to merge deep unfolding with Direct Diffusion Bridges (DDBs) for CT, integrating the physics into the network architecture and facilitating the transition from degraded to clean images by bypassing excessively noisy intermediate stages commonly encountered in diffusion models. Moreover, this approach includes a tailored training procedure that eliminates errors typically accumulated during sampling. The proposed approach requires fewer sampling steps and demonstrates improved fidelity metrics, outperforming many existing state-of-the-art techniques.
Image and Video Processing
What problem does this paper attempt to address?
This paper addresses image quality degradation in low-dose computed tomography (LDCT). Low-dose CT reduces the patient's radiation exposure, but it also increases image noise, which can affect diagnostic accuracy. Traditional approaches such as preprocessing, post-processing, and model-based methods improve image quality to some extent, but their effect is limited on high-noise data. In recent years, deep learning methods, and diffusion models in particular, have made remarkable progress in CT reconstruction quality, but these models usually require long computation times for sampling.

To address this, the paper proposes a framework that uses deep unfolding to combine the system physics with Direct Diffusion Bridges (DDBs) for CT image reconstruction. By building the physics into the network architecture, the method moves from degraded images to clean images while bypassing the excessively noisy intermediate stages common in diffusion models. It also includes a tailored training procedure that eliminates the errors typically accumulated during sampling. The experimental results show that the method improves the fidelity metrics of the reconstruction while using fewer sampling steps, and that it outperforms many existing state-of-the-art techniques.

### Main contributions

1. **Integrating physical principles**: The physics is embedded directly into the network architecture through deep unfolding, rather than being used only at the sampling stage.
2. **Direct Diffusion Bridges (DDBs)**: DDBs simplify the transition from degraded images to clean images and skip the excessively noisy intermediate stages.
3. **Customized training process**: A new training procedure addresses the errors that accumulate during sampling and improves reconstruction quality.
4. **Reducing computational requirements**: Compared with traditional diffusion models, the method requires fewer sampling steps, which lowers the computational cost.

### Experimental results

The paper reports experiments on the Mayo Clinic low-dose CT challenge dataset. The results show that the method performs well on Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index (SSIM), and Learned Perceptual Image Patch Similarity (LPIPS), outperforming other existing methods.

### Conclusion

By combining system physics with direct diffusion bridges, the proposed framework significantly improves the reconstruction quality of low-dose CT images while reducing computational requirements. Future work will explore applying the method to other imaging inverse problems.
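
The deep unfolding idea mentioned in the contributions can be illustrated with a minimal sketch. It is not the paper's network: it assumes a toy random matrix `A` in place of the CT projection operator, and a fixed soft-thresholding step in place of any learned module. The names `unfolded_reconstruction`, `soft_threshold`, `num_stages`, and `tau` are hypothetical and chosen only for illustration. The point is the structure, where each stage applies a physics-based data-consistency step followed by a prior step, for a fixed number of stages.

```python
import numpy as np

def soft_threshold(x, tau):
    # Stand-in for a learned prior module (assumption: a trained network
    # would normally take this role in a deep-unfolded architecture).
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def unfolded_reconstruction(A, y, num_stages=50, tau=0.01):
    # Step size 1/L, where L is the squared spectral norm of A.
    step = 1.0 / np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    for _ in range(num_stages):
        # Data-consistency step: gradient of 0.5 * ||A x - y||^2.
        x = x - step * (A.T @ (A @ x - y))
        # Prior step: here a simple sparsity-promoting shrinkage.
        x = soft_threshold(x, step * tau)
    return x

# Toy problem (assumption: random A, sparse ground truth, mild noise).
rng = np.random.default_rng(0)
A = rng.standard_normal((40, 60)) / np.sqrt(40)
x_true = np.zeros(60)
x_true[[5, 17, 42]] = [1.0, -0.8, 0.6]
y = A @ x_true + 0.01 * rng.standard_normal(40)

x_hat = unfolded_reconstruction(A, y)
residual_start = np.linalg.norm(y)            # residual of the zero initialization
residual_end = np.linalg.norm(A @ x_hat - y)  # residual after all stages
print(f"residual: {residual_start:.3f} -> {residual_end:.3f}")
```

In a learned variant, the step size and the prior step would typically become trainable parameters of each stage, so the number of stages is fixed in advance rather than chosen by a convergence test.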