MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction

Pinhuang Tan, Mengxiao Geng, Jingya Lu, Liu Shi, Bin Huang, Qiegen Liu
2024-05-09
Abstract: Computed Tomography (CT) technology reduces radiation hazards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT reconstruction, but their performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction method utilizing multi-scale diffusion models (MSDiff), designed to concentrate on the global distribution of information and facilitate the reconstruction of sparse views with local image characteristics. Specifically, the proposed model integrates information from both comprehensive sampling and selectively sparse sampling techniques. Through precise adjustments to the diffusion model, it is capable of extracting diverse noise distributions, furthering the understanding of the overall structure of images, and aiding the fully sampled model in recovering image information more effectively. By leveraging the inherent correlations within the projection data, we have designed an equidistant mask, enabling the model to focus its attention more effectively. Experimental results demonstrated that the multi-scale model approach significantly improved the quality of image reconstruction under ultra-sparse angles, with good generalization across various datasets.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper primarily addresses the issue of Computed Tomography (CT) image reconstruction from ultra-sparse views. Traditional methods reduce radiation dose by decreasing the number of projection angles, but when the number of angles is significantly reduced, the quality of image reconstruction drops sharply, resulting in noticeable artifacts and distortions. To further reduce radiation dose while ensuring image quality, the authors propose a method based on a Multi-Scale Diffusion Model (MSDiff).

The main contributions mentioned in the paper are as follows:

1. **Multi-Scale Diffusion Model for enhancing local texture and detail recovery**: Due to the extremely limited information contained in ultra-sparse view data, this method aims to make full use of the available data and compensate for the challenges brought by data loss through model extraction of local projection information. Combining local projections with structural details can improve the accuracy of image reconstruction and the recovery of local details while ensuring global consistency.
2. **Designing specialized masks to extract sparse prior information by leveraging the intrinsic correlation in projection data**: This method uses fewer prior clues to enhance the model's understanding of sparse data distribution. At the same time, the model can more accurately capture key features in the data, thereby more precisely recovering local textures and details during the reconstruction process.

Specifically, the MSDiff method includes two diffusion models: the Full-View Diffusion Model (FDM) and the Sparse-View Diffusion Model (SDM). FDM is used to extract global information from fully sampled data and reduce noise and artifacts; SDM focuses on extracting specific local information from sparse projection data. These two models constrain each other through alternating iterations, significantly improving the quality of ultra-sparse view CT image reconstruction.
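The alternating scheme between FDM and SDM can be pictured with a small control-flow sketch. This is not the authors' sampler: the summary does not give the update rule, so the two "models" below are hypothetical smoothing functions standing in for trained score networks, and the data-consistency step that re-imposes the measured views after each update is an assumption. The names `alternate_reconstruction`, `toy_fdm_step`, and `toy_sdm_step`, and the sizes used, are invented for illustration.

```python
import numpy as np

def toy_fdm_step(x, t):
    # Stand-in for a full-view (global) prior update, NOT a trained model:
    # average each view with its two angular neighbours (periodic in angle).
    return (np.roll(x, 1, axis=0) + x + np.roll(x, -1, axis=0)) / 3.0

def toy_sdm_step(x, t):
    # Stand-in for a sparse-view (local) prior update, NOT a trained model:
    # average each detector bin with its two neighbours.
    return (np.roll(x, 1, axis=1) + x + np.roll(x, -1, axis=1)) / 3.0

def alternate_reconstruction(measured, mask, fdm_step, sdm_step, n_iters=10, seed=0):
    """Alternate two prior updates, re-imposing the measured views after each.

    measured : (n_views, n_det) sinogram, valid only where mask is True
    mask     : (n_views, 1) boolean array marking the measured views
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(measured.shape)   # start from pure noise
    for t in range(n_iters, 0, -1):
        x = fdm_step(x, t)                    # global prior update
        x = np.where(mask, measured, x)       # assumed data-consistency step
        x = sdm_step(x, t)                    # local prior update
        x = np.where(mask, measured, x)       # assumed data-consistency step
    return x

# Synthetic example: 360 views, 64 detector bins, every 12th view measured.
n_views, n_det = 360, 64
angles = np.linspace(0.0, np.pi, n_views, endpoint=False)
full = np.sin(angles)[:, None] * np.ones((1, n_det))
mask = np.zeros((n_views, 1), dtype=bool)
mask[::12] = True
measured = np.where(mask, full, 0.0)

recon = alternate_reconstruction(measured, mask, toy_fdm_step, toy_sdm_step)
```

The sketch only shows the ordering of the steps: each prior refines the current estimate in turn, and under the stated assumption the measured views are restored after every update so that neither prior can drift away from the acquired data.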
Overall, the process uses diffusion models to learn the statistical distribution of the data in the projection domain and then reconstructs the image iteratively.
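To make the projection-domain setting concrete, the following minimal sketch builds an equidistant mask over the view angles of a sinogram. It assumes the mask simply keeps evenly spaced views, which the summary does not spell out; the function name `equidistant_view_mask` and the view counts (720 full views, 30 kept) are hypothetical and not taken from the paper.

```python
import numpy as np

def equidistant_view_mask(n_views, n_keep):
    """Return a boolean mask that keeps n_keep evenly spaced views out of n_views."""
    if not 0 < n_keep <= n_views:
        raise ValueError("n_keep must be in the range 1..n_views")
    # Integer arithmetic gives distinct, evenly spread indices whenever n_keep <= n_views.
    idx = (np.arange(n_keep) * n_views) // n_keep
    mask = np.zeros(n_views, dtype=bool)
    mask[idx] = True
    return mask

# Hypothetical sizes: a 720-view sinogram with 128 detector bins, reduced to 30 views.
rng = np.random.default_rng(0)
sinogram = rng.random((720, 128))          # random stand-in for fully sampled data
mask = equidistant_view_mask(720, 30)
sparse_sinogram = sinogram[mask]           # rows are the retained projection views
```

Indexing by the mask drops the unmeasured views entirely; multiplying by `mask[:, None]` instead would keep the full sinogram shape with zeros in the missing rows, which is the form that suits a network expecting fixed-size input.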