Full-dose Whole-body PET Synthesis from Low-dose PET Using High-efficiency Denoising Diffusion Probabilistic Model: PET Consistency Model

Shaoyan Pan, Elham Abouei, Junbo Peng, Joshua Qian, Jacob F Wynne, Tonghe Wang, Chih-Wei Chang, Justin Roper, Jonathon A Nye, Hui Mao, Xiaofeng Yang
2024-04-17
Abstract: Objective: Positron Emission Tomography (PET) is a commonly used imaging modality across a broad range of clinical applications. One of the most important tradeoffs in PET imaging is between image quality and radiation dose: high image quality comes with high radiation exposure. Improving image quality is desirable for all clinical applications, while minimizing radiation exposure is needed to reduce risk to patients. Approach: We introduce the PET Consistency Model (PET-CM), an efficient diffusion-based method for generating high-quality full-dose PET images from low-dose PET images. It employs a two-step process: adding Gaussian noise to full-dose PET images in the forward diffusion, and then denoising them using a PET Shifted-window Vision Transformer (PET-VIT) network in the reverse diffusion. The PET-VIT network learns a consistency function that enables direct denoising of Gaussian noise into clean full-dose PET images. PET-CM achieves state-of-the-art image quality while requiring significantly less computation time than other methods. Results: In experiments comparing eighth-dose to full-dose images, PET-CM achieved an NMAE of 1.278+/-0.122%, PSNR of 33.783+/-0.824 dB, SSIM of 0.964+/-0.009, NCC of 0.968+/-0.011, HRS of 4.543, and SUV Error of 0.255+/-0.318%, with an average generation time of 62 seconds per patient. PET-CM reaches this result 12x faster than the state-of-the-art diffusion-based model. Similarly, in the quarter-dose to full-dose experiments, PET-CM delivered competitive outcomes, achieving an NMAE of 0.973+/-0.066%, PSNR of 36.172+/-0.801 dB, SSIM of 0.984+/-0.004, NCC of 0.990+/-0.005, HRS of 4.428, and SUV Error of 0.151+/-0.192% using the same generation process, underlining its high quantitative and clinical precision in both denoising scenarios.
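For readers unfamiliar with the reported metrics, the following is a minimal Python sketch of NMAE, PSNR, and NCC on flattened voxel lists. The abstract does not give the exact definitions, so the normalization choices here (NMAE divided by the reference intensity range, PSNR using the reference maximum as the peak) are assumptions and may differ from those used in the paper; the function names are illustrative only.

```python
import math


def nmae(pred, ref):
    """Mean absolute error as a percentage of the reference intensity range
    (assumed normalization; the paper may normalize differently)."""
    value_range = max(ref) - min(ref)
    mae = sum(abs(p - r) for p, r in zip(pred, ref)) / len(ref)
    return 100.0 * mae / value_range


def psnr(pred, ref):
    """Peak signal-to-noise ratio in dB, taking the reference maximum as the
    peak value (assumed convention)."""
    mse = sum((p - r) ** 2 for p, r in zip(pred, ref)) / len(ref)
    return 10.0 * math.log10(max(ref) ** 2 / mse)


def ncc(pred, ref):
    """Normalized cross-correlation: Pearson correlation of the two voxel lists."""
    mean_p = sum(pred) / len(pred)
    mean_r = sum(ref) / len(ref)
    cov = sum((p - mean_p) * (r - mean_r) for p, r in zip(pred, ref))
    var_p = sum((p - mean_p) ** 2 for p in pred)
    var_r = sum((r - mean_r) ** 2 for r in ref)
    return cov / math.sqrt(var_p * var_r)


# Toy example with four "voxels"; real use would flatten a 3D PET volume.
reference = [0.0, 1.0, 2.0, 4.0]
synthetic = [0.0, 1.0, 2.0, 3.0]
print(nmae(synthetic, reference), psnr(synthetic, reference), ncc(synthetic, reference))
```

Lower NMAE and higher PSNR and NCC indicate closer agreement between the synthetic and the true full-dose image.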
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper proposes the PET Consistency Model (PET-CM), a method for generating high-quality full-dose PET images from low-dose PET images. PET imaging involves a trade-off between image quality and radiation dose: higher image quality comes with higher radiation exposure. PET-CM addresses this through a two-step process: Gaussian noise is added to the full-dose PET image in the forward diffusion, and the noisy image is then denoised by a PET Shifted-window Vision Transformer (PET-VIT) network in the reverse diffusion. The network learns a consistency function that converts Gaussian noise directly into a clean full-dose PET image, which requires significantly less computation time than other methods. In experiments converting eighth-dose and quarter-dose PET images to full-dose images, PET-CM achieves low Normalized Mean Absolute Error (NMAE), high Peak Signal-to-Noise Ratio (PSNR) and Multiscale Structural Similarity Index (MS-SSIM), a high Human Ranking Score (HRS), and low Standardized Uptake Value (SUV) error. It generates images 12 times faster than the state-of-the-art diffusion-based model while maintaining image quality, demonstrating high quantitative and clinical accuracy in PET image denoising.
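The two-step process described above can be illustrated with a small Python sketch. It is not the authors' implementation: it follows the generic consistency-model formulation, in which the output is a noise-level-dependent blend of the noisy input and a network prediction. The constants `SIGMA_DATA` and `T_MIN`, the way the low-dose image is passed as a condition, and `toy_network` (a hypothetical stand-in for the trained PET-VIT) are all assumptions made for illustration.

```python
import math
import random

SIGMA_DATA = 0.5  # assumed data standard deviation (generic consistency-model value)
T_MIN = 0.002     # assumed smallest noise level


def add_noise(clean, sigma, rng):
    """Forward diffusion: perturb each voxel with zero-mean Gaussian noise of std sigma."""
    return [v + sigma * rng.gauss(0.0, 1.0) for v in clean]


def c_skip(t):
    """Weight on the noisy input; equals 1 at t = T_MIN."""
    return SIGMA_DATA ** 2 / ((t - T_MIN) ** 2 + SIGMA_DATA ** 2)


def c_out(t):
    """Weight on the network output; equals 0 at t = T_MIN."""
    return SIGMA_DATA * (t - T_MIN) / math.sqrt(SIGMA_DATA ** 2 + t ** 2)


def consistency_fn(noisy, t, low_dose, network):
    """Map a noisy image at noise level t directly to a clean-image estimate."""
    raw = network(noisy, t, low_dose)
    return [c_skip(t) * x + c_out(t) * r for x, r in zip(noisy, raw)]


def toy_network(noisy, t, low_dose):
    # Hypothetical placeholder for PET-VIT: simply echoes the low-dose image.
    return list(low_dose)


rng = random.Random(0)
full_dose = [1.0, 2.0, 3.0]   # toy "full-dose" voxels
low_dose = [0.9, 2.2, 2.7]    # toy "low-dose" voxels used as the condition
noisy = add_noise(full_dose, sigma=80.0, rng=rng)
estimate = consistency_fn(noisy, 80.0, low_dose, toy_network)
print(estimate)
```

The weights `c_skip` and `c_out` enforce the boundary condition that at the smallest noise level the function returns its input unchanged, which is what allows a single network evaluation to stand in for a long chain of denoising steps.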