Noise suppression in photon-counting CT using unsupervised Poisson flow generative models

Dennis Hein,Staffan Holmin,Timothy Szczykutowicz,Jonathan S Maltz,Mats Danielsson,Ge Wang,Mats Persson
2024-01-10
Abstract:Deep learning has proven to be important for CT image denoising. However, such models are usually trained under supervision, requiring paired data that may be difficult to obtain in practice. Diffusion models offer unsupervised means of solving a wide range of inverse problems via posterior sampling. In particular, using the estimated unconditional score function of the prior distribution, obtained via unsupervised learning, one can sample from the desired posterior via hijacking and regularization. However, due to the iterative solvers used, the number of function evaluations (NFE) required may be orders of magnitudes larger than for single-step samplers. In this paper, we present a novel image denoising technique for photon-counting CT by extending the unsupervised approach to inverse problem solving to the case of Poisson flow generative models (PFGM)++. By hijacking and regularizing the sampling process we obtain a single-step sampler, that is NFE=1. Our proposed method incorporates posterior sampling using diffusion models as a special case. We demonstrate that the added robustness afforded by the PFGM++ framework yields significant performance gains. Our results indicate competitive performance compared to popular supervised, including state-of-the-art diffusion-style models with NFE=1 (consistency models), unsupervised, and non-deep learning-based image denoising techniques, on clinical low-dose CT data and clinical images from a prototype photon-counting CT system developed by GE HealthCare.
Medical Physics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is image denoising in low - dose and photon - counting computed tomography (CT). Specifically, the authors propose a novel image denoising technique based on the unsupervised Poisson flow generative model (PFGM++) to reduce noise in CT images. This method is particularly suitable for image denoising in the absence of paired data, thus overcoming the problem that supervised learning methods are difficult to obtain paired data in practical applications. ### Background and Problem Description of the Paper X - ray computed tomography (CT) is a medical imaging technique widely used for disease diagnosis and treatment planning. However, even low - dose ionizing radiation has potential risks, so researchers have been striving to improve the diagnostic quality while keeping the dose as low as possible. Photon - counting CT (PCCT) can reduce the dose through photon - energy weighting and elimination of electronic noise by the latest CT detector technology, and can achieve higher spatial resolution and energy - resolved imaging in a single exposure. However, high - resolution spatial or energy imaging will reduce the number of photons in each voxel or energy bin, thereby increasing image noise. Therefore, excellent denoising performance is required, which may exceed the capabilities of the existing state - of - the - art denoising methods. ### Challenges of Deep Learning Methods Although deep learning methods have achieved remarkable success in low - dose and photon - counting CT image denoising, these methods usually require supervised learning for training, which requires paired datasets. In practical applications, it is very difficult to obtain paired clinical images, especially perfectly paired and registered images. In addition, methods based on simulating low - dose scans or adding noise maps from phantom scans may be affected by inaccurate system modeling or mismatches between patient and phantom geometries. In PCCT, the pulse - pileup effect behaves differently in high - dose and low - dose scans, further confounding these training schemes. Therefore, unsupervised and self - supervised methods are becoming more and more common. ### Main Contributions of the Paper This paper proposes a new image denoising technique, extending the posterior sampling Poisson flow generative model (PPFM) to the case without paired data. The main contributions include: 1. Proposing an unsupervised PPFM that can perform image denoising without paired data. 2. Demonstrating that the network can be efficiently trained on random patches extracted from real data and denoise full - resolution images by manipulating the sampling process. Using randomly extracted patches during the training process can save video memory and provide additional regularization. 3. The proposed method includes a posterior sampling diffusion model (EDM) as a special case when \( D \to \infty \). Experimental results show that choosing \( D \) as a hyperparameter can improve performance. 4. Evaluating the proposed method on clinical low - dose CT images and clinical images of the prototype PCCT system developed by GE HealthCare, demonstrating its competitiveness compared to the current state - of - the - art single - step sampling diffusion models (such as the consistency model). It is worth noting that the consistency model is trained by supervised learning, while the proposed method is unsupervised. Despite having more relaxed data requirements, the proposed method performs well in both quantitative and qualitative evaluations. ### Method Overview The paper proposes two main components: 1. A PFGM++ trained in an unsupervised manner for unconditional image generation. 2. A sampling scheme that ensures consistency with the input conditional image by regularizing the generation process. By combining the information of the prior distribution and the modified sampling scheme, this method can sample from the desired posterior distribution \( p(y|c) \) to solve the inverse problem. This strategy has been successfully applied to various inverse problems in diffusion models, and this paper extends it to PFGM++. ### Experiments and Results The paper conducted experiments on the Mayo low - dose CT dataset and clinical images of the prototype PCCT system developed by GE HealthCare. Quantitative evaluation metrics include the structural similarity index (SSIM), peak signal - to - noise ratio (PSNR), and perceptual similarity loss (LPIPS). Experimental results show that the proposed method performs well in both quantitative and qualitative evaluations, especially among unsupervised methods. Compared with supervised methods, although there is a slight performance gap, in the absence of paired data, its performance is very close to that of supervised methods.