Cascaded Convolutional Neural Networks with Perceptual Loss for Low Dose CT Denoising

Sepehr Ataei, Javad Alirezaie, Paul Babyn
DOI: https://doi.org/10.48550/arXiv.2006.14738
2020-06-26
Abstract: Low-dose CT denoising research aims to reduce the risks of radiation exposure to patients. Recently, researchers have used deep learning to denoise low-dose CT images with promising results. However, approaches that use mean-squared error (MSE) tend to over-smooth the image, resulting in the loss of fine structural details in low-contrast regions. These regions are often crucial for diagnosis and must be preserved for low-dose CT to be used effectively in practice. In this work, we use a cascade of two neural networks: the first aims to reconstruct normal-dose CT from low-dose CT by minimizing perceptual loss, and the second predicts the difference between the ground truth and the prediction of the perceptual-loss network. We show that our method outperforms related works and more effectively reconstructs fine structural details in low-contrast regions of the image.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses how to reduce radiation exposure in low-dose computed tomography (CT) imaging without sacrificing image quality. Existing deep-learning denoisers for low-dose CT typically use mean-squared error (MSE) as the loss function, which tends to over-smooth the image and erase fine structural details in low-contrast regions. These details are often crucial for diagnosis and must be preserved. To address this, the paper proposes a cascade of two convolutional neural networks combined with perceptual loss: the first network reconstructs a normal-dose CT image from the low-dose input by minimizing perceptual loss, and the second predicts the difference between the ground-truth image and the first network's output. The authors report that this method outperforms related methods and better preserves fine detail, especially in low-contrast regions.
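The data flow of the two stages can be illustrated with a minimal, pure-Python sketch. It is not the authors' implementation. The function names, the 1-D "images", and the finite-difference feature extractor are all hypothetical stand-ins. Perceptual loss is commonly computed on feature maps of a pretrained network, and a simple first difference takes that role here. The sketch also assumes that the final estimate is the first stage's output plus the predicted difference, which the summary does not state explicitly.

```python
# Toy sketch of the two-stage cascade. All names and the 1-D signals are
# hypothetical stand-ins, not taken from the paper.

def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def features(signal):
    """Stand-in feature extractor (assumption): first differences."""
    return [signal[i + 1] - signal[i] for i in range(len(signal) - 1)]

def perceptual_loss(pred, target):
    """MSE measured in feature space instead of pixel space."""
    return mse(features(pred), features(target))

def residual_target(ground_truth, stage1_output):
    """Training target for the second network: ground truth minus stage 1."""
    return [g - p for g, p in zip(ground_truth, stage1_output)]

def cascade_output(stage1_output, predicted_residual):
    """Assumed final estimate: stage-1 output plus the predicted residual."""
    return [p + r for p, r in zip(stage1_output, predicted_residual)]

# A stage-1 output that matches the target's structure but is offset by 0.5.
normal_dose = [0.0, 1.0, 1.0, 0.0]
stage1 = [0.5, 1.5, 1.5, 0.5]

feature_error = perceptual_loss(stage1, normal_dose)  # structure already matches
pixel_error = mse(stage1, normal_dose)                # intensity offset remains
target = residual_target(normal_dose, stage1)         # what stage 2 would learn
final = cascade_output(stage1, target)                # with a perfect stage 2
```

In this toy case the stage-1 output has zero error in feature space but a nonzero pixel-wise error, and adding the ideal residual recovers the normal-dose signal exactly. A trained second network would only approximate that residual.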