Abstract:Low-dose computed tomography (LDCT) is an effective way to reduce radiation exposure for patients. However, it will increase the noise of reconstructed CT images and affect the precision of clinical diagnosis. The majority of the current deep learning-based denoising methods are built on convolutional neural networks (CNNs), which concentrate on local information and have little capacity for multiple structures modeling. Transformer structures are capable of computing each pixel's response on a global scale, but their extensive computation requirements prevent them from being widely used in medical image processing. To reduce the impact of LDCT scans on patients, this paper aims to develop an image post-processing method by combining CNN and Transformer structures. This method can obtain a high-quality images from LDCT. A hybrid CNN-Transformer (HCformer) codec network model is proposed for LDCT image denoising. A neighborhood feature enhancement (NEF) module is designed to introduce the local information into the Transformer's operation, and the representation of adjacent pixel information in the LDCT image denoising task is increased. The shifting window method is utilized to lower the computational complexity of the network model and overcome the problems that come with computing the MSA (Multi-head self-attention) process in a fixed window. Meanwhile, W/SW-MSA (Windows/Shifted window Multi-head self-attention) is alternately used in two layers of the Transformer to gain the information interaction between various Transformer layers. This approach can successfully decrease the Transformer's overall computational cost. The AAPM 2016 LDCT grand challenge dataset is employed for ablation and comparison experiments to demonstrate the viability of the proposed LDCT denoising method. Per the experimental findings, HCformer can increase the image quality metrics SSIM, HuRMSE and FSIM from 0.8017, 34.1898, and 0.6885 to 0.8507, 17.7213, and 0.7247, respectively. Additionally, the proposed HCformer algorithm will preserves image details while it reduces noise. In this paper, an HCformer structure is proposed based on deep learning and evaluated by using the AAPM LDCT dataset. Both the qualitative and quantitative comparison results confirm that the proposed HCformer outperforms other methods. The contribution of each component of the HCformer is also confirmed by the ablation experiments. HCformer can combine the advantages of CNN and Transformer, and it has great potential for LDCT image denoising and other tasks.

Hformer: highly efficient vision transformer for low-dose CT denoising

HCformer: Hybrid CNN-Transformer for LDCT Image Denoising

CTformer: convolution-free Token2Token dilated vision transformer for low-dose CT denoising

CNN and Multi-Feature Extraction Based Denoising of CT Images

TransCT: Dual-path Transformer for Low Dose Computed Tomography

[Letter: Progress in the therapy of duodenal ulcer. How should one prove it?].

Pure Vision Transformer (CT-ViT) with Noise2Neighbors Interpolation for Low-Dose CT Image Denoising

Degradation Adaption Local-to-Global Transformer for Low-Dose CT Image Denoising

A dense and U-shaped transformer with dual-domain multi-loss function for sparse-view CT reconstruction

On the Complexity of Some Extensions of RCG Parsing

A new visual State Space Model for low-dose CT denoising

Low-Dose CT Denoising Algorithm Based on Image Cartoon Texture Decomposition

Noise‐assisted hybrid attention networks for low‐dose PET and CT denoising

MRFormer: Multiscale retractable transformer for medical image progressive denoising via noise level estimation

HPIDN: A Hierarchical prior-guided iterative denoising network with global-local fusion for enhancing low-dose CT images

Deep High-Resolution Network for Low Dose X-ray CT Denoising

DenoMamba: A fused state-space model for low-dose CT denoising

Parallel processing model for low-dose computed tomography image denoising

A Multi-Resolution Denoising Method for Low-Dose CT Based on the Reconstruction of Wavelet High-Frequency Channel

CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising

LIT-Former: Linking In-plane and Through-plane Transformers for Simultaneous CT Image Denoising and Deblurring