Abstract:Low-dose computed tomography (LDCT) is an effective way to reduce radiation exposure for patients. However, it will increase the noise of reconstructed CT images and affect the precision of clinical diagnosis. The majority of the current deep learning-based denoising methods are built on convolutional neural networks (CNNs), which concentrate on local information and have little capacity for multiple structures modeling. Transformer structures are capable of computing each pixel's response on a global scale, but their extensive computation requirements prevent them from being widely used in medical image processing. To reduce the impact of LDCT scans on patients, this paper aims to develop an image post-processing method by combining CNN and Transformer structures. This method can obtain a high-quality images from LDCT. A hybrid CNN-Transformer (HCformer) codec network model is proposed for LDCT image denoising. A neighborhood feature enhancement (NEF) module is designed to introduce the local information into the Transformer's operation, and the representation of adjacent pixel information in the LDCT image denoising task is increased. The shifting window method is utilized to lower the computational complexity of the network model and overcome the problems that come with computing the MSA (Multi-head self-attention) process in a fixed window. Meanwhile, W/SW-MSA (Windows/Shifted window Multi-head self-attention) is alternately used in two layers of the Transformer to gain the information interaction between various Transformer layers. This approach can successfully decrease the Transformer's overall computational cost. The AAPM 2016 LDCT grand challenge dataset is employed for ablation and comparison experiments to demonstrate the viability of the proposed LDCT denoising method. Per the experimental findings, HCformer can increase the image quality metrics SSIM, HuRMSE and FSIM from 0.8017, 34.1898, and 0.6885 to 0.8507, 17.7213, and 0.7247, respectively. Additionally, the proposed HCformer algorithm will preserves image details while it reduces noise. In this paper, an HCformer structure is proposed based on deep learning and evaluated by using the AAPM LDCT dataset. Both the qualitative and quantitative comparison results confirm that the proposed HCformer outperforms other methods. The contribution of each component of the HCformer is also confirmed by the ablation experiments. HCformer can combine the advantages of CNN and Transformer, and it has great potential for LDCT image denoising and other tasks.

Efficient Lightweight Image Denoising with Triple Attention Transformer

Vision Transformers for Single Image Dehazing

EWT: Efficient Wavelet-Transformer for Single Image Denoising

A Dynamic Network with Transformer for Image Denoising

DDT: Dual-branch Deformable Transformer for Image Denoising

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.

Exploration of Lightweight Single Image Denoising with Transformers and Truly Fair Training

CTFCD: Channel Transformer Based on Full Convolutional Decoder for Single Image Deraining

LGIT: local–global interaction transformer for low-light image denoising

An efficient lightweight network for image denoising using progressive residual and convolutional attention feature fusion

TripleFormer: improving transformer-based image classification method using multiple self-attention inputs

Self-Supervised Image Denoising for Real-World Images with Context-aware Transformer

An efficient multi‐scale transformer for satellite image dehazing

Frequency domain-enhanced transformer for single image deraining

HCformer: Hybrid CNN-Transformer for LDCT Image Denoising

Xformer: Hybrid X-Shaped Transformer for Image Denoising

MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing

Enhancing low-light images via skip cross-attention fusion and multi-scale lightweight transformer

Enhanced Frequency Fusion Network with Dynamic Hash Attention for image denoising

A cross Transformer for image denoising