Magnetic resonance image denoising for Rician noise using a novel hybrid transformer‐CNN network (HTC‐net) and self‐supervised pretraining
Shengjie Shi,Cheng Wang,Sa Xiao,Haidong Li,Xiuchao Zhao,Fumin Guo,Lei Shi,Xin Zhou
DOI: https://doi.org/10.1002/mp.17562
IF: 4.506
2024-12-08
Medical Physics
Abstract:Background Magnetic resonance imaging (MRI) is a crucial technique for both scientific research and clinical diagnosis. However, noise generated during MR data acquisition degrades image quality, particularly in hyperpolarized (HP) gas MRI. While deep learning (DL) methods have shown promise for MR image denoising, most of them fail to adequately utilize the long‐range information which is important to improve denoising performance. Furthermore, the sample size of paired noisy and noise‐free MR images also limits denoising performance. Purpose To develop an effective DL method that enhances denoising performance and reduces the requirement of paired MR images by utilizing the long‐range information and pretraining. Methods In this work, a hybrid Transformer‐convolutional neural network (CNN) network (HTC‐net) and a self‐supervised pretraining strategy are proposed, which effectively enhance the denoising performance. In HTC‐net, a CNN branch is exploited to extract the local features. Then a Transformer‐CNN branch with two parallel encoders is designed to capture the long‐range information. Within this branch, a residual fusion block (RFB) with a residual feature processing module and a feature fusion module is proposed to aggregate features at different resolutions extracted by two parallel encoders. After that, HTC‐net exploits the comprehensive features from the CNN branch and the Transformer‐CNN branch to accurately predict noise‐free MR images through a reconstruction module. To further enhance the performance on limited MRI datasets, a self‐supervised pretraining strategy is proposed. This strategy employs self‐supervised denoising to equip the HTC‐net with denoising capabilities during pretraining, and then the pre‐trained parameters are transferred to facilitate subsequent supervised training. Results Experimental results on the pulmonary HP 129Xe MRI dataset (1059 images) and IXI dataset (5000 images) all demonstrate the proposed method outperforms the state‐of‐the‐art methods, exhibiting superior preservation of edges and structures. Quantitatively, on the pulmonary HP 129Xe MRI dataset, the proposed method outperforms the state‐of‐the‐art methods by 0.254–0.597 dB in PSNR and 0.007–0.013 in SSIM. On the IXI dataset, the proposed method outperforms the state‐of‐the‐art methods by 0.3–0.927 dB in PSNR and 0.003–0.016 in SSIM. Conclusions The proposed method can effectively enhance the quality of MR images, which helps improve the diagnosis accuracy in clinical.
radiology, nuclear medicine & medical imaging