Dual-Domain Reconstruction Network Incorporating Multi-Level Wavelet Transform and Recurrent Convolution for Sparse View Computed Tomography Imaging

Juncheng Lin, Jialin Li, Jiazhen Dou, Liyun Zhong, Jianglei Di, Yuwen Qin
DOI: https://doi.org/10.3390/tomography10010011
2024-01-16
Tomography
Abstract: Sparse view computed tomography (SVCT) aims to reduce the number of X-ray projection views required for reconstructing the cross-sectional image of an object. While SVCT significantly reduces X-ray radiation dose and speeds up scanning, insufficient projection data give rise to severe streak artifacts and blurring in reconstructed images, thereby reducing the diagnostic accuracy of CT. To address this challenge, a dual-domain reconstruction network incorporating multi-level wavelet transform and recurrent convolution is proposed in this paper. The dual-domain network is composed of a sinogram domain network (SDN) and an image domain network (IDN). Multi-level wavelet transform is employed in both IDN and SDN to decompose sinograms and CT images into distinct frequency components, which are then processed through separate network branches to recover detailed information within their respective frequency bands. To capture global textures, artifacts, and shallow features in sinograms and CT images, a recurrent convolution unit (RCU) based on convolutional long short-term memory (Conv-LSTM) is designed, which can model their long-range dependencies through recurrent calculation. Additionally, a self-attention-based multi-level frequency feature normalization fusion (MFNF) block is proposed to assist in recovering high-frequency components by aggregating low-frequency components. Finally, an edge loss function based on the Laplacian of Gaussian (LoG) is designed as the regularization term for enhancing the recovery of high-frequency edge structures. The experimental results demonstrate the effectiveness of our approach in reducing artifacts and enhancing the reconstruction of intricate structural details across various sparse views and noise levels. Our method excels in both performance and robustness, as evidenced by its superior outcomes in numerous qualitative and quantitative assessments, surpassing contemporary state-of-the-art CNN-based or Transformer-based reconstruction methods.
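The multi-level wavelet decomposition mentioned in the abstract can be illustrated with a minimal NumPy sketch. It assumes an orthonormal Haar wavelet and even image dimensions at every level; the abstract does not state which wavelet family or how many levels the authors use, and the function names `haar_dwt2` and `multilevel_haar` are hypothetical. Each level splits the current low-frequency band into one coarser low-frequency band and three high-frequency detail bands, which is the kind of per-band input that separate network branches could then process.

```python
import numpy as np

def haar_dwt2(x):
    """One level of a 2D orthonormal Haar transform (assumed wavelet).

    x must have even height and width. Returns the low-frequency band
    and a tuple of three high-frequency detail bands.
    """
    s = np.sqrt(2.0)
    # Pairwise sums and differences of adjacent rows.
    lo = (x[0::2, :] + x[1::2, :]) / s
    hi = (x[0::2, :] - x[1::2, :]) / s
    # Pairwise sums and differences of adjacent columns.
    ll = (lo[:, 0::2] + lo[:, 1::2]) / s
    lh = (lo[:, 0::2] - lo[:, 1::2]) / s
    hl = (hi[:, 0::2] + hi[:, 1::2]) / s
    hh = (hi[:, 0::2] - hi[:, 1::2]) / s
    return ll, (lh, hl, hh)

def multilevel_haar(x, levels):
    """Repeatedly decompose the low-frequency band `levels` times."""
    ll = np.asarray(x, dtype=np.float64)
    details = []  # details[0] is the finest level
    for _ in range(levels):
        ll, bands = haar_dwt2(ll)
        details.append(bands)
    return ll, details

# Usage: a 2-level decomposition of a toy 8x8 "image".
image = np.arange(64, dtype=np.float64).reshape(8, 8)
low, details = multilevel_haar(image, levels=2)
```

Because the transform is orthonormal, the total energy of all sub-bands equals the energy of the input, so no information is lost by the decomposition itself.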
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?
The paper addresses image degradation in sparse view computed tomography (SVCT): because the projection data are insufficient, reconstructed images suffer from severe streak artifacts and blurring, which reduces the diagnostic accuracy of CT. SVCT lowers the radiation dose and shortens scan time by reducing the number of X-ray projection views, but reconstruction quality declines as a result, especially under low-dose and high-noise conditions.
To solve these problems, the paper proposes a dual-domain reconstruction network that combines multi-level wavelet transform and recurrent convolution. The network consists of a sinogram domain network (SDN) and an image domain network (IDN). In both networks, a multi-level wavelet transform decomposes the sinogram or CT image into different frequency components, and each component is processed by an independent network branch to restore detail within its frequency band. To capture global textures, artifacts, and shallow features, a recurrent convolution unit (RCU) based on convolutional long short-term memory (Conv-LSTM) is designed, which models long-range dependencies through recurrent calculation. A self-attention-based multi-level frequency feature normalization fusion (MFNF) block is also proposed to enhance the restoration of high-frequency components. Finally, an edge loss function based on the Laplacian of Gaussian (LoG) is designed as a regularization term to improve the restoration of high-frequency edge structures.
The main contributions of the paper are as follows:
1. A new CT reconstruction model is proposed that integrates multi-level wavelet transform with recurrent convolution units. The interpolated sinogram and the CT image are decomposed into different frequency components by multi-level wavelet transform, and each component is restored by its own network branch.
2. A recurrent convolution unit (RCU) with embedded Conv-LSTM is designed to capture global redundant texture information and global artifact features in the different frequency components.
3. In the high-frequency restoration branch, an improved MFNF block is designed. It aggregates the low-frequency components through a self-attention-based normalization strategy and combines them with an adaptive channel soft threshold function (ACSTF), which filters noise and useless features along the channel dimension, to enhance the restoration of high-frequency feature information.
4. In the image-domain loss function, an edge-loss regularization term based on the Laplacian of Gaussian (LoG) is designed to improve the fidelity of high-frequency edge details and reduce the structural blurring caused by the mean squared error (MSE) loss.
With these components, the proposed model reduces artifacts and improves the reconstruction of complex structural details, and it applies to CT image reconstruction across various sparse view counts and noise levels.
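The idea of adding a LoG-based edge term to an MSE loss can be sketched as follows. This is an illustration under stated assumptions, not the authors' loss: the kernel size, `sigma`, the weight of the edge term, and the use of a squared difference between LoG responses are all assumed, and the names `log_kernel`, `filter2d`, and `edge_regularized_loss` are hypothetical. The LoG filter responds to edges and is close to zero in flat regions, so the extra term penalizes mismatched edge structure that a plain MSE tends to smooth over.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def log_kernel(size=5, sigma=1.0):
    """Sampled Laplacian-of-Gaussian kernel (odd `size`), shifted to sum to zero."""
    r = size // 2
    y, x = np.mgrid[-r:r + 1, -r:r + 1]
    q = (x ** 2 + y ** 2) / (2.0 * sigma ** 2)
    k = -(1.0 / (np.pi * sigma ** 4)) * (1.0 - q) * np.exp(-q)
    return k - k.mean()  # zero-sum: flat regions give zero response

def filter2d(img, kernel):
    """Same-size 2D filtering with reflect padding (kernel is symmetric)."""
    r = kernel.shape[0] // 2
    padded = np.pad(img, r, mode="reflect")
    windows = sliding_window_view(padded, kernel.shape)  # shape (H, W, k, k)
    return np.einsum("ijkl,kl->ij", windows, kernel)

def edge_regularized_loss(pred, target, weight=0.1, size=5, sigma=1.0):
    """MSE plus an assumed squared-error penalty on LoG edge maps."""
    k = log_kernel(size, sigma)
    mse = np.mean((pred - target) ** 2)
    edge = np.mean((filter2d(pred, k) - filter2d(target, k)) ** 2)
    return mse + weight * edge

# Usage: compare a reference image with a noisy version of it.
rng = np.random.default_rng(0)
reference = rng.random((16, 16))
noisy = reference + 0.1 * rng.random((16, 16))
loss = edge_regularized_loss(noisy, reference)
```

In a training setup the same computation would be written with a differentiable framework; NumPy is used here only to keep the sketch self-contained.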