DenoMamba: A fused state-space model for low-dose CT denoising

Şaban Öztürk, Oğuz Can Duran, Tolga Çukur
2024-09-20
Abstract: Low-dose computed tomography (LDCT) lowers potential risks linked to radiation exposure while relying on advanced denoising algorithms to maintain diagnostic quality in reconstructed images. The reigning paradigm in LDCT denoising is based on neural network models that learn data-driven image priors to separate noise evoked by dose reduction from underlying tissue signals. Naturally, the fidelity of these priors depends on the model's ability to capture the broad range of contextual features evident in CT images. Earlier convolutional neural networks (CNNs) are highly adept at efficiently capturing short-range spatial context, but their limited receptive fields reduce sensitivity to interactions over longer distances. Although transformers based on self-attention mechanisms have recently been posed to increase sensitivity to long-range context, they can suffer from suboptimal performance and efficiency due to elevated model complexity, particularly for high-resolution CT images. For high-quality restoration of LDCT images, here we introduce DenoMamba, a novel denoising method based on state-space modeling (SSM) that efficiently captures short- and long-range context in medical images. Following an hourglass architecture with encoder-decoder stages, DenoMamba employs a spatial SSM module to encode spatial context and a novel channel SSM module equipped with a secondary gated convolution network to encode latent features of channel context at each stage. Feature maps from the two modules are then consolidated with low-level input features via a convolution fusion module (CFM). Comprehensive experiments on LDCT datasets with 25% and 10% dose reduction demonstrate that DenoMamba outperforms state-of-the-art denoisers with average improvements of 1.4 dB PSNR, 1.1% SSIM, and 1.6% RMSE in recovered image quality.
Image and Video Processing, Artificial Intelligence, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper aims to address the issue of denoising low-dose computed tomography (LDCT) images. Specifically, the authors propose a new method called DenoMamba, which is based on the state space model (SSM) and aims to effectively remove noise introduced by reduced radiation dose while maintaining the diagnostic quality of the images.

### Background and Motivation

1. **Importance of Low-Dose CT**:
   - Low-dose CT (LDCT) reduces potential health risks, such as cancer, by lowering the radiation dose.
   - However, dose reduction leads to a decrease in the signal-to-noise ratio (SNR) of the images, thereby increasing noise and severely affecting image quality and diagnostic effectiveness.
2. **Limitations of Existing Methods**:
   - **Convolutional Neural Networks (CNN)**: While CNNs excel at capturing short-range spatial context, their limited receptive field restricts sensitivity to long-range context, especially when dealing with high-resolution CT images.
   - **Transformer Models**: Although transformers enhance sensitivity to long-range context through self-attention mechanisms, their high model complexity can impact performance and efficiency, particularly when processing high-resolution images.

### Solution

1. **DenoMamba Model**:
   - **Architecture**: DenoMamba adopts an hourglass structure, comprising encoder and decoder stages, each consisting of multiple FuseSSM blocks.
   - **FuseSSM Block**: Each FuseSSM block includes a spatial SSM module and a channel SSM module, which are used to extract spatial and channel context features, respectively. These features are integrated through a convolutional fusion module (CFM).
   - **Innovations**:
     - **Spatial SSM Module**: Extracts spatial context features through linear embedding, nonlinear activation, and deep convolutional encoding, followed by state space modeling.
     - **Channel SSM Module**: Extracts channel context features by transposing the input sequence, performing state space modeling, and then using a gated convolutional network.
     - **Residual Connections**: Retain low-level representations through residual connections, enhancing the model's robustness and performance.
2. **Experimental Results**:
   - Comprehensive experiments were conducted on LDCT datasets with 25% and 10% dose reductions. The results show that DenoMamba outperforms existing denoising methods in restoring image quality, with an average improvement of 1.4 dB PSNR, 1.1% SSIM, and 1.6% RMSE.

### Summary

DenoMamba effectively addresses the issue of denoising low-dose CT images by leveraging the advantages of state space models. It not only surpasses existing methods in performance but also maintains a low level of model complexity, making it suitable for processing high-resolution medical images.
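
### Illustrative Sketch

The data flow of a FuseSSM block, as described under Solution, can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a plain time-invariant linear recurrence in place of the selective (input-dependent) SSM used by Mamba-style models, omits the linear embeddings, activations, and gated convolution network, and stands in a per-pixel linear map (equivalent to a 1x1 convolution) for the convolution fusion module. The names `ssm_scan`, `fuse_ssm_block`, and `w_fuse` are hypothetical.

```python
import numpy as np

def ssm_scan(x, a, b, c):
    """Toy diagonal state-space recurrence along axis 0 of x.

    x: (L, D) sequence of L tokens with D features.
    a, b, c: (N,) parameters shared by all features (an assumption).
    h_t = a * h_{t-1} + b * x_t,   y_t = sum_n c_n * h_t[n]
    """
    h = np.zeros((x.shape[1], a.shape[0]))   # hidden state, (D, N)
    y = np.empty_like(x, dtype=float)
    for t in range(x.shape[0]):
        h = a * h + b * x[t][:, None]        # state update
        y[t] = h @ c                         # readout, (D,)
    return y

def fuse_ssm_block(feat, a, b, c, w_fuse):
    """Spatial scan + channel scan + fusion with the input features.

    feat: (C, H, W) feature map; w_fuse: (3C, C) fusion weights.
    """
    C, H, W = feat.shape
    tokens = feat.reshape(C, H * W).T        # (H*W, C): one token per pixel
    spatial = ssm_scan(tokens, a, b, c)      # recurrence runs across pixels
    # Transposing makes each channel a token, so the recurrence
    # now runs across channels instead of pixels.
    channel = ssm_scan(tokens.T, a, b, c).T  # back to (H*W, C)
    # Concatenate both context maps with the low-level input features
    # and mix them per pixel (stand-in for the convolution fusion module).
    stacked = np.concatenate([spatial, channel, tokens], axis=1)
    fused = stacked @ w_fuse                 # (H*W, C)
    return fused.T.reshape(C, H, W)

rng = np.random.default_rng(0)
feat = rng.standard_normal((4, 8, 8))
a = np.array([0.9, 0.5])
b = np.array([1.0, 1.0])
c = np.array([0.5, 0.5])
w_fuse = rng.standard_normal((12, 4)) / np.sqrt(12)
out = fuse_ssm_block(feat, a, b, c, w_fuse)
print(out.shape)  # (4, 8, 8)
```

The point of the sketch is the transposition: the same recurrence captures spatial context when tokens are pixels and channel context when tokens are channels, and its cost grows linearly with sequence length rather than quadratically as in self-attention.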