Abstract:Inverse problems span across diverse fields. In medical contexts, computed tomography (CT) plays a crucial role in reconstructing a patient's internal structure, presenting challenges due to artifacts caused by inherently ill-posed inverse problems. Previous research advanced image quality via post-processing and deep unrolling algorithms but faces challenges, such as extended convergence times with ultra-sparse data. Despite enhancements, resulting images often show significant artifacts, limiting their effectiveness for real-world diagnostic applications. We aim to explore deep second-order unrolling algorithms for solving imaging inverse problems, emphasizing their faster convergence and lower time complexity compared to common first-order methods like gradient descent. In this paper, we introduce QN-Mixer, an algorithm based on the quasi-Newton approach. We use learned parameters through the BFGS algorithm and introduce Incept-Mixer, an efficient neural architecture that serves as a non-local regularization term, capturing long-range dependencies within images. To address the computational demands typically associated with quasi-Newton algorithms that require full Hessian matrix computations, we present a memory-efficient alternative. Our approach intelligently downsamples gradient information, significantly reducing computational requirements while maintaining performance. The approach is validated through experiments on the sparse-view CT problem, involving various datasets and scanning protocols, and is compared with post-processing and deep unrolling state-of-the-art approaches. Our method outperforms existing approaches and achieves state-of-the-art performance in terms of SSIM and PSNR, all while reducing the number of unrolling iterations required.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to effectively remove artifacts, improve image quality, and reduce the demand for computational resources in sparse - view computed tomography (Sparse - View CT) reconstruction. Specifically, existing methods have problems such as long convergence time, high computational cost, and difficulty in capturing long - range dependencies in images when dealing with ultra - sparse data. These problems limit the effectiveness of these methods in practical diagnostic applications. To this end, the authors propose a new deep second - order expansion network QN - Mixer based on the quasi - Newton method, aiming to improve the quality and efficiency of image reconstruction through faster convergence and lower time complexity. ### Main contributions: 1. **Introduction of QN - Mixer**: This is a new deep second - order expansion network that uses the implicit BFGS algorithm to approximate the Hessian matrix and combines the regularization term of deep learning to capture long - range dependencies in images. 2. **Proposal of Incept - Mixer**: This is a neural architecture that, as a non - local regularization term, combines the advantages of the multi - layer perceptron mixer (MLP - Mixer) and the Inception architecture and can better capture multi - scale information and long - range dependencies. 3. **Experimental verification**: Through extensive experiments and data set verification, it is proved that QN - Mixer has superior performance in sparse - view CT reconstruction, especially excellent in quantitative indicators (such as SSIM and PSNR), and at the same time requires fewer iterations than first - order expansion networks. ### Method overview: - **Quasi - Newton method**: QN - Mixer is based on the quasi - Newton method. It approximates the inverse of the Hessian matrix through the BFGS algorithm, thereby guiding the optimization direction and avoiding unnecessary zigzag paths. - **Incept - Mixer**: This module combines the attention mechanism of MLP - Mixer and the local invariant feature extraction ability of Inception blocks and can effectively capture long - range dependencies in images. - **Implicit BFGS update**: In order to reduce the computational burden, QN - Mixer operates in the latent space of gradient information and realizes the down - sampling and reconstruction of gradient information through an encoder and a decoder. ### Experimental results: - **Quantitative evaluation**: The experimental results on the AAPM and DeepLesion data sets show that QN - Mixer obtains the best PSNR and SSIM values under different projection views and noise levels. - **Visual comparison**: Compared with existing methods, QN - Mixer performs better in removing artifacts and preserving structures. - **OOD test**: On CT images with random white circles, QN - Mixer still performs well, further verifying its robustness and generalization ability. In conclusion, this paper solves the key problems in sparse - view CT reconstruction by introducing QN - Mixer and Incept - Mixer, providing a new solution for medical image processing.

QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

Deep Fusion Network Based Sparse View CT Reconstructions for Clinical Diagnostic Scanners

Generalized Deep Iterative Reconstruction for Sparse-View CT Imaging

Image-to-Image MLP-mixer for Image Reconstruction

Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction

Iterative reconstruction for limited-angle CT using implicit neural representation

CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing

Deep Inertia $L_p$ Half-Quadratic Splitting Unrolling Network for Sparse View CT Reconstruction

DREAM-Net: Deep Residual Error Iterative Minimization Network for Sparse-View CT Reconstruction

Sparse-view CT Reconstruction with 3D Gaussian Volumetric Representation

MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

MetaInv-Net: Meta Inversion Network for Sparse View CT Image Reconstruction

Deep-Neural-Network-Based Sinogram Synthesis for Sparse-View CT Image Reconstruction

I³Net: Inter-Intra-Slice Interpolation Network for Medical Slice Synthesis

Re-UNet: A Novel Multi-scale Reverse U-shaped Network Architecture for Low-dose CT Image Reconstruction

Sparse-view and limited-angle CT reconstruction with untrained networks and deep image prior

CAIR: Combining integrated attention with iterative optimization learning for sparse-view CT reconstruction

Enhancing Low-dose CT Image Reconstruction by Integrating Supervised and Unsupervised Learning

Deep-neural-network based sinogram synthesis for sparse-view CT image reconstruction

A Deep RNN for CT Image Reconstruction

Computationally efficient deep neural network for computed tomography image reconstruction