QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

Ishak Ayad,Nicolas Larue,Maï K. Nguyen
2024-03-29
Abstract:Inverse problems span across diverse fields. In medical contexts, computed tomography (CT) plays a crucial role in reconstructing a patient's internal structure, presenting challenges due to artifacts caused by inherently ill-posed inverse problems. Previous research advanced image quality via post-processing and deep unrolling algorithms but faces challenges, such as extended convergence times with ultra-sparse data. Despite enhancements, resulting images often show significant artifacts, limiting their effectiveness for real-world diagnostic applications. We aim to explore deep second-order unrolling algorithms for solving imaging inverse problems, emphasizing their faster convergence and lower time complexity compared to common first-order methods like gradient descent. In this paper, we introduce QN-Mixer, an algorithm based on the quasi-Newton approach. We use learned parameters through the BFGS algorithm and introduce Incept-Mixer, an efficient neural architecture that serves as a non-local regularization term, capturing long-range dependencies within images. To address the computational demands typically associated with quasi-Newton algorithms that require full Hessian matrix computations, we present a memory-efficient alternative. Our approach intelligently downsamples gradient information, significantly reducing computational requirements while maintaining performance. The approach is validated through experiments on the sparse-view CT problem, involving various datasets and scanning protocols, and is compared with post-processing and deep unrolling state-of-the-art approaches. Our method outperforms existing approaches and achieves state-of-the-art performance in terms of SSIM and PSNR, all while reducing the number of unrolling iterations required.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively remove artifacts, improve image quality, and reduce the demand for computational resources in sparse - view computed tomography (Sparse - View CT) reconstruction. Specifically, existing methods have problems such as long convergence time, high computational cost, and difficulty in capturing long - range dependencies in images when dealing with ultra - sparse data. These problems limit the effectiveness of these methods in practical diagnostic applications. To this end, the authors propose a new deep second - order expansion network QN - Mixer based on the quasi - Newton method, aiming to improve the quality and efficiency of image reconstruction through faster convergence and lower time complexity. ### Main contributions: 1. **Introduction of QN - Mixer**: This is a new deep second - order expansion network that uses the implicit BFGS algorithm to approximate the Hessian matrix and combines the regularization term of deep learning to capture long - range dependencies in images. 2. **Proposal of Incept - Mixer**: This is a neural architecture that, as a non - local regularization term, combines the advantages of the multi - layer perceptron mixer (MLP - Mixer) and the Inception architecture and can better capture multi - scale information and long - range dependencies. 3. **Experimental verification**: Through extensive experiments and data set verification, it is proved that QN - Mixer has superior performance in sparse - view CT reconstruction, especially excellent in quantitative indicators (such as SSIM and PSNR), and at the same time requires fewer iterations than first - order expansion networks. ### Method overview: - **Quasi - Newton method**: QN - Mixer is based on the quasi - Newton method. It approximates the inverse of the Hessian matrix through the BFGS algorithm, thereby guiding the optimization direction and avoiding unnecessary zigzag paths. - **Incept - Mixer**: This module combines the attention mechanism of MLP - Mixer and the local invariant feature extraction ability of Inception blocks and can effectively capture long - range dependencies in images. - **Implicit BFGS update**: In order to reduce the computational burden, QN - Mixer operates in the latent space of gradient information and realizes the down - sampling and reconstruction of gradient information through an encoder and a decoder. ### Experimental results: - **Quantitative evaluation**: The experimental results on the AAPM and DeepLesion data sets show that QN - Mixer obtains the best PSNR and SSIM values under different projection views and noise levels. - **Visual comparison**: Compared with existing methods, QN - Mixer performs better in removing artifacts and preserving structures. - **OOD test**: On CT images with random white circles, QN - Mixer still performs well, further verifying its robustness and generalization ability. In conclusion, this paper solves the key problems in sparse - view CT reconstruction by introducing QN - Mixer and Incept - Mixer, providing a new solution for medical image processing.