Magnetic Resonance Image Processing Transformer for General Reconstruction

Guoyao Shen,Mengyu Li,Stephan Anderson,Chad W. Farris,Xin Zhang
2024-05-24
Abstract:Purpose: To develop and evaluate a deep learning model for general accelerated MRI reconstruction. Materials and Methods: This retrospective study built a magnetic resonance image processing transformer (MR-IPT) which includes multi-head-tails and a single shared window transformer main body. Three mutations of MR-IPT with different transformer structures were implemented to guide the design of our MR-IPT model. Pre-trained on the MRI set of RadImageNet including 672675 images with multiple anatomy categories, the model was further migrated and evaluated on fastMRI knee dataset with 25012 images for downstream reconstruction tasks. We performed comparison studies with three CNN-based conventional networks in zero- and few-shot learning scenarios. Transfer learning process was conducted on both MR-IPT and CNN networks to further validate the generalizability of MR-IPT. To study the model performance stability, we evaluated our model with various downstream dataset sizes ranging from 10 to 2500 images. Result: The MR-IPT model provided superior performance in multiple downstream tasks compared to conventional CNN networks. MR-IPT achieved a PSNR/SSIM of 26.521/0.6102 (4-fold) and 24.861/0.4996 (8-fold) in 10-epoch learning, surpassing UNet128 at 25.056/0.5832 (4-fold) and 22.984/0.4637 (8-fold). With the same large-scale pre-training, MR-IPT provided a 5% performance boost compared to UNet128 in zero-shot learning in 8-fold and 3% in 4-fold. Conclusion: MR-IPT framework benefits from its transformer-based structure and large-scale pre-training and can serve as a solid backbone in other downstream tasks with zero- and few-shot learning.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning,Medical Physics
What problem does this paper attempt to address?
This paper aims to develop and evaluate a deep learning model called Magnetic Resonance Image Processing Transformer (MR-IPT) for accelerating the general reconstruction of MRI. In this study, the authors constructed an MR-IPT with a multi-head and single-shared window transformation body, and implemented it with different variants of transformers. The model was pre-trained on the RadImageNet MRI dataset, which contains various anatomical categories, and then transferred and evaluated on the fastMRI knee dataset for downstream reconstruction tasks. The paper compared the performance of MR-IPT with three traditional convolutional neural network (CNN)-based networks in zero-shot and few-shot learning scenarios. The results showed that even in zero-shot learning, MR-IPT outperformed traditional CNN networks, especially in 4x and 8x accelerated reconstruction tasks. Furthermore, the performance of MR-IPT remained stable with a decrease in the size of the downstream dataset, while the performance of CNN relied more on the dataset size. The conclusion of the paper is that MR-IPT, benefiting from its transformer structure and large-scale pre-training, can be well adapted to other downstream tasks, especially in limited dataset scenarios, providing a powerful foundational framework.