3D U-KAN Implementation for Multi-modal MRI Brain Tumor Segmentation

Tianze Tang,Yanbing Chen,Hai Shu
2024-08-01
Abstract:We explore the application of U-KAN, a U-Net based network enhanced with Kolmogorov-Arnold Network (KAN) layers, for 3D brain tumor segmentation using multi-modal MRI data. We adapt the original 2D U-KAN model to the 3D task, and introduce a variant called UKAN-SE, which incorporates Squeeze-and-Excitation modules for global attention. We compare the performance of U-KAN and UKAN-SE against existing methods such as U-Net, Attention U-Net, and Swin UNETR, using the BraTS 2024 dataset. Our results show that U-KAN and UKAN-SE, with approximately 10.6 million parameters, achieve exceptional efficiency, requiring only about 1/4 of the training time of U-Net and Attention U-Net, and 1/6 that of Swin UNETR, while surpassing these models across most evaluation metrics. Notably, UKAN-SE slightly outperforms U-KAN.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main objective of this paper is to evaluate and improve 3D multimodal MRI brain tumor segmentation techniques. Specifically, the research team applied U-KAN (a network based on U-Net and integrated with Kolmogorov-Arnold Network (KAN) layers) to the 3D brain tumor segmentation task and introduced a new variant, UKAN-SE, which combines the Squeeze-and-Excitation module to enhance the global attention mechanism. The paper evaluated U-KAN and UKAN-SE using the BraTS 2024 dataset and compared their performance with several existing models (such as U-Net, Attention U-Net, and Swin UNETR). The study found that U-KAN and UKAN-SE significantly reduced training time, approximately 1/4 of U-Net and Attention U-Net and about 1/6 of Swin UNETR, while having a similar number of parameters (around 10.6 million). Additionally, these two models outperformed other models on various evaluation metrics, with UKAN-SE showing particularly outstanding performance across multiple lesion types. Furthermore, the paper explored the potential of the KAN structure in improving model interpretability and suggested that future research could further optimize the configuration of KAN layers to enhance performance.