Multimodal Fusion with Cross-attention Transformer for HCC Early Recurrence Prediction from Multi-Phase CT and Clinical Data

Xianru Zhang, Fang Wang, Yinhao Li, Lanfen Lin, Hongjie Hu, Yen-Wei Chen
DOI: https://doi.org/10.1145/3696271.3696286
2024-01-01
Abstract: Hepatocellular carcinoma (HCC) is a leading cause of liver cancer diagnoses and deaths worldwide, and patients face a significant risk of recurrence after surgical resection. Accurate prediction of clinical outcomes, particularly early recurrence, is essential for guiding treatment decisions. In this study, we introduce a novel multimodal fusion approach that uses a cross-attention transformer to predict early recurrence in HCC patients. Building on the transformer architecture and the cross-attention mechanism, we integrate multi-phase CT images with clinical information to improve prediction performance. Our experimental results support the effectiveness of the proposed framework and indicate its potential to improve prognostic accuracy in HCC management.
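The abstract does not give architectural details, so the following is only a minimal sketch of generic single-head scaled dot-product cross-attention, not the authors' model. It assumes that clinical features act as queries and per-phase CT features act as keys and values; the names `cross_attention`, `clinical_tokens`, and `ct_tokens`, the toy feature values, and the omission of learned projection matrices are all illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Single-head scaled dot-product cross-attention (no learned projections).

    queries: vectors from one modality (here: hypothetical clinical tokens)
    keys, values: vectors from the other modality (here: hypothetical CT tokens)
    Returns (outputs, weights); weights[i][j] is how strongly query i
    attends to key j, and each row of weights sums to 1.
    """
    dim = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        w = softmax(scores)
        # Weighted sum of the value vectors.
        out = [
            sum(w[j] * values[j][c] for j in range(len(values)))
            for c in range(len(values[0]))
        ]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


# Toy inputs (assumed, not from the paper): three CT phase tokens and
# two clinical tokens, all with feature dimension 4.
ct_tokens = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]
clinical_tokens = [
    [2.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]

fused, attn = cross_attention(clinical_tokens, ct_tokens, ct_tokens)
```

In this toy setup the first clinical token attends most strongly to the first CT token, while the all-zero second token spreads its attention uniformly. A real model would typically add learned query, key, and value projections, multiple heads, and a classification head on the fused features.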