Multi-View Attention Learning for Residual Disease Prediction of Ovarian Cancer

Xiangneng Gao,Shulan Ruan,Jun Shi,Guoqing Hu,Wei Wei
2023-06-26
Abstract:In the treatment of ovarian cancer, precise residual disease prediction is significant for clinical and surgical decision-making. However, traditional methods are either invasive (e.g., laparoscopy) or time-consuming (e.g., manual analysis). Recently, deep learning methods make many efforts in automatic analysis of medical images. Despite the remarkable progress, most of them underestimated the importance of 3D image information of disease, which might brings a limited performance for residual disease prediction, especially in small-scale datasets. To this end, in this paper, we propose a novel Multi-View Attention Learning (MuVAL) method for residual disease prediction, which focuses on the comprehensive learning of 3D Computed Tomography (CT) images in a multi-view manner. Specifically, we first obtain multi-view of 3D CT images from transverse, coronal and sagittal views. To better represent the image features in a multi-view manner, we further leverage attention mechanism to help find the more relevant slices in each view. Extensive experiments on a dataset of 111 patients show that our method outperforms existing deep-learning methods.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issue of accurately predicting residual lesions in ovarian cancer treatment, which is crucial for clinical and surgical decision-making. Traditional methods are either highly invasive (such as laparoscopy) or time-consuming (such as manual analysis). Despite significant progress in automatic medical image analysis with deep learning in recent years, most studies underestimate the importance of three-dimensional (3D) image information, which may limit the performance of residual lesion prediction, especially on small-scale datasets. To address the above issues, the authors propose a new method called Multi-View Attention Learning (MuV AL) for predicting residual lesions in ovarian cancer. The MuV AL method specifically focuses on comprehensively learning 3D computed tomography (CT) images from multiple views (axial, coronal, and sagittal). Specifically, they first acquire 3D CT images from three different views and use attention mechanisms to identify the most relevant slices in each view, thereby better representing image features. Additionally, a pre-trained Med3D model is used to handle small-scale datasets. The main contributions of the paper can be summarized as follows: 1. **Application of multi-view 3D CT images**: It is observed that utilizing multi-view features of 3D CT images obtained from axial, coronal, and sagittal views has great potential for enhancing the learning of residual lesion prediction. 2. **MuV AL method**: A new multi-view attention learning method is proposed, which focuses on comprehensively learning 3D CT images in a multi-view manner. 3. **Experimental validation**: Extensive experiments and various metrics demonstrate the superiority and rationality of the proposed method compared to baseline methods. In summary, this study introduces the novel MuV AL method, effectively leveraging multi-view information in 3D CT images to improve the accuracy of residual lesion prediction in ovarian cancer, providing a valuable tool for clinical practice.