Multi-Modal Co-Learning for Liver Lesion Segmentation on PET-CT Images

Zhongliang Xue,Ping Li,Liang Zhang,Xiaoyuan Lu,Guangming Zhu,Peiyi Shen,Syed Afaq Ali Shah,Mohammed Bennamoun
DOI: https://doi.org/10.1109/TMI.2021.3089702
IF: 10.6
2021-01-01
IEEE Transactions on Medical Imaging
Abstract:Liver lesion segmentation is an essential process to assist doctors in hepatocellular carcinoma diagnosis and treatment planning. Multi-modal positron emission tomography and computed tomography (PET-CT) scans are widely utilized due to their complementary feature information for this purpose. However, current methods ignore the interaction of information across the two modalities during feature extraction, omit the co-learning of the feature maps of different resolutions, and do not ensure that shallow and deep features complement each others sufficiently. In this paper, our proposed model can achieve feature interaction across multi-modal channels by sharing the down-sampling blocks between two encoding branches to eliminate misleading features. Furthermore, we combine feature maps of different resolutions to derive spatially varying fusion maps and enhance the lesions information. In addition, we introduce a similarity loss function for consistency constraint in case that predictions of separated refactoring branches for the same regions vary a lot. We evaluate our model for liver tumor segmentation using a PET-CT scans dataset, compare our method with the baseline techniques for multi-modal (multi-branches, multi-channels and cascaded networks) and then demonstrate that our method has a significantly higher accuracy (p < 0.05) than the baseline models.
What problem does this paper attempt to address?