Abstract: Background and objective: [18F]-fluorodeoxyglucose (FDG) positron emission tomography-computed tomography (PET-CT) is now the preferred imaging modality for staging many cancers. PET images characterize tumoral glucose metabolism, while CT depicts the complementary anatomical localization of the tumor. Automatic tumor segmentation is an important step in image analysis for computer-aided diagnosis systems. Recently, fully convolutional networks (FCNs), with their ability to leverage annotated datasets and extract image feature representations, have become the state of the art in tumor segmentation. Few FCN-based methods support multi-modality images, and current methods have primarily focused on fusing multi-modality image features at various stages: early-fusion, where the multi-modality image features are fused prior to the FCN; late-fusion, where the resultant features are fused; and hyper-fusion, where multi-modality image features are fused across multiple image feature scales. Early- and late-fusion methods, however, have inherently limited freedom to fuse complementary multi-modality image features. Hyper-fusion methods learn different image features at different image feature scales, which can result in inaccurate segmentations, particularly where tumors have heterogeneous textures.
Methods: We propose a recurrent fusion network (RFN), which consists of multiple recurrent fusion phases that progressively fuse the complementary multi-modality image features with the intermediary segmentation results derived at the individual recurrent fusion phases: (1) the recurrent fusion phases iteratively learn the image features and then refine the subsequent segmentation results; and (2) the intermediary segmentation results allow our method to focus on learning the multi-modality image features around these intermediary segmentation results, which minimizes the risk of inconsistent feature learning.
Results: We evaluated our method on two pathologically proven non-small cell lung cancer PET-CT datasets. We compared our method to the commonly used fusion methods (early-fusion, late-fusion and hyper-fusion) and to state-of-the-art PET-CT tumor segmentation methods on various network backbones (ResNet, DenseNet and 3D-UNet). Our results show that the RFN provides more accurate segmentation than the existing methods and generalizes to different datasets.
Conclusions: We show that learning through multiple recurrent fusion phases allows the iterative re-use of multi-modality image features, which refines the tumor segmentation results. We also find that our RFN produces consistent segmentation results across different network architectures.
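
The data flow of the recurrent fusion phases can be illustrated with a minimal sketch. This is not the RFN itself: it works on 1D toy feature vectors, uses fixed hand-picked weights instead of learned convolutional filters, and every name in it (`fuse_phase`, `recurrent_fusion`, `w_pet`, `w_ct`, `focus`, `bias`) is a hypothetical choice made for illustration. It only shows how an intermediary segmentation estimate might be fed back into the next fusion phase alongside the PET and CT features.

```python
import math


def sigmoid(x):
    """Logistic function mapping a score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def fuse_phase(pet, ct, prev_prob, w_pet=2.0, w_ct=1.0, focus=1.0, bias=-1.5):
    """One toy fusion phase (assumed form, not the paper's layer).

    Each position's score combines its PET and CT feature with the
    previous phase's tumor probability, so positions already marked as
    likely tumor receive more weight in the next estimate.
    """
    return [
        sigmoid(w_pet * p + w_ct * c + focus * q + bias)
        for p, c, q in zip(pet, ct, prev_prob)
    ]


def recurrent_fusion(pet, ct, phases=3):
    """Run several fusion phases, returning every intermediary estimate."""
    prob = [0.5] * len(pet)  # uninformative starting estimate
    history = []
    for _ in range(phases):
        prob = fuse_phase(pet, ct, prob)
        history.append(prob)
    return history


# Toy 1D "scan": positions 2-4 have high PET uptake and CT density.
pet = [0.1, 0.2, 0.9, 1.0, 0.8, 0.1]
ct = [0.3, 0.3, 0.7, 0.8, 0.6, 0.2]

history = recurrent_fusion(pet, ct, phases=3)
mask = [p > 0.5 for p in history[-1]]
print(mask)
```

In this toy setting, the estimates at the high-uptake positions rise from phase to phase while the background estimates fall, which mirrors, in a very simplified way, the idea of refining the segmentation by re-using the fused features around the intermediary result.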

Tumor co-segmentation in PET/CT using multi-modality fully convolutional neural network

Head and neck tumor segmentation convolutional neural network robust to missing PET/CT modalities using channel dropout

MFCNet: A multi-modal fusion and calibration networks for 3D pancreas tumor segmentation on PET-CT images

Joint segmentation of tumors in 3D PET-CT images with a network fusing multi-view and multi-modal information

Multimodal Spatial Attention Module for Targeting Multimodal PET-CT Lung Tumor Segmentation

Lung Tumor Segmentation of PET/CT Using Dual Pyramid Mask R-CNN

Multitask connected U-Net: automatic lung cancer segmentation from CT images using PET knowledge guidance

Recurrent Feature Fusion Learning for Multi-Modality PET-CT Tumor Segmentation

Whole-body tumor segmentation of 18F-FDG PET/CT using a cascaded and ensembled convolutional neural networks

Deep Learning-Based Image Segmentation on Multimodal Medical Imaging

Efficient model-informed co-segmentation of tumors on PET/CT driven by clustering and classification information

Combining CNN and Hybrid Active Contours for Head and Neck Tumor Segmentation in CT and PET images

A deep learning model integrating FCNNs and CRFs for brain tumor segmentation

3D PET/CT tumor segmentation based on nnU-Net with GCN refinement

LSAM: L2-norm self-attention and latent space feature interaction for automatic 3D multi-modal head and neck tumor segmentation

Co-Learning Feature Fusion Maps from PET-CT Images of Lung Cancer

Automated Lung Cancer Segmentation Using a Dual-Modality Deep Learning Network with PET and CT Images

3D Reconstruction-Oriented Fully Automatic Multi-Modal Tumor Segmentation by Dual Attention-Guided VNet

Modality-level cross-connection and attentional feature fusion based deep neural network for multi-modal brain tumor segmentation