Abstract:

Purpose: Accurate segmentation of the gross target volume (GTV) from computed tomography (CT) images is a prerequisite for radiotherapy of nasopharyngeal carcinoma (NPC). However, this task is very challenging because of the low contrast at the tumor boundary and the wide variation in tumor size and morphology across stages. The data source also strongly affects segmentation results. In this paper, we propose a novel three-dimensional (3D) automatic segmentation algorithm that applies cascaded multiscale local enhancement to convolutional neural networks (CNNs), and we conduct experiments on multi-institutional datasets to address these problems.

Materials and Methods: In this study, we retrospectively collected CT images of 257 NPC patients to test the performance of the proposed automatic segmentation model, and conducted experiments on two additional multi-institutional datasets. Our segmentation framework consists of three parts. First, the framework is built on a 3D Res-UNet backbone with strong segmentation performance. Then, we adopt a multiscale dilated convolution block to enlarge the receptive field and focus on the target area and boundary. Finally, a central localization cascade model for local enhancement concentrates on the GTV region for fine segmentation, improving robustness. The Dice similarity coefficient (DSC), positive predictive value (PPV), sensitivity (SEN), average symmetric surface distance (ASSD) and 95% Hausdorff distance (HD95) are used as quantitative evaluation criteria to assess the performance of our automated segmentation algorithm.

Results: The experimental results show that, compared with other state-of-the-art methods, our modified 3D Res-UNet backbone achieves the best results in terms of the quantitative metrics DSC, PPV, ASSD and HD95, which reached 74.49 ± 7.81%, 79.97 ± 13.90%, 1.49 ± 0.65 mm and 5.06 ± 3.30 mm, respectively. The receptive field enhancement mechanism and cascade architecture contribute substantially to stable, high-accuracy automatic segmentation output, which is critical for a segmentation algorithm. The final DSC, SEN, ASSD and HD95 values reach 76.23 ± 6.45%, 79.14 ± 12.48%, 1.39 ± 5.44 mm and 4.72 ± 3.04 mm, respectively. In addition, the outcomes of the multi-institution experiments demonstrate that our model is robust and generalizable and can achieve good performance through transfer learning.

Conclusions: The proposed algorithm could accurately segment NPC in CT images from multi-institutional datasets and thereby may improve and facilitate clinical applications.
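The overlap-based criteria named in the abstract (DSC, PPV and SEN) follow directly from voxel-wise true positive, false positive and false negative counts. The following is a minimal sketch using the standard definitions, not the authors' evaluation code; the function name `overlap_metrics`, the toy 4×4×4 volumes, and the conventions for empty masks are assumptions made for illustration. The surface-distance criteria (ASSD and HD95) require boundary extraction and are not covered here.

```python
import numpy as np


def overlap_metrics(pred, truth):
    """Return DSC, PPV and SEN for two binary 3D masks (hypothetical helper)."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    if pred.shape != truth.shape:
        raise ValueError("prediction and ground truth must have the same shape")

    tp = np.count_nonzero(pred & truth)    # voxels labeled tumor in both masks
    fp = np.count_nonzero(pred & ~truth)   # predicted tumor, actually background
    fn = np.count_nonzero(~pred & truth)   # missed tumor voxels

    # Assumed conventions for empty masks: DSC = 1 when both masks are empty,
    # PPV = 0 when nothing is predicted, SEN = 0 when there is no ground truth.
    denom = 2 * tp + fp + fn
    dsc = 2 * tp / denom if denom else 1.0
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    sen = tp / (tp + fn) if (tp + fn) else 0.0
    return {"DSC": dsc, "PPV": ppv, "SEN": sen}


# Toy example: an 8-voxel ground-truth cube and a 12-voxel prediction
# that fully covers it and overshoots by one slice.
truth = np.zeros((4, 4, 4), dtype=bool)
truth[1:3, 1:3, 1:3] = True
pred = np.zeros((4, 4, 4), dtype=bool)
pred[1:3, 1:3, 1:4] = True

print(overlap_metrics(pred, truth))
```

In this toy case the prediction contains every ground-truth voxel, so SEN is 1.0, while the four extra voxels lower PPV to 8/12 and DSC to 16/20 = 0.8.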

Segmentation Prompts Classification: A Nnunet-Based 3D Transfer Learning Framework with ROI Tokenization and Cross-Task Attention for Esophageal Cancer T-stage Diagnosis

PheoSeg: A 3D transfer learning framework for accurate abdominal CT pheochromocytoma segmentation and surgical grade prediction

3D RoI-aware U-Net for Accurate and Efficient Colorectal Tumor Segmentation

Eso-Net: A Novel 2.5D Segmentation Network with the Multi-Structure Response Filter for the Cancerous Esophagus

MSI-UNet: A Flexible UNet-Based Multi-Scale Interactive Framework for 3D Gastric Tumor Segmentation on CT Scans

TTT-Unet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation

RDCTrans U-Net: A Hybrid Variable Architecture for Liver CT Image Segmentation

RTAU-Net: A novel 3D rectal tumor segmentation model based on dual path fusion and attentional guidance

HRU-Net: A High-Resolution Convolutional Neural Network for Esophageal Cancer Radiotherapy Target Segmentation

Multiscale Local Enhancement Deep Convolutional Networks for the Automated 3D Segmentation of Gross Tumor Volumes in Nasopharyngeal Carcinoma: A Multi-Institutional Dataset Study

3D Reconstruction-Oriented Fully Automatic Multi-Modal Tumor Segmentation by Dual Attention-Guided VNet

Automatic segmentation of esophageal cancer, metastatic lymph nodes and their adjacent structures in CTA images based on the UperNet Swin network

Early gastric cancer segmentation in gastroscopic images using a co-spatial attention and channel attention based triple-branch ResUnet

Esophageal Image Segmentation with Dual Attention Based on TransUNet

Multi-Scale Supervised 3D U-Net for Kidneys and Kidney Tumor Segmentation

MRI-based Head and Neck Tumor Segmentation Using nnU-Net with 15-fold Cross-Validation Ensemble

A Transformer-Guided Cross-Modality Adaptive Feature Fusion Framework for Esophageal Gross Tumor Volume Segmentation

S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Comparative analysis of automatic segmentation of esophageal cancer using 3D Res-UNet on conventional and 40-keV virtual mono-energetic CT Images: a retrospective study

3D PET/CT tumor segmentation based on nnU-Net with GCN refinement