Abstract: Objective. Radiation therapy (RT) is a prevalent therapeutic modality for head and neck (H&N) cancer. A crucial phase in RT planning is the precise delineation of organs at risk (OARs) on computed tomography (CT) scans. Manual delineation of OARs is labor-intensive, however: each CT image slice must be inspected individually, and a standard CT scan comprises hundreds of slices. Furthermore, there is a significant domain shift between different institutions' H&N data, which makes traditional semi-supervised learning strategies susceptible to confirmation bias. Effectively using unlabeled datasets to support annotated datasets during model training has therefore become critical for mitigating domain shift and confirmation bias. Approach. In this work, we propose a Cross-Domain Orthogon-based-Perspective Consistency (CD-OPC) strategy within a two-branch collaborative training framework, which compels the two sub-networks to acquire valuable features from unrelated perspectives. More specifically, a novel generative pretext task, Cross-Domain Prediction (CDP), was designed to learn inherent properties of CT images. This prior knowledge was then used to promote independent learning of distinct features by the two sub-networks from identical inputs, enhancing the perceptual capabilities of the sub-networks through Orthogon-based Pseudo-Labeling Knowledge Transfer (OPKT). Main results. Our CD-OPC model was trained on H&N datasets from nine different institutions and validated on four local institutions' H&N datasets. Across all datasets, CD-OPC outperformed other semi-supervised semantic segmentation algorithms. Significance. The CD-OPC method successfully mitigates domain shift and prevents network collapse. In addition, it enhances the network's perceptual abilities and generates more reliable predictions, thereby further addressing the confirmation bias issue.
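
The abstract does not specify the loss functions, so the following is only a generic sketch of the two ideas it names: two branches that supervise each other with pseudo-labels, and a term that pushes their features toward unrelated (orthogonal) directions. The function names `cross_pseudo_loss` and `orthogonality_penalty`, the use of hard argmax pseudo-labels, and the squared-cosine penalty are all assumptions made for illustration, not the CD-OPC formulation.

```python
import math

def softmax(logits):
    # Numerically stable softmax over one voxel's class logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def cross_entropy(logits, target):
    # Negative log-probability assigned to the target class.
    return -math.log(softmax(logits)[target])

def argmax(values):
    return max(range(len(values)), key=values.__getitem__)

def cross_pseudo_loss(logits_a, logits_b):
    # Hypothetical mutual supervision: each branch is trained on the
    # other branch's hard pseudo-label, averaged over voxels.
    total = 0.0
    for la, lb in zip(logits_a, logits_b):
        total += cross_entropy(la, argmax(lb)) + cross_entropy(lb, argmax(la))
    return total / len(logits_a)

def orthogonality_penalty(feats_a, feats_b):
    # Hypothetical penalty: mean squared cosine similarity between the
    # two branches' feature vectors; 0 when every pair is orthogonal.
    total = 0.0
    for u, v in zip(feats_a, feats_b):
        dot = sum(x * y for x, y in zip(u, v))
        norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
        total += (dot / norm) ** 2 if norm > 0 else 0.0
    return total / len(feats_a)

# Toy unlabeled batch: two voxels, two classes, 2-D features per branch.
logits_a = [[5.0, 0.0], [0.0, 3.0]]
logits_b = [[4.0, 0.0], [0.0, 2.0]]
feats_a = [[1.0, 0.0], [0.0, 1.0]]
feats_b = [[0.0, 1.0], [1.0, 0.0]]
unsup_loss = cross_pseudo_loss(logits_a, logits_b) + orthogonality_penalty(feats_a, feats_b)
```

In this toy batch the branches agree on every voxel and their features are orthogonal, so the combined unsupervised loss is small; disagreement between branches or aligned features would raise it. How the two terms are weighted against the supervised loss is left open here because the abstract does not say.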

Semi-supervised classification by reaching consensus among modalities

Detecting cognitive impairments by agreeing on interpretations of linguistic features

Trusted 3D self-supervised representation learning with cross-modal settings

Semi-supervised Dynamic Counter Propagation Network

TGNN: A Joint Semi-supervised Framework for Graph-level Classification

Semi-Supervised Medical Image Segmentation Based on Deep Consistent Collaborative Learning

TCGM: an Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning

Multimodality-Assisted Semi-Supervised Brain Tumor Segmentation in Nondominant Modality Based on Consistency Learning

Adaptive Graph Convolutional Collaboration Networks for Semi-supervised Classification

Consensus Focus for Object Detection and minority classes

Comprehensive Semi-Supervised Multi-Modal Learning

Deep learning for head and neck semi-supervised semantic segmentation

Transductive Centroid Projection for Semi-supervised Large-Scale Recognition

Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification

Data-Driven Adaptive Consensus Learning From Network Topologies

Multimodal Semi-Supervised Learning for 3D Objects

A semi-supervised approach for the integration of multi-omics data based on transformer multi-head self-attention mechanism and graph convolutional networks

Deep Discriminative CNN with Temporal Ensembling for Ambiguously-Labeled Image Classification

Robust Land Cover Classification with Multimodal Knowledge Distillation

ACN: Adversarial Co-training Network for Brain Tumor Segmentation with Missing Modalities

Transfer Classification for Distinct Manifestations with Shared Information