Abstract:A large labeled dataset is a key to the success of supervised deep learning, but for medical image segmentation, it is highly challenging to obtain sufficient annotated images for model training. In many scenarios, unannotated images are abundant and easy to acquire. Self-supervised learning (SSL) has shown great potentials in exploiting raw data information and representation learning. In this paper, we propose Hierarchical Self-Supervised Learning (HSSL), a new self-supervised framework that boosts medical image segmentation by making good use of unannotated data. Unlike the current literature on task-specific self-supervised pretraining followed by supervised fine-tuning, we utilize SSL to learn task-agnostic knowledge from heterogeneous data for various medical image segmentation tasks. Specifically, we first aggregate a dataset from several medical challenges, then pre-train the network in a self-supervised manner, and finally fine-tune on labeled data. We develop a new loss function by combining contrastive loss and classification loss, and pre-train an encoder-decoder architecture for segmentation tasks. Our extensive experiments show that multi-domain joint pre-training benefits downstream segmentation tasks and outperforms single-domain pre-training significantly. Compared to learning from scratch, our method yields better performance on various tasks (e.g., +0.69%documentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}egin{document}$$+0.69\%$$end{document} to +18.60%documentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}egin{document}$$+18.60\%$$end{document} in Dice with 5%documentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}egin{document}$$5\%$$end{document} of annotated data). With limited amounts of training data, our method can substantially bridge the performance gap with respect to denser annotations (e.g., 10%documentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}egin{document}$$10\%$$end{document} vs. 100%documentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}egin{document}$$100\%$$end{document} annotations).

Bootstrap Representation Learning for Segmentation on Medical Volumes and Sequences

MsVRL: Self-Supervised Multiscale Visual Representation Learning Via Cross-Level Consistency for Medical Image Segmentation

Self-supervised learning via inter-modal reconstruction and feature projection networks for label-efficient 3D-to-2D segmentation

A General Global and Local Pre-Training Framework for 3D Medical Image Segmentation.

Positional Information is a Strong Supervision for Volumetric Medical Image Segmentation

Self-Supervised Alignment Learning for Medical Image Segmentation

Semi-MedSeq: Semi-supervised Semantic Segmentation for Medical Image Sequences.

Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

Self-supervised Learning of Dense Hierarchical Representations for Medical Image Segmentation

Hierarchical Self-supervised Learning for Medical Image Segmentation Based on Multi-domain Data Aggregation

Shape-Guided Dual Consistency Semi-Supervised Learning Framework for 3-D Medical Image Segmentation

Keypoint-Augmented Self-Supervised Learning for Medical Image Segmentation with Limited Annotation

Consistency-guided Meta-Learning for Bootstrapping Semi-Supervised Medical Image Segmentation

Leveraging Unlabeled Data for 3D Medical Image Segmentation through Self-Supervised Contrastive Learning

PA-Seg: Learning from Point Annotations for 3D Medical Image Segmentation using Contextual Regularization and Cross Knowledge Distillation

Robust Semi-supervised 3D Medical Image Segmentation with Diverse Joint-task Learning and Decoupled Inter-student Learning

An Efficient Semi-Supervised Framework with Multi-Task and Curriculum Learning for Medical Image Segmentation

MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset

Shape and boundary-aware multi-branch model for semi-supervised medical image segmentation

Semi-supervised Segmentation with Self-training Based on Quality Estimation and Refinement.

MVPCL: multi-view prototype consistency learning for semi-supervised medical image segmentation