CUTS: A Deep Learning and Topological Framework for Multigranular Unsupervised Medical Image Segmentation

Chen Liu, Matthew Amodio, Liangbo L. Shen, Feng Gao, Arman Avesta, Sanjay Aneja, Jay C. Wang, Lucian V. Del Priore, Smita Krishnaswamy
2024-06-26
Abstract: Segmenting medical images is critical to facilitating both patient diagnoses and quantitative research. A major limiting factor is the lack of labeled data, as obtaining expert annotations for each new set of imaging data and task can be labor intensive and inconsistent among annotators. We present CUTS, an unsupervised deep learning framework for medical image segmentation. CUTS operates in two stages. For each image, it produces an embedding map via intra-image contrastive learning and local patch reconstruction. Then, these embeddings are partitioned at dynamic granularity levels that correspond to the data topology. CUTS yields a series of coarse-to-fine-grained segmentations that highlight features at various granularities. We applied CUTS to retinal fundus images and two types of brain MRI images to delineate structures and patterns at different scales. When evaluated against predefined anatomical masks, CUTS improved the Dice coefficient and Hausdorff distance by at least 10% compared to existing unsupervised methods. Finally, CUTS showed performance on par with Segment Anything Models (SAM, MedSAM, SAM-Med2D) pre-trained on gigantic labeled datasets.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the paper attempts to solve

The paper "CUTS: A Deep Learning and Topological Framework for Multigranular Unsupervised Medical Image Segmentation" aims to solve several key problems in medical image segmentation:

1. **Lack of labeled data**: Obtaining expert-labeled data is time-consuming and labor-intensive, and consistency among annotators is hard to ensure. This limits the use of supervised learning methods in medical image segmentation.
2. **Insufficient cross-domain generalization**: Supervised models may perform poorly on images collected from different devices or patient groups, because they are usually trained only on specific datasets.
3. **Diverse segmentation granularity requirements**: Different applications may require segmentations at different granularities. Existing supervised methods cannot adapt to these requirements flexibly and usually require the data to be relabeled.

To address these problems, the authors propose a fully unsupervised framework, CUTS (Contrastive and Unsupervised Training for multigranular medical image Segmentation), which combines recent techniques from representation learning and from data geometry and topology, and performs multi-granular medical image segmentation automatically without relying on labeled data.

### Main contributions

1. **Two-stage framework**: CUTS first generates image-specific, pixel-centered patch embeddings with a convolutional encoder, and then uses diffusion condensation to cluster these embeddings at multiple granularity levels to achieve multi-scale segmentation.
2. **Novel optimization objective**: In the first stage, CUTS combines intra-image contrastive learning and local patch reconstruction to help the convolutional encoder learn a more expressive embedding space.
3. **Multi-scale clustering assignment**: In the second stage, CUTS uses diffusion condensation to provide multi-granular segmentation labels, which can highlight clinically relevant regions at different scales.

### Method overview

The CUTS framework consists of two main stages:

1. **Embedding space learning**:
   - **Pixel-centered patches**: Each pixel and its surrounding local neighborhood are encoded into a high-dimensional embedding space.
   - **Optimization objectives**:
     - **Intra-image contrastive loss**: Learns local features by contrasting positive and negative sample pairs.
     - **Local patch reconstruction loss**: Ensures that the embeddings retain the patch information.
   - **Final objective function**: A weighted sum of the contrastive loss and the reconstruction loss.
2. **Multi-scale segmentation**:
   - **Diffusion condensation**: Identifies natural data groupings through a dynamic process to generate multi-granular segmentation results.
   - **Persistent structures**: Identifies clusters that remain separated across multiple iterations to form stable segmentation results.

### Experimental results

The authors evaluated CUTS on three medical image datasets:

1. **Retinal fundus images**: Used to segment geographic atrophy (GA) areas.
2. **Brain MRI images (ventricles)**: Used to segment ventricles.
3. **Brain MRI images (tumor)**: Used to segment brain tumors.

The experimental results show that CUTS performs well on both multi-granular and binary segmentation tasks, outperforms other unsupervised methods, and even exceeds the pre-trained Segment Anything Model (SAM) and its medical variants on some datasets.

### Summary

CUTS addresses insufficient labeled data, weak cross-domain generalization, and diverse segmentation granularity requirements in medical image segmentation through unsupervised learning, providing a new solution for medical image segmentation.
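
### Sketch: diffusion condensation on toy embeddings

The diffusion condensation step named in the method overview can be illustrated with a small NumPy sketch. This is not the authors' implementation: the Gaussian kernel, the rule of widening the bandwidth when no merge occurs, and every name and default here (`epsilon`, `growth`, `merge_tol`, `max_iter`) are assumptions chosen for illustration. The toy points stand in for the pixel-centered patch embeddings that the encoder would produce.

```python
import numpy as np


def diffusion_condensation(X, epsilon=0.5, growth=2.0, merge_tol=1e-4, max_iter=100):
    """Toy diffusion condensation; returns one label vector per iteration.

    All parameter names and defaults are illustrative assumptions,
    not values taken from the CUTS paper.
    """
    X = np.asarray(X, dtype=float).copy()
    labels = np.arange(len(X))      # every point starts as its own cluster
    levels = [labels.copy()]        # finest granularity first
    for _ in range(max_iter):
        # Gaussian affinities between all pairs of current point positions.
        sq_dist = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
        K = np.exp(-sq_dist / epsilon)
        # Row-normalize into a Markov diffusion operator and take one step:
        # each point moves to the affinity-weighted mean of its neighbors.
        P = K / K.sum(axis=1, keepdims=True)
        X = P @ X
        # Merge clusters whose points have condensed to within merge_tol.
        dist = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1))
        merged = False
        for i, j in zip(*np.where(np.triu(dist < merge_tol, k=1))):
            if labels[i] != labels[j]:
                keep, drop = sorted((labels[i], labels[j]))
                labels[labels == drop] = keep
                merged = True
        levels.append(labels.copy())
        if len(np.unique(labels)) == 1:
            break                   # coarsest level reached
        if not merged:
            epsilon *= growth       # widen the kernel so distant groups can meet
    return levels


# Two well-separated toy groups standing in for patch embeddings.
points = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
                   [10.0, 10.0], [10.1, 10.0], [10.0, 10.1]])
levels = diffusion_condensation(points)
print([len(np.unique(level)) for level in levels])  # cluster count per iteration
```

Each entry of `levels` is one candidate segmentation, ordered from fine to coarse. For an image, a label vector over all pixel embeddings could be reshaped to the image height and width to give a label map at that granularity. A grouping that survives many consecutive iterations corresponds to the persistent structures described above.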