Self-supervised Learning for Chest CT - Training Strategies and Effect on Downstream Applications

Amara Tariq,Bhavik N. Patel,Imon Banerjee
DOI: https://doi.org/10.1101/2024.02.01.24302144
2024-02-05
Abstract:Self-supervised pretraining can reduce the amount of labeled training data needed by pre-learning fundamental visual characteristics of the medical imaging data. In this study, we investigate several self-supervised training strategies for chest computed tomography exams and their effects of downstream applications. we bench-mark five well-known self-supervision strategies (masked image region prediction, next slice prediction, rotation prediction, flip prediction and denoising) on 15M chest CT slices collected from four sites of Mayo Clinic enterprise. These models were evaluated for two downstream tasks on public datasets; pulmonary embolism (PE) detection (classification) and lung nodule segmentation. Image embeddings generated by these models were also evaluated for prediction of patient age, race, and gender to study inherent biases in models’ understanding of chest CT exams. Use of pretraining weights, especially masked regions prediction based weights, improved performance and reduced computational effort needed for downstream tasks compared to task-specific state-of-the-art (SOTA) models. Performance improvement for PE detection was observed for training dataset sizes as large as with maximum gain of 5% over SOTA. Segmentation model initialized with pretraining weights learned twice as fast as randomly initialized model. While gender and age predictors built using self-supervised training weights showed no performance improvement over randomly initialized predictors, the race predictor experienced a 10% performance boost when using self-supervised training weights. We released models and weights under open-source academic license. These models can then be finetuned with limited task-specific annotated data for a variety of downstream imaging tasks thus accelerating research in biomedical imaging informatics.
Radiology and Imaging
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in chest CT image data, the dependence on a large amount of labeled data is reduced through self - supervised learning methods, and the impact of different self - supervised training strategies on downstream tasks is studied. Specifically, the paper explores the effects of five self - supervised training strategies (masked image region prediction, next - patch prediction, rotation prediction, flip prediction, and denoising) on 15 million chest CT slices from four sites of Mayo Clinic. These models are then used to evaluate the performance of two downstream tasks: pulmonary embolism (PE) detection (a classification task) and pulmonary nodule segmentation (a segmentation task). In addition, the paper also evaluates the impact of the image embeddings generated by these models on predicting sensitive attributes such as patient age, race, and gender, in order to study the inherent bias in the model's understanding of chest CT examinations. The main objectives of the paper include: 1. **Reducing the dependence on labeled data**: Through self - supervised pre - training, reduce the amount of labeled data required for downstream tasks. 2. **Improving the performance of downstream tasks**: Evaluate the impact of different self - supervised strategies on the performance of downstream tasks, especially compared with the existing task - specific state - of - the - art (SOTA) models. 3. **Studying the generalization ability of the model**: Analyze the performance of self - supervised pre - training models in different downstream tasks, especially the performance improvement when the data set size changes. 4. **Evaluating the potential bias of the model**: Study whether self - supervised pre - training models will introduce bias when predicting patient - sensitive attributes such as age, race, and gender. Through these studies, the paper aims to promote the development of self - supervised learning techniques in the field of medical imaging and accelerate the research progress of biomedical imaging informatics.