Surrogate Supervision for Medical Image Analysis: Effective Deep Learning From Limited Quantities of Labeled Data

Nima Tajbakhsh, Yufei Hu, Junli Cao, Xingjian Yan, Yi Xiao, Yong Lu, Jianming Liang, Demetri Terzopoulos, Xiaowei Ding
DOI: https://doi.org/10.48550/arXiv.1901.08707
2019-01-25
Abstract: We investigate the effectiveness of a simple solution to the common problem of deep learning in medical image analysis with limited quantities of labeled training data. The underlying idea is to assign artificial labels to abundantly available unlabeled medical images and, through a process known as surrogate supervision, pre-train a deep neural network model for the target medical image analysis task lacking sufficient labeled training data. In particular, we employ 3 surrogate supervision schemes, namely rotation, reconstruction, and colorization, in 4 different medical imaging applications representing classification and segmentation for both 2D and 3D medical images. 3 key findings emerge from our research: 1) pre-training with surrogate supervision is effective for small training sets; 2) deep models trained from initial weights pre-trained through surrogate supervision outperform the same models when trained from scratch, suggesting that pre-training with surrogate supervision should be considered prior to training any deep 3D models; 3) pre-training models in the medical domain with surrogate supervision is more effective than transfer learning from an unrelated domain (e.g., natural images), indicating the practical value of abundant unlabeled medical image data.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the poor performance of deep learning in medical image analysis when labeled training data are limited. It investigates a simple method: assign artificial labels to abundantly available unlabeled medical images and use them, through so-called surrogate supervision, to pre-train a deep neural network for the target task. The paper tests three surrogate supervision schemes (rotation, reconstruction, and colorization) in four medical imaging applications that cover classification and segmentation of both 2D and 3D images. It finds that pre-training with surrogate supervision is effective for small training sets, and that models initialized this way outperform both the same models trained from scratch and models transferred from an unrelated domain such as natural images. The authors therefore suggest considering surrogate supervision before training any deep 3D model. Overall, the results indicate that abundant unlabeled medical images have practical value for pre-training, especially when labeled data are scarce.
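To make the idea of "assigning artificial labels" concrete, here is a minimal sketch of how a rotation-based surrogate task could be built from unlabeled images. It is an illustration only, not the authors' implementation: the helper names `rotate90` and `make_rotation_task`, the use of four 90-degree rotation classes, and the representation of an image as a list of rows are all assumptions made for this example.

```python
import random


def rotate90(image, k):
    """Rotate a 2D image (a list of rows) counter-clockwise by k * 90 degrees."""
    for _ in range(k % 4):
        # Transpose, then reverse the row order: one counter-clockwise quarter turn.
        image = [list(row) for row in zip(*image)][::-1]
    # Return a fresh copy so the caller's image is never aliased.
    return [list(row) for row in image]


def make_rotation_task(unlabeled_images, seed=0):
    """Pair each unlabeled image with an artificial label (hypothetical helper).

    The label is the rotation index k in {0, 1, 2, 3}; a network pre-trained
    to predict k from the rotated image needs no manual annotation.
    """
    rng = random.Random(seed)
    samples = []
    for image in unlabeled_images:
        k = rng.randrange(4)  # surrogate label: 0, 90, 180, or 270 degrees
        samples.append((rotate90(image, k), k))
    return samples


if __name__ == "__main__":
    # Two tiny stand-in "images"; real inputs would be medical image arrays.
    images = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
    for rotated, label in make_rotation_task(images):
        print(label, rotated)
```

In such a setup, the weights learned on the surrogate task would then initialize the model for the target classification or segmentation task, which is fine-tuned on the small labeled set.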