Enhancing AI Diagnostics: Autonomous Lesion Masking via Semi-Supervised Deep Learning

Ting-Ruen Wei, Michele Hell, Dang Bich Thuy Le, Aren Vierra, Ran Pang, Mahesh Patel, Young Kang, Yuling Yan
2024-04-19
Abstract: This study presents an unsupervised domain adaptation method aimed at autonomously generating image masks outlining regions of interest (ROIs) for differentiating breast lesions in breast ultrasound (US) imaging. Our semi-supervised learning approach utilizes a primitive model trained on a small public breast US dataset with true annotations. This model is then iteratively refined for the domain adaptation task, generating pseudo-masks for our private, unannotated breast US dataset. The dataset, twice the size of the public one, exhibits considerable variability in image acquisition perspectives and demographic representation, posing a domain-shift challenge. Unlike typical domain adversarial training, we employ downstream classification outcomes as a benchmark to guide the updating of pseudo-masks in subsequent iterations. We found the classification precision to be highly correlated with the completeness of the generated ROIs, which promotes the explainability of the deep learning classification model. Preliminary findings demonstrate the efficacy and reliability of this approach in streamlining the ROI annotation process, thereby enhancing the classification and localization of breast lesions for more precise and interpretable diagnoses.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The paper addresses data scarcity and annotation cost in medical image analysis, specifically the classification of breast lesions in breast ultrasound (US) imaging. It proposes an unsupervised domain adaptation method that automatically generates image masks outlining the regions of interest (ROIs) in breast US images, which supports distinguishing benign from malignant lesions. The method starts from a primitive model trained with true annotations on a small public breast US dataset and iteratively refines it to generate pseudo-masks for a private, unannotated breast US dataset. That private dataset is twice the size of the public one and differs in image acquisition perspectives and demographic representation, so the model faces a domain shift. Unlike typical domain adversarial training, the study uses downstream classification results as a benchmark to guide the update of pseudo-masks in subsequent iterations. The authors report that classification precision is highly correlated with the completeness of the generated ROIs, which also makes the classifier's decisions easier to interpret. Preliminary findings indicate that the method is effective and reliable in streamlining ROI annotation and in improving the classification and localization of breast lesions. This matters most in medical settings, where data is limited and manual annotation is time-consuming and labor-intensive.
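The iterative refinement described here can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`refine_pseudo_masks`, `propose_masks`, `score_classifier`), the rule of accepting new pseudo-masks only when the downstream score improves, and the toy data are all assumptions chosen to show the general shape of a classification-guided pseudo-labeling loop.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-in: a "mask" is reduced to one number, the fraction of
# the lesion it covers. Real masks would be 2-D arrays.
Mask = float


def refine_pseudo_masks(
    images: Sequence[int],
    initial_masks: List[Mask],
    propose_masks: Callable[[Sequence[int], List[Mask]], List[Mask]],
    score_classifier: Callable[[Sequence[int], List[Mask]], float],
    max_iters: int = 5,
) -> Tuple[List[Mask], List[float]]:
    """Update pseudo-masks while the downstream classification score improves.

    Assumed acceptance rule: a candidate set of masks replaces the current
    set only if it yields a strictly higher score; otherwise the loop stops.
    """
    best_masks = initial_masks
    best_score = score_classifier(images, best_masks)
    history = [best_score]
    for _ in range(max_iters):
        candidate = propose_masks(images, best_masks)
        score = score_classifier(images, candidate)
        if score <= best_score:
            break  # no improvement: keep the previous pseudo-masks
        best_masks, best_score = candidate, score
        history.append(score)
    return best_masks, history


# Toy stand-ins for the segmentation model and the downstream classifier.
def toy_propose(images: Sequence[int], masks: List[Mask]) -> List[Mask]:
    # Pretend each refinement round grows every mask by a fixed amount.
    return [min(1.0, m + 0.25) for m in masks]


def toy_score(images: Sequence[int], masks: List[Mask]) -> float:
    # Pretend the classifier does best when masks cover 75% of the lesion.
    mean_cover = sum(masks) / len(masks)
    return 1.0 - abs(mean_cover - 0.75)


masks, history = refine_pseudo_masks([0, 1], [0.25, 0.25], toy_propose, toy_score)
print(masks, history)
```

In this toy run the loop accepts two rounds of updates and stops on the third, when further growth of the masks lowers the score. In practice `propose_masks` would retrain or fine-tune a segmentation model and `score_classifier` would train and evaluate a lesion classifier on the masked images.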