SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification

Yuexi Du,Regina J. Hooley,John Lewin,Nicha C. Dvornek
2024-03-20
Abstract:Digital Breast Tomosynthesis (DBT) is a widely used medical imaging modality for breast cancer screening and diagnosis, offering higher spatial resolution and greater detail through its 3D-like breast volume imaging capability. However, the increased data volume also introduces pronounced data imbalance challenges, where only a small fraction of the volume contains suspicious tissue. This further exacerbates the data imbalance due to the case-level distribution in real-world data and leads to learning a trivial classification model that only predicts the majority class. To address this, we propose a novel method using view-level contrastive Self-supervised Initialization and Fine-Tuning for identifying abnormal DBT images, namely SIFT-DBT. We further introduce a patch-level multi-instance learning method to preserve spatial resolution. The proposed method achieves 92.69% volume-wise AUC on an evaluation of 970 unique studies.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the issue of data imbalance encountered in Digital Breast Tomosynthesis (DBT) images during breast cancer screening and diagnosis. Specifically, while DBT provides higher spatial resolution and more detailed information, the increase in data volume also leads to significant data imbalance, where only a small portion of the volume contains suspicious tissue. This data imbalance further exacerbates the imbalance in case distribution in actual data, causing the learned classification model to only predict the majority class. To solve this problem, the authors propose a new method—Self-supervised Initialization and Fine-Tuning for DBT (SIFT-DBT)—for identifying abnormal DBT images. Additionally, a patch-based multiple instance learning method is introduced to preserve spatial resolution. Experimental results show that this method achieves a volume-level AUC of 92.69% in the evaluation of 970 independent studies. Overall, the goal of this study is to improve the performance of DBT image classification through an improved self-supervised learning method, thereby helping radiologists identify high-risk scans more quickly and significantly improving screening efficiency by filtering out normal examination results.