CalAttnMix: an Instance-Attention Based Data Augmentation Method for Calcification Detection in Mammograms
Peng Huang,Jia Hua,Xiaochuan Geng,Jianrong Xu,Shiteng Suo,Yan Zhou,Yuanyuan Wang,Yi Guo
DOI: https://doi.org/10.1016/j.bspc.2023.105871
IF: 5.1
2024-01-01
Biomedical Signal Processing and Control
Abstract:Data augmentation is widely used as a pre-processing step to solve limited training data and class imbalance problems in deep-learning based calcification detection methods. However, current augmentation methods in breast mammograms fail to guarantee the class balance and diversity of generated calcifications, especially that randomly fusing instances with specified regions results in unrealistic images, further degrading the performances. Here, we propose an instance-attention based data augmentation method, named Calcification-Attention-Mix (CalAttnMix), which adaptively transfers multiple source calcification instances into diverse breast tissue backgrounds, aiding breast calcification detection. First, to seek feasible breast tissue candidate backgrounds, an Augmentation Region Proposal Mechanism (ARPM) is devised based on morphology constraints of calcifications. Then, without segmentation masks of calcification foregrounds, we design a transformer-based Foreground Prediction Module (FPM), along with a multi-task learning strategy, to localize instances accurately in a weakly-supervised manner. Finally, to generate diverse instances, an attention mix module is introduced to dynamically mix instances with background candidates, simulating various breast tissue surroundings. By focusing on the minor class, CalAttnMix generates diverse and balanced datasets for calcification detection. Experiments on the public and clinical dataset show that CalAttnMix achieves the lowest FID 22.152 and cosine similarity score 0.304, respectively, confirming the realism and diversity of generated images. CalAttnMix outperforms SOTA augmentation methods by 3.40% / 2.30% and 1.40% / 5.00% in the average recall/mAP, demonstrating the best detection results.