BTMuda: A Bi-level Multi-source unsupervised domain adaptation framework for breast cancer diagnosis

Yuxiang Yang, Xinyi Zeng, Pinxian Zeng, Binyu Yan, Xi Wu, Jiliu Zhou, Yan Wang
2024-08-30
Abstract: Deep learning has revolutionized the early detection of breast cancer, resulting in a significant decrease in mortality rates. However, difficulties in obtaining annotations and huge variations in distribution between training sets and real scenes have limited its clinical application. Unsupervised domain adaptation (UDA) methods have been used to transfer knowledge from one labeled source domain to the unlabeled target domain, yet these approaches suffer from severe domain shift issues and often ignore the potential benefits of leveraging multiple relevant sources in practical applications. To address these limitations, we construct a Three-Branch Mixed extractor and propose a Bi-level Multi-source unsupervised domain adaptation method called BTMuda for breast cancer diagnosis. Our method tackles domain shift by dividing it into two levels: intra-domain and inter-domain. To reduce the intra-domain shift, we jointly train a CNN and a Transformer as two paths of a domain mixed feature extractor to obtain robust representations rich in both low-level local and high-level global information. As for the inter-domain shift, we carefully redesign the Transformer into a three-branch architecture with cross-attention and distillation, which learns domain-invariant representations from multiple domains. In addition, we introduce two alignment modules, one for feature alignment and one for classifier alignment, to improve the alignment process. Extensive experiments conducted on three public mammographic datasets demonstrate that our BTMuda outperforms state-of-the-art methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the domain shift caused by differences in data distribution in breast cancer diagnosis. When the training set differs substantially from the data seen in actual use, the clinical application of deep-learning models is limited. The paper focuses on unsupervised domain adaptation (UDA) with multi-source data, where the aim is to transfer knowledge from multiple labeled source domains to an unlabeled target domain. Existing UDA methods usually use data from only a single source domain, ignoring the potential benefits of multiple related source domains, and they struggle with severe cross-domain differences. To overcome these problems, the authors construct a Three-Branch Mixed extractor and propose a Bi-level Multi-source unsupervised domain adaptation method (BTMuda) for breast cancer diagnosis. The method divides domain shift into two levels: intra-domain shift and inter-domain shift. For intra-domain shift, it jointly trains a CNN and a Transformer as two paths of a domain-mixed feature extractor to obtain a robust representation that contains both low-level local information and high-level global information. For inter-domain shift, the authors redesign the Transformer into a three-branch architecture with cross-attention and distillation, thereby learning domain-invariant representations from multiple domains. In addition, two alignment modules, a feature alignment module and a classifier alignment module, are introduced to improve the alignment process. Extensive experiments on three publicly available mammographic datasets show that BTMuda outperforms current state-of-the-art methods.
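As a rough illustration of the cross-attention idea mentioned above, the NumPy sketch below lets tokens from one domain attend to tokens from another. It is not the authors' implementation: the function name `cross_attention`, the random projection matrices, the token counts, and the choice of source tokens as queries are all assumptions made for illustration, and the paper's three-branch design, distillation, and alignment modules are not reproduced here.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(query_tokens, context_tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product cross-attention (illustrative only).

    query_tokens:   (n_q, d) tokens from one domain (assumed: a source domain)
    context_tokens: (n_c, d) tokens from another domain (assumed: the target)
    w_q, w_k, w_v:  (d, d) projection matrices (hypothetical, random here)
    """
    q = query_tokens @ w_q
    k = context_tokens @ w_k
    v = context_tokens @ w_v
    # Each query token scores every context token.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = softmax(scores, axis=-1)
    # Each output row is a weighted mix of context values.
    return weights @ v, weights


rng = np.random.default_rng(0)
d = 8                                # hypothetical embedding size
source = rng.normal(size=(5, d))     # 5 hypothetical source-domain tokens
target = rng.normal(size=(7, d))     # 7 hypothetical target-domain tokens
w_q, w_k, w_v = (rng.normal(size=(d, d)) for _ in range(3))

mixed, weights = cross_attention(source, target, w_q, w_k, w_v)
```

Here `mixed` has one row per source token, each expressed as a combination of target-domain values, which is the basic mechanism by which a cross-attention branch can blend information across domains.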