Source-free Semantic Regularization Learning for Semi-supervised Domain Adaptation

Xinyang Huang, Chuang Zhu, Ruiying Ren, Shengjie Liu, Tiejun Huang
2025-01-02
Abstract: Semi-supervised domain adaptation (SSDA) has been extensively researched due to its ability to improve classification performance and generalization ability of models by using a small amount of labeled data on the target domain. However, existing methods cannot effectively adapt to the target domain due to difficulty in fully learning rich and complex target semantic information and relationships. In this paper, we propose a novel SSDA learning framework called semantic regularization learning (SERL), which captures the target semantic information from multiple perspectives of regularization learning to achieve adaptive fine-tuning of the source pre-trained model on the target domain. SERL includes three robust semantic regularization techniques. Firstly, semantic probability contrastive regularization (SPCR) helps the model learn more discriminative feature representations from a probabilistic perspective, using semantic information on the target domain to understand the similarities and differences between samples. Additionally, adaptive weights in SPCR can help the model learn the semantic distribution correctly through the probabilities of different samples. To further comprehensively understand the target semantic distribution, we introduce hard-sample mixup regularization (HMR), which uses easy samples as guidance to mine the latent target knowledge contained in hard samples, thereby learning more complete and complex target semantic knowledge. Finally, target prediction regularization (TPR) regularizes the target predictions of the model by maximizing the correlation between the current prediction and the past learned objective, thereby mitigating the misleading of semantic information caused by erroneous pseudo-labels. Extensive experiments on three benchmark datasets demonstrate that our SERL method achieves state-of-the-art performance.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper addresses a key problem in semi-supervised domain adaptation (SSDA): **how to use a small amount of labeled data and a large amount of unlabeled data in the target domain to learn complex target semantic information and achieve better domain alignment**. Specifically, existing SSDA methods face three challenges:

1. **Insufficient learning of target semantic information**:
   - Existing methods struggle to capture the rich semantic information in the target domain and the relationships between samples, so the model performs poorly on the target domain.
2. **Feature representation biased towards the source domain**:
   - Because the source domain has a large amount of labeled data, the model's feature representation is easily biased towards the source domain, which hurts its generalization on the target domain.
3. **Impact of pseudo-label noise**:
   - Pseudo-labels in the target domain may be noisy, causing the model to learn wrong semantic information and degrading its performance.

To solve these problems, the authors propose a new framework, **Semantic Regularization Learning (SERL)**. SERL strengthens the model's learning of target-domain semantic information through three regularization techniques:

- **Semantic Probability Contrastive Regularization (SPCR)**:
  - Helps the model learn more discriminative feature representations from a probabilistic perspective, and uses adaptive weights to reduce the influence of low-confidence samples.
- **Hard-sample Mixup Regularization (HMR)**:
  - Uses easy samples as guidance to mine the latent knowledge in hard samples, helping the model learn more complete and complex semantic distributions.
- **Target Prediction Regularization (TPR)**:
  - Regularizes target predictions by maximizing the correlation between the current prediction and the past learned objective, which reduces the misleading effect of erroneous pseudo-labels.

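
To make the hard-sample mixup idea more concrete, here is a minimal sketch in plain Python. It is not the paper's implementation. It assumes that "easy" and "hard" samples are separated by a confidence threshold on the model's predicted probabilities, and that each hard sample is blended with a randomly chosen easy sample using a standard mixup convex combination. The names `split_by_confidence`, `mixup`, and `hard_sample_mixup`, the threshold of 0.9, and the Beta(`alpha`, `alpha`) mixing coefficient are all illustrative assumptions.

```python
import random
from typing import List, Optional, Sequence, Tuple

Vector = List[float]


def split_by_confidence(
    probs: Sequence[Sequence[float]], threshold: float = 0.9
) -> Tuple[List[int], List[int]]:
    """Return (easy_indices, hard_indices) based on the max predicted probability.

    Assumption: a sample counts as "easy" when the model is confident about it.
    """
    easy: List[int] = []
    hard: List[int] = []
    for i, p in enumerate(probs):
        (easy if max(p) >= threshold else hard).append(i)
    return easy, hard


def mixup(a: Sequence[float], b: Sequence[float], lam: float) -> Vector:
    """Standard mixup: the convex combination lam * a + (1 - lam) * b."""
    return [lam * x + (1.0 - lam) * y for x, y in zip(a, b)]


def hard_sample_mixup(
    inputs: Sequence[Sequence[float]],
    probs: Sequence[Sequence[float]],
    threshold: float = 0.9,
    alpha: float = 1.0,
    rng: Optional[random.Random] = None,
) -> List[Tuple[Vector, Vector, float]]:
    """Blend every hard sample with a randomly chosen easy sample.

    Returns a list of (mixed_input, mixed_soft_target, lam) tuples.
    Hypothetical design choice: lam >= 0.5, so the easy sample dominates
    the mixture and acts as the guide for the hard one.
    """
    rng = rng or random.Random(0)
    easy, hard = split_by_confidence(probs, threshold)
    if not easy:
        return []  # no confident samples available to guide the hard ones
    mixed = []
    for h in hard:
        e = rng.choice(easy)
        lam = rng.betavariate(alpha, alpha)
        lam = max(lam, 1.0 - lam)
        mixed.append((mixup(inputs[e], inputs[h], lam),
                      mixup(probs[e], probs[h], lam),
                      lam))
    return mixed
```

In a training loop, the mixed inputs would be passed through the model and its output compared with the mixed soft targets. How the paper selects hard samples and weights this term may differ from the sketch.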
These techniques work together to help the model adapt to the actual distribution of the target domain and improve its classification performance and generalization ability.

### Summary

The main contribution of this paper is SERL, a new SSDA framework that uses semantic regularization strategies to exploit the semantic information of the target domain. It addresses the shortcomings of existing methods in target-domain semantic learning and thereby achieves more effective cross-domain adaptation.
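
The target prediction regularization idea described above (keeping the current prediction correlated with what the model learned earlier) can be illustrated with a small sketch. This is an assumption-laden illustration and not the paper's formulation: it treats the "past learned objective" as an exponential moving average of a sample's earlier predictions, and uses the inner product between the current prediction and that average as a stand-in for correlation. `PredictionMemory`, `agreement_loss`, and the `momentum` value are hypothetical names and settings.

```python
import math
from typing import Dict, List, Sequence


class PredictionMemory:
    """Stores an exponential moving average of past predictions per sample."""

    def __init__(self, momentum: float = 0.9) -> None:
        self.momentum = momentum
        self.targets: Dict[int, List[float]] = {}

    def update(self, sample_id: int, probs: Sequence[float]) -> List[float]:
        """Fold the current prediction into the running target and return it."""
        old = self.targets.get(sample_id)
        if old is None:
            new = list(probs)  # first visit: the target is the prediction itself
        else:
            m = self.momentum
            new = [m * o + (1.0 - m) * p for o, p in zip(old, probs)]
        self.targets[sample_id] = new
        return new


def agreement_loss(
    probs: Sequence[float], target: Sequence[float], eps: float = 1e-12
) -> float:
    """Negative log inner product between the prediction and the running target.

    Minimizing this value maximizes the agreement between the two distributions.
    """
    dot = sum(p * t for p, t in zip(probs, target))
    return -math.log(max(dot, eps))
```

Because the running target changes slowly, a single noisy pseudo-label moves it only a little, which is the intuition behind damping the effect of erroneous pseudo-labels. The exact objective used in the paper should be taken from the paper itself.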