Step-wise Distribution Alignment Guided Style Prompt Tuning for Source-free Cross-domain Few-shot Learning

Huali Xu,Yongxiang Liu,Li Liu,Shuaifeng Zhi,Shuzhou Sun,Tianpeng Liu,MingMing Cheng
2024-11-15
Abstract:Existing cross-domain few-shot learning (CDFSL) methods, which develop source-domain training strategies to enhance model transferability, face challenges with large-scale pre-trained models (LMs) due to inaccessible source data and training strategies. Moreover, fine-tuning LMs for CDFSL demands substantial computational resources, limiting practicality. This paper addresses the source-free CDFSL (SF-CDFSL) problem, tackling few-shot learning (FSL) in the target domain using only pre-trained models and a few target samples without source data or strategies. To overcome the challenge of inaccessible source data, this paper introduces Step-wise Distribution Alignment Guided Style Prompt Tuning (StepSPT), which implicitly narrows domain gaps through prediction distribution optimization. StepSPT proposes a style prompt to align target samples with the desired distribution and adopts a dual-phase optimization process. In the external process, a step-wise distribution alignment strategy factorizes prediction distribution optimization into a multi-step alignment problem to tune the style prompt. In the internal process, the classifier is updated using standard cross-entropy loss. Evaluations on five datasets demonstrate that StepSPT outperforms existing prompt tuning-based methods and SOTAs. Ablation studies further verify its effectiveness. Code will be made publicly available at \url{<a class="link-external link-https" href="https://github.com/xuhuali-mxj/StepSPT" rel="external noopener nofollow">this https URL</a>}.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is cross - domain few - shot learning (SF - CDFSL) when the source data is inaccessible. Specifically, existing cross - domain few - shot learning (CDFSL) methods usually require the data and training strategies of the source domain to enhance the transfer ability of the model, but these methods face challenges when applied to large - scale pre - trained models (LMs) because the source - domain data and training strategies of these models are usually inaccessible. In addition, fine - tuning these large models for CDFSL tasks requires a large amount of computing resources, which limits their practical applications. Therefore, this paper studies the source - free cross - domain few - shot learning (SF - CDFSL) problem, aiming to solve the few - shot learning tasks in the target domain without accessing the source data and training strategies, using only pre - trained models and a small number of target samples. However, because the source data cannot be accessed, it becomes impossible to explicitly reduce the gap between the source domain and the target domain. To address this challenge, this paper proposes a novel method - Step - by - Step Distribution Alignment - Guided Style Prompt Tuning (StepSPT) to implicitly narrow the domain gap from the perspective of prediction - distribution optimization. **The main contributions of StepSPT are as follows**: 1. **Problem transformation and theoretical analysis**: This paper transforms the domain - alignment challenge between the source domain and the target domain in SF - CDFSL into a target - distribution - optimization problem, and provides theoretical analysis and guidance to solve this distribution - optimization problem. 2. **Method proposal**: The Step - by - Step Distribution Alignment - Guided Style Prompt Tuning (StepSPT) method is proposed to solve the SF - CDFSL problem. StepSPT introduces a style prompt to adjust the target distribution and adopts a two - stage optimization process, including an outer and an inner stage, in which the style prompt and classifier parameters are updated alternately. In the outer stage, the target distribution is gradually made closer to the ideal distribution through a multi - step distribution - alignment strategy; in the inner stage, the classifier is updated through a standard meta - training strategy while keeping the prompt parameters fixed, helping the model adapt to the specific features of the target domain. 3. **Performance verification**: Extensive evaluations on 5 datasets show that the proposed StepSPT outperforms existing prompt - tuning - based methods and other state - of - the - art methods. Detailed ablation studies further prove the effectiveness of each component. In conclusion, by introducing StepSPT, this paper provides a new method for effectively solving the cross - domain few - shot learning problem without accessing the source data, which has important theoretical and practical significance.