Collaborative Teaching with Attention Distillation for Multiple Cross-Domain Few-Shot Learning

Zhenduo Shang,Xiyao Liu,Xing Xie,Zhi Han
DOI: https://doi.org/10.1109/cyber63482.2024.10749467
2024-01-01
Abstract:Multiple Cross-Domain Few-Shot Learning (MCD-FSL) aims to improve the generalization ability of the model across unseen domains by utilizing the diverse knowledge of different teacher networks. Knowledge transferring among multiple domains is complex and difficult due to significantly different data distributions. In current tasks, effectively utilizing knowledge from multiple domains to improve model generalization in an unseen domain remains challenging. Therefore, we propose Collaborative Teaching with Attention Distillation (CTAD-Net), exploring a collaborative strategy among multiple teacher networks of teachers to address this issue. Specifically, we introduce a weight allocation (WA) module and a deep fusion (DF) module for distilling knowledge from the teacher networks to a single student network. Additionally, our CTAD-Net combines attention mechanisms and employs both response-based and relation-based knowledge distillation to transfer more comprehensive and effective knowledge. The experimental results on four fine-grained datasets have demonstrated the effectiveness of our proposed CTAD-Net approach.
What problem does this paper attempt to address?