A Teacher-Student Approach to Cross-Domain Transfer Learning with Multi-level Attention

Ying-Jhe Tang,Hen-Hsen Huang
DOI: https://doi.org/10.1145/3486622.3494009
2021-12-14
Abstract:The lack of training data forms a challenging issue for applying NLP models in a new domain. Previous work on cross-domain transfer learning aims to exploit the information from the source domains to do prediction for the target domain. To reduce the noises from the out-of-domain data and improve the model’s generalization ability, this work proposes a novel teacher-student approach with multi-task learning that transfers the information from source domains to the target domain with sophisticated weights determined by using attention mechanism at both the instance level and the domain level. The generalization ability is further enhanced by unsupervised data augmentation. We also introduce a subject detection task for co-training the main model. Our approach is evaluated not only on the widely-adopted English dataset, Amazon product reviews, but also on Chinese datasets including product reviews and the discussions about pop musicians. Experimental results show that our approach outperforms state-of-the-art models.
What problem does this paper attempt to address?