Worst-Case-Aware Curriculum Learning for Zero and Few Shot Transfer

Sheng Zhang,Xin Zhang,Weiming Zhang,Anders Søgaard
DOI: https://doi.org/10.48550/arXiv.2009.11138
2020-09-23
Abstract:Multi-task transfer learning based on pre-trained language encoders achieves state-of-the-art performance across a range of tasks. Standard approaches implicitly assume the tasks, for which we have training data, are equally representative of the tasks we are interested in, an assumption which is often hard to justify. This paper presents a more agnostic approach to multi-task transfer learning, which uses automated curriculum learning to minimize a new family of worst-case-aware losses across tasks. Not only do these losses lead to better performance on outlier tasks; they also lead to better performance in zero-shot and few-shot transfer settings.
Computation and Language,Machine Learning
What problem does this paper attempt to address?