Rethinking Meta-Learning from a Learning Lens

Jingyao Wang, Wenwen Qiang, Jiangmeng Li, Lingyu Si, Changwen Zheng
2024-09-13
Abstract: Meta-learning has emerged as a powerful approach for leveraging knowledge from previous tasks to solve new tasks. Mainstream methods focus on training a well-generalized model initialization, which is then adapted to different tasks with limited data and few updates. However, this approach tends to push the model toward overfitting on the training tasks. Previous methods mainly attributed this to a lack of data and used augmentation to address it, but they are limited by their need for sufficient training and effective augmentation strategies. In this work, we focus on the more fundamental "learning to learn" strategy of meta-learning to explore what causes errors and how to eliminate them without changing the environment. Specifically, we first rethink the algorithmic procedure of meta-learning from a "learning" lens. Through theoretical and empirical analyses, we find that (i) this paradigm faces the risk of both overfitting and underfitting, and (ii) models adapted to different tasks promote each other, with a stronger effect when the tasks are more similar. Based on this insight, we propose using task relations to calibrate the optimization process of meta-learning, and we introduce a plug-and-play method called Task Relation Learner (TRLearner) to achieve this goal. TRLearner first obtains task relation matrices from the extracted task-specific meta-data. It then uses the obtained matrices with relation-aware consistency regularization to guide optimization. Extensive theoretical and empirical analyses demonstrate the effectiveness of TRLearner.
Machine Learning, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the overfitting and underfitting that meta-learning faces when dealing with new tasks. Specifically:

1. **Problem Background**:
   - Meta-learning methods, especially optimization-based methods (such as MAML), can adapt quickly to new tasks, but they tend to overfit when data is limited and may underfit on complex tasks.
   - Previous research mainly alleviated overfitting through data augmentation or by constraining the covariance matrix. These methods require a large amount of training data and effective augmentation strategies, which increases training costs, and they perform poorly in practical applications.
2. **Core Issue**:
   - The paper re-examines meta-learning strategies from the perspective of "learning," exploring the causes of meta-learning errors and proposing a method to eliminate these errors without changing the environment (e.g., data) or the model structure.
3. **Solution**:
   - The paper proposes a new method called Task Relation Learner (TRLearner), which uses the relationships between tasks to calibrate the optimization process of meta-learning.
   - TRLearner computes a task relation matrix from extracted task-specific meta-data and introduces a relation-aware consistency regularization term to guide the optimization process.
4. **Theoretical Analysis and Empirical Results**:
   - Theoretically, the paper proves that using task relations can narrow the gap between training tasks and testing tasks, thereby reducing excess risk.
   - Experimental results show that introducing the task relation matrix M into meta-learning can improve the model's generalization ability.

In summary, the paper aims to improve the generalization ability and robustness of meta-learning models by introducing inter-task relationships, thereby overcoming the overfitting and underfitting issues of traditional meta-learning methods.
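The two steps attributed to TRLearner above (build a task relation matrix, then regularize with it) can be illustrated with a minimal NumPy sketch. This is not the paper's formulation: the function names `task_relation_matrix` and `relation_consistency_penalty` are hypothetical, cosine similarity between per-task embeddings is an assumed stand-in for however the relation matrix M is actually learned, and the pairwise squared-difference penalty is only one plausible reading of "consistency regularization."

```python
import numpy as np


def task_relation_matrix(task_embeddings: np.ndarray) -> np.ndarray:
    """Return a (T, T) cosine-similarity matrix for T task embeddings of shape (T, D).

    Assumption: each task is summarized by one meta-data vector. The summary
    does not say how the paper extracts or compares task meta-data.
    """
    norms = np.linalg.norm(task_embeddings, axis=1, keepdims=True)
    unit = task_embeddings / np.clip(norms, 1e-12, None)  # guard against zero vectors
    return unit @ unit.T


def relation_consistency_penalty(predictions: np.ndarray, relations: np.ndarray) -> float:
    """Penalize disagreement between task-adapted models, weighted by task similarity.

    predictions: (T, N, C) outputs of T task-adapted models on the same N inputs.
    relations:   (T, T) task relation matrix.
    Assumption: negative similarities are clipped to zero and self-pairs are ignored.
    """
    weights = np.clip(relations, 0.0, None).copy()
    np.fill_diagonal(weights, 0.0)

    # Pairwise differences via broadcasting: (T, 1, N, C) - (1, T, N, C) -> (T, T, N, C)
    diffs = predictions[:, None, :, :] - predictions[None, :, :, :]
    pair_dist = np.mean(np.sum(diffs ** 2, axis=-1), axis=-1)  # (T, T)

    total = weights.sum()
    if total == 0.0:
        return 0.0
    return float((weights * pair_dist).sum() / total)


# Toy usage: tasks 0 and 1 share an embedding, task 2 is orthogonal to both.
embeddings = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
M = task_relation_matrix(embeddings)
preds = np.array([[[1.0, 0.0]], [[0.0, 1.0]], [[0.5, 0.5]]])  # (T=3, N=1, C=2)
print(relation_consistency_penalty(preds, M))
```

In a training loop, a penalty of this kind would typically be added to the meta-objective with a weighting coefficient, so that models adapted to similar tasks are pulled toward agreeing predictions while dissimilar tasks are left unconstrained.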