Hacking Task Confounder in Meta-Learning

Jingyao Wang,Yi Ren,Zeen Song,Jianqi Zhang,Changwen Zheng,Wenwen Qiang

2024-05-29

Abstract:Meta-learning enables rapid generalization to new tasks by learning knowledge from various tasks. It is intuitively assumed that as the training progresses, a model will acquire richer knowledge, leading to better generalization performance. However, our experiments reveal an unexpected result: there is negative knowledge transfer between tasks, affecting generalization performance. To explain this phenomenon, we conduct Structural Causal Models (SCMs) for causal analysis. Our investigation uncovers the presence of spurious correlations between task-specific causal factors and labels in meta-learning. Furthermore, the confounding factors differ across different batches. We refer to these confounding factors as "Task Confounders". Based on these findings, we propose a plug-and-play Meta-learning Causal Representation Learner (MetaCRL) to eliminate task confounders. It encodes decoupled generating factors from multiple tasks and utilizes an invariant-based bi-level optimization mechanism to ensure their causality for meta-learning. Extensive experiments on various benchmark datasets demonstrate that our work achieves state-of-the-art (SOTA) performance.

Machine Learning

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the existence of negative knowledge transfer among tasks in meta - learning, which affects the generalization performance of the model on unseen tasks. Specifically, through experiments, the paper found that as the training progresses, the knowledge learned from the training tasks may be harmful to the unseen test tasks, leading to a decline in generalization performance. To explain this phenomenon, the author used causal theory for analysis and discovered spurious correlations between task - specific causal factors and labels. These correlations are called "task confounders". These confounders vary in different batches and further affect the generalization ability of the model. To solve this problem, the author proposed a plug - in meta - learning causal representation learner (Meta - learning Causal Representation Learner, MetaCRL), aiming to eliminate task confounders and thus improve the generalization performance of the model. MetaCRL contains two modules: a disentangling module and a causal module. The disentangling module is used to extract generative factors and eliminate task confounders, while the causal module ensures the causal relationships of these generative factors. Through these two modules, MetaCRL can dynamically eliminate task confounders during the meta - training process, significantly improving the performance of the model on various benchmark datasets.

Hacking Task Confounder in Meta-Learning

The Role of Deconfounding in Meta-learning

Rethinking Meta-Learning from a Learning Lens

Exploring Logically Dependent Multi task Learning with Causal Inference

A Meta Understanding of Meta-Learning

ConML: A Universal Meta-Learning Framework with Task-Level Contrastive Learning

Meta-Learning with Heterogeneous Tasks

Clustered Task-Aware Meta-Learning by Learning From Learning Paths

Set-based Meta-Interpolation for Few-Task Meta-Learning

ST-MAML: A Stochastic-Task based Method for Task-Heterogeneous Meta-Learning

Task-level Relations Modelling for Graph Meta-learning.

Towards Well-Generalizing Meta-Learning Via Adversarial Task Augmentation

The meta-learning method for the ensemble model based on situational meta-task

Meta Multi-Task Learning for Sequence Modeling.

MAML2: meta reinforcement learning via meta-learning for task categories

Revisiting Meta-Learning As Supervised Learning

On the Stability-Plasticity Dilemma in Continual Meta-Learning: Theory and Algorithm.

Meta-Learning Multi-task Communication.

Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models

Task Agnostic Continual Learning via Meta Learning

A Unified Causal View of Instruction Tuning