Abstract:In multi-task learning, a major challenge springs from a notorious issue known as negative transfer, which refers to the phenomenon that sharing the knowledge with dissimilar and hard tasks often results in a worsened performance. To circumvent this issue, we propose a novel multi-task learning method, which simultaneously learns latent task representations and a block-diagonal Latent Task Assignment Matrix (LTAM). Different from most of the previous work, pursuing the Block-Diagonal structure of LTAM (assigning latent tasks to output tasks) alleviates negative transfer via collaboratively grouping latent tasks and output tasks such that inter-group knowledge transfer and sharing is suppressed. This goal is challenging, since 1) our notion of Block-Diagonal Property extends the traditional notion for square matrices where the $i$-th column and the $i$-th column represents the same concept; 2) marginal constraints on rows and columns are also required for avoiding isolated latent/output tasks. Facing such challenges, we propose a novel regularizer by means of an equivalent spectral condition realizing this generalized block-diagonal property. Practically, we provide a relaxation scheme which improves the flexibility of the model. With the objective function given, we then propose an alternating optimization method, which not only tells how negative transfer is alleviated in our method but also reveals an interesting connection between our method and the optimal transport problem. Finally, the method is demonstrated on a simulation dataset, three real-world benchmark datasets and further applied to personalized attribute predictions.

Characterizing and Avoiding Negative Transfer

A Survey on Negative Transfer

Negative Transfer Detection in Transductive Transfer Learning

Online Transfer Learning: Negative Transfer and Effect of Prior Knowledge

Improving transfer learning in cross lingual opinion analysis through negative transfer detection

Restoring Latent Factors Against Negative Transfer Using Partial-Adaptation Nonnegative Matrix Factorization.

Towards A Unified Understanding and Improving of Adversarial Transferability

Target Domain Data induces Negative Transfer in Mixed Domain Training with Disjoint Classes

Cross-lingual Opinion Analysis Via Negative Transfer Detection.

Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment Against Negative Transfer.

Mitigating Negative Transfer with Task Awareness for Sexism, Hate Speech, and Toxic Language Detection

Subgraph Pooling: Tackling Negative Transfer on Graphs

A Unified Approach to Interpreting and Boosting Adversarial Transferability

Loss-Balanced Task Weighting to Reduce Negative Transfer in Multi-Task Learning

A Bayesian Approach to (Online) Transfer Learning: Theory and Algorithms

Catastrophic Forgetting Meets Negative Transfer: Batch Spectral Shrinkage for Safe Transfer Learning.

Improving Transfer Learning by Introspective Reasoner.

Towards All-around Knowledge Transferring: Learning From Task-irrelevant Labels

Multi-Task Distillation: Towards Mitigating the Negative Transfer in Multi-Task Learning

Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning

Phase Transitions in Transfer Learning for High-Dimensional Perceptrons