An Adaptive Node Network Based on Multitask Deep Learning

Hongxia Wang, Xiao Jin, Yukun Du, Nan Zhang
DOI: https://doi.org/10.21203/rs.3.rs-2901753/v1
2023-01-01
Abstract: Multitask learning (MTL) improves the performance on each task by exploiting relevant information shared between tasks. At present, most mainstream deep MTL models are based on hard parameter sharing, which reduces the risk of overfitting. However, negative knowledge transfer may occur and hinder the performance improvement on individual tasks. In this paper, for settings in which multiple tasks are trained jointly, we propose a deep MTL method with adaptive nodes. Building on the hard parameter sharing architecture, the method dynamically updates the number of nodes in the network by setting a continuous gradient-difference-based sign threshold and a warm-up training iteration threshold, both derived from the relationship between the parameters and the loss function. After each task has fully utilized the shared information, adaptive nodes are used to further optimize each task, reducing the impact of negative transfer. Through theoretical proof, simulation, and case studies, we demonstrate that the proposed method performs better than the competing approach. MSC: 68T07; 68W99
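The abstract does not give the update rule itself, so the following is only a rough illustration of how a sign threshold and a warm-up threshold could interact, not the authors' algorithm. It assumes two tasks, reads "continuous" as "consecutive" training steps, and treats a shared node as a candidate for task-specific adaptation once the two tasks' gradients on it have disagreed in sign for enough steps in a row. The function names, the parameters `warmup_steps` and `sign_threshold`, and the toy gradient values are all hypothetical.

```python
def update_conflict_counts(counts, grads_a, grads_b):
    """Per shared node, count consecutive steps on which the two tasks'
    gradients have opposite signs (assumed conflict criterion)."""
    new_counts = []
    for count, ga, gb in zip(counts, grads_a, grads_b):
        conflict = ga * gb < 0          # opposite signs on this node
        new_counts.append(count + 1 if conflict else 0)
    return new_counts


def nodes_to_adapt(counts, step, warmup_steps, sign_threshold):
    """Return indices of nodes flagged for adaptation. Nothing is flagged
    during warm-up, so every task first trains on the shared nodes."""
    if step < warmup_steps:
        return []
    return [i for i, c in enumerate(counts) if c >= sign_threshold]


# Toy gradients for 3 shared nodes over 4 training steps (made-up values).
task_a = [[0.5, -0.2, 0.1], [0.4, -0.1, 0.2], [0.3, -0.3, 0.1], [0.2, -0.2, 0.3]]
task_b = [[0.1, 0.3, 0.2], [0.2, 0.2, -0.1], [0.1, 0.4, 0.2], [0.3, 0.1, 0.1]]

counts = [0, 0, 0]
flagged_per_step = []
for step, (ga, gb) in enumerate(zip(task_a, task_b)):
    counts = update_conflict_counts(counts, ga, gb)
    flagged_per_step.append(
        nodes_to_adapt(counts, step, warmup_steps=2, sign_threshold=3)
    )

print(flagged_per_step)
```

In this toy run only the node whose gradients conflict on every step is eventually flagged. A node with a single isolated conflict has its counter reset and is never flagged, which is the sense in which a run-length threshold filters out noise.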