Multi-task learning based on geometric invariance discriminative features

Yitong Liu,Lei Huang,Jie Li,Wenfeng Zhang,Yanxiu Sheng,Zhiqiang Wei
DOI: https://doi.org/10.1007/s10489-022-03617-x
IF: 5.3
2023-01-12
Applied Intelligence
Abstract:Multi-task learning (MTL) aims at tackling multiple tasks through one single network while guarantee all tasks can reach good performance. The main challenge in MTL is how to extract task-specific feature effectively. Existing task-specific feature extraction methods predominantly combine and stack convolutional neural networks (CNNs). However, these methods ignore two points: the geometric variations of the target object have different effects on each task; discriminative features for each task lack a mechanism to ensure they are focused on. In this work, we propose a Deformable-Attention Multi-Task Network (DAMTN) to improve the capability of extracting geometric invariance discriminative features. In particular, deformable convolution is introduced to learn geometric variation rules of the target object for different tasks, and attention mechanism helps task-specific networks focus on discriminative parts. The proposed DAMTN can be trained end-to-end. We empirically analyze the contribution of different components in the proposed method and demonstrate state-of-the-art performance on multiple classification tasks as well as semantic segmentation task and depth estimation task.
computer science, artificial intelligence
What problem does this paper attempt to address?