T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler.

Yuanqiang Yu,Tianpei Yang,Yongliang Lv,Yan Zheng,Jianye Hao
DOI: https://doi.org/10.1109/ijcnn54540.2023.10191536
2023-01-01
Abstract:Multi-task reinforcement learning (MTRL) is a technique to train multiple tasks simultaneously, where previous works usually train a single model to solve different tasks by sharing parameters across various tasks. However, these methods are faced with inter-task interference since what parameters should be shared across tasks is not addressed, dramatically reducing learning efficiency. To solve these problems, we propose a novel MTRL framework called Task-Specific feature Selector and Scheduler (T3S), which consists of two components: a feature selector and a task scheduler. Specifically, the feature selectors employ hypernetworks to construct task-specific soft masks, which can be applied by globally shared representation to construct task-specific features. The task scheduler selects tasks for learning through two metrics, where the selection probability is inversely proportional to task progress (e.g., success rate) and task learning speed. Experimental results show that T3S consistently outperforms the state-of-the-art MTRL algorithms on various robotics manipulation tasks.
What problem does this paper attempt to address?