Feature Partitioning for Efficient Multi-Task Architectures

Alejandro Newell, Lu Jiang, Chong Wang, Li-Jia Li, Jia Deng
DOI: https://doi.org/10.48550/arXiv.1908.04339
2019-08-13
Abstract: Multi-task learning holds the promise of less data, parameters, and time than training of separate models. We propose a method to automatically search over multi-task architectures while taking resource constraints into consideration. We propose a search space that compactly represents different parameter sharing strategies. This provides more effective coverage and sampling of the space of multi-task architectures. We also present a method for quick evaluation of different architectures by using feature distillation. Together these contributions allow us to quickly optimize for efficient multi-task models. We benchmark on Visual Decathlon, demonstrating that we can automatically search for and identify multi-task architectures that effectively make trade-offs between task resource requirements while achieving a high level of final performance.
Machine Learning, Computer Vision and Pattern Recognition